Reprint

Statistical Data Modeling and Machine Learning with Applications

Edited by
December 2021
184 pages
  • ISBN978-3-0365-2692-8 (Hardback)
  • ISBN978-3-0365-2693-5 (PDF)

This book is a reprint of the Special Issue Statistical Data Modeling and Machine Learning with Applications that was published in

Computer Science & Mathematics
Engineering
Physical Sciences
Public Health & Healthcare
Summary

The modeling and processing of empirical data is one of the main subjects and goals of statistics. Nowadays, with the development of computer science, the extraction of useful and often hidden information and patterns from data sets of different volumes and complex data sets in warehouses has been added to these goals. New and powerful statistical techniques with machine learning (ML) and data mining paradigms have been developed. To one degree or another, all of these techniques and algorithms originate from a rigorous mathematical basis, including probability theory and mathematical statistics, operational research, mathematical analysis, numerical methods, etc. Popular ML methods, such as artificial neural networks (ANN), support vector machines (SVM), decision trees, random forest (RF), among others, have generated models that can be considered as straightforward applications of optimization theory and statistical estimation. The wide arsenal of classical statistical approaches combined with powerful ML techniques allows many challenging and practical problems to be solved.

This Special Issue belongs to the section “Mathematics and Computer Science”. Its aim is to establish a brief collection of carefully selected papers presenting new and original methods, data analyses, case studies, comparative studies, and other research on the topic of statistical data modeling and ML as well as their applications. Particular attention is given, but is not limited, to theories and applications in diverse areas such as computer science, medicine, engineering, banking, education, sociology, economics, among others.

Format
  • Hardback
License
© 2022 by the authors; CC BY-NC-ND license
Keywords
mathematical competency; assessment; machine learning; classification and regression tree; CART ensembles and bagging; ensemble model; multivariate adaptive regression splines; cross-validation; dam inflow prediction; long short-term memory; wavelet transform; input predictor selection; hyper-parameter optimization; brain-computer interface; EEG motor imagery; CNN-LSTM architectures; real-time motion imagery recognition; artificial neural networks; banking; hedonic prices; housing; quantile regression; data quality; citizen science; consensus models; clustering; Gower’s interpolation formula; Gower’s metric; mixed data; multidimensional scaling; classification; data-adaptive kernel functions; image data; multi-category classifier; predictive models; support vector machine; stochastic gradient descent; damped Newton; convexity; METABRIC dataset; breast cancer subtyping; deep forest; multi-omics data; machine learning; categorical data; similarity; feature selection; kernel density estimation; non-linear optimization; kernel clustering; n/a