Skip to Content

21,972 Results Found

  • Article
  • Open Access
38 Citations
5,599 Views
16 Pages

Framework for the Ensemble of Feature Selection Methods

  • Maritza Mera-Gaona,
  • Diego M. López,
  • Rubiel Vargas-Canas and
  • Ursula Neumann

1 September 2021

Feature selection (FS) has attracted the attention of many researchers in the last few years due to the increasing sizes of datasets, which contain hundreds or thousands of columns (features). Typically, not all columns represent relevant values. Con...

  • Article
  • Open Access
211 Citations
13,141 Views
8 Pages

4 May 2020

As datasets continue to increase in size, it is important to select the optimal feature subset from the original dataset to obtain the best performance in machine learning tasks. Highly dimensional datasets that have an excessive number of features c...

  • Article
  • Open Access
22 Citations
4,802 Views
24 Pages

Enhancing Big Data Feature Selection Using a Hybrid Correlation-Based Feature Selection

  • Masurah Mohamad,
  • Ali Selamat,
  • Ondrej Krejcar,
  • Ruben Gonzalez Crespo,
  • Enrique Herrera-Viedma and
  • Hamido Fujita

30 November 2021

This study proposes an alternate data extraction method that combines three well-known feature selection methods for handling large and problematic datasets: the correlation-based feature selection (CFS), best first search (BFS), and dominance-based...

  • Article
  • Open Access
38 Citations
8,765 Views
17 Pages

2 July 2019

Feature interaction is a newly proposed feature relevance relationship, but the unintentional removal of interactive features can result in poor classification performance for this relationship. However, traditional feature selection algorithms mainl...

  • Article
  • Open Access
6 Citations
3,233 Views
22 Pages

On the Relationship between Feature Selection Metrics and Accuracy

  • Elise Epstein,
  • Naren Nallapareddy and
  • Soumya Ray

11 December 2023

Feature selection metrics are commonly used in the machine learning pipeline to rank and select features before creating a predictive model. While many different metrics have been proposed for feature selection, final models are often evaluated by ac...

  • Article
  • Open Access
1 Citations
1,303 Views
21 Pages

Dual-Regularized Feature Selection for Class-Specific and Global Feature Associations

  • Chenchen Wang,
  • Jun Wang,
  • Yanfei Li,
  • Chengkai Piao and
  • Jinmao Wei

13 February 2025

Understanding feature associations is vital for selecting the most informative features. Existing methods primarily focus on global feature associations, which capture overall relationships across all samples. However, they often overlook class-speci...

  • Article
  • Open Access
1,116 Views
20 Pages

PWFS: Probability-Weighted Feature Selection

  • Mehmet B. Ayanoglu and
  • Ismail Uysal

Feature selection has been a fundamental research area for both conventional and contemporary machine learning since the beginning of predictive analytics. From early statistical methods, such as principal component analysis, to more recent and data-...

  • Article
  • Open Access
43 Citations
5,544 Views
18 Pages

Feature selection for multiple types of data has been widely applied in mild cognitive impairment (MCI) and Alzheimer’s disease (AD) classification research. Combining multi-modal data for classification can better realize the complementarity o...

  • Article
  • Open Access
16 Citations
4,242 Views
16 Pages

10 January 2020

Feature selection is a way of reducing the features of data such that, when the classification algorithm runs, it produces better accuracy. In general, conventional feature selection is quite unstable when faced with changing data characteristics. It...

  • Article
  • Open Access
17 Citations
3,582 Views
24 Pages

15 February 2019

Multi-label text classification refers to a text divided into multiple categories simultaneously, which corresponds to a text associated with multiple topics in the real world. The feature space generated by text data has the characteristics of high...

  • Proceeding Paper
  • Open Access
3 Citations
2,521 Views
3 Pages

Feature Selection in Big Image Datasets

  • J. Guzmán Figueira-Domínguez,
  • Verónica Bolón-Canedo and
  • Beatriz Remeseiro

In computer vision, current feature extraction techniques generate high dimensional data. Both convolutional neural networks and traditional approaches like keypoint detectors are used as extractors of high-level features. However, the resulting data...

  • Review
  • Open Access
93 Citations
26,427 Views
26 Pages

Radiomics and Its Feature Selection: A Review

  • Wenchao Zhang,
  • Yu Guo and
  • Qiyu Jin

27 September 2023

Medical imaging plays an indispensable role in evaluating, predicting, and monitoring a range of medical conditions. Radiomics, a specialized branch of medical imaging, utilizes quantitative features extracted from medical images to describe underlyi...

  • Article
  • Open Access
16 Citations
8,535 Views
26 Pages

15 November 2016

Multi-label feature selection is designed to select a subset of features according to their importance to multiple labels. This task can be achieved by ranking the dependencies of features and selecting the features with the highest rankings. In a mu...

  • Article
  • Open Access
5 Citations
1,837 Views
24 Pages

18 November 2024

In multi-label data, a sample is associated with multiple labels at the same time, and the computational complexity is manifested in the high-dimensional feature space as well as the interdependence and unbalanced distribution of labels, which leads...

  • Article
  • Open Access
6 Citations
2,009 Views
18 Pages

7 March 2023

Axis-orbit recognition is an essential means for the fault diagnosis of hydropower units. An axis-orbit recognition method based on feature combination and feature selection is proposed, aiming to solve the problems of the low recognition accuracy, p...

  • Article
  • Open Access
1 Citations
1,300 Views
15 Pages

22 November 2024

Effective gene feature selection is critical for enhancing the interpretability and accuracy of genetic data analysis, particularly in the realm of disease prediction and precision medicine. Most evolutionary feature selection algorithms tend to beco...

  • Article
  • Open Access
19 Citations
5,690 Views
28 Pages

Ontology-Based Feature Selection: A Survey

  • Konstantinos Sikelis,
  • George E. Tsekouras and
  • Konstantinos Kotis

The Semantic Web emerged as an extension to the traditional Web, adding meaning (semantics) to a distributed Web of structured and linked information. At its core, the concept of ontology provides the means to semantically describe and structure info...

  • Review
  • Open Access
16 Citations
5,136 Views
29 Pages

Mathematical Methods in Feature Selection: A Review

  • Firuz Kamalov,
  • Hana Sulieman,
  • Ayman Alzaatreh,
  • Maher Emarly,
  • Hasna Chamlal and
  • Murodbek Safaraliev

18 March 2025

Feature selection is essential in machine learning and data science. Recently, there has been a growing effort to apply various mathematical methods to construct novel feature selection algorithms. In this study, we present a comprehensive state-of-t...

  • Review
  • Open Access
10 Citations
8,125 Views
25 Pages

Causality, Machine Learning, and Feature Selection: A Survey

  • Asmae Lamsaf,
  • Rui Carrilho,
  • João C. Neves and
  • Hugo Proença

9 April 2025

Causality, which involves distinguishing between cause and effect, is essential for understanding complex relationships in data. This paper provides a review of causality in two key areas: causal discovery and causal inference. Causal discovery trans...

  • Article
  • Open Access
48 Citations
11,624 Views
22 Pages

31 March 2021

This paper proposes different classification algorithms—logistic regression, support vector machine, K-nearest neighbors, and random forest—in order to identify which candidates are likely to default for a credit scoring model. Three different featur...

  • Article
  • Open Access
663 Views
27 Pages

Feature selection is essential for enhancing classification accuracy, reducing overfitting, and improving interpretability in high-dimensional datasets. Evolutionary Feature Selection (EFS) methods employ a threshold parameter θ to decide featu...

  • Article
  • Open Access
4 Citations
3,175 Views
25 Pages

8 August 2024

In the field of geomatics, artificial intelligence (AI) and especially machine learning (ML) are rapidly transforming the field of geomatics with respect to collecting, managing, and analyzing spatial data. Feature selection as a building block in ML...

  • Article
  • Open Access
89 Citations
11,965 Views
14 Pages

Many biological or medical data have numerous features. Feature selection is one of the data preprocessing steps that can remove the noise from data as well as save the computing time when the dataset has several hundred thousand or more features. An...

  • Article
  • Open Access
3 Citations
5,030 Views
20 Pages

Feature Selection based on the Local Lift Dependence Scale

  • Diego Marcondes,
  • Adilson Simonis and
  • Junior Barrera

30 January 2018

This paper uses a classical approach to feature selection: minimization of a cost function applied on estimated joint distributions. However, in this new formulation, the optimization search space is extended. The original search space is the Boolean...

  • Article
  • Open Access
328 Views
26 Pages

Feature Selection Using Nearest Neighbor Gaussian Processes

  • Konstantin Posch,
  • Maximilian Arbeiter,
  • Christian Truden,
  • Martin Pleschberger and
  • Jürgen Pilz

29 January 2026

We introduce a novel Bayesian approach for feature (variable) selection using Gaussian process regression, which is crucial for enhancing interpretability and model regularization. Our method employs nearest neighbor Gaussian processes as scalable ap...

  • Article
  • Open Access
6 Citations
2,714 Views
24 Pages

Manifold Feature Fusion with Dynamical Feature Selection for Cross-Subject Emotion Recognition

  • Yue Hua,
  • Xiaolong Zhong,
  • Bingxue Zhang,
  • Zhong Yin and
  • Jianhua Zhang

23 October 2021

Affective computing systems can decode cortical activities to facilitate emotional human–computer interaction. However, personalities exist in neurophysiological responses among different users of the brain–computer interface leads to a difficulty fo...

  • Article
  • Open Access
10 Citations
3,299 Views
19 Pages

25 November 2019

In recent years, there has been a growing interest in the problem of multi-label streaming feature selection with no prior knowledge of the feature space. However, the algorithms proposed to handle this problem seldom consider the group structure of...

  • Article
  • Open Access
3 Citations
2,833 Views
19 Pages

Feature Selection Methods for Extreme Learning Machines

  • Yanlin Fu,
  • Qing Wu,
  • Ke Liu and
  • Haotian Gao

30 August 2022

Extreme learning machines (ELMs) have gained acceptance owing to their high efficiency and outstanding generalization ability. As a key component of data preprocessing, feature selection methods can decrease the noise or irrelevant data for ELMs. How...

  • Article
  • Open Access
1,846 Views
23 Pages

Comparison of Feature Selection Methods—Modelling COPD Outcomes

  • Jorge Cabral,
  • Pedro Macedo,
  • Alda Marques and
  • Vera Afreixo

Selecting features associated with patient-centered outcomes is of major relevance yet the importance given depends on the method. We aimed to compare stepwise selection, least absolute shrinkage and selection operator, random forest, Boruta, extreme...

  • Article
  • Open Access
3 Citations
2,710 Views
19 Pages

17 July 2023

Multi-label streaming feature selection has received widespread attention in recent years because the dynamic acquisition of features is more in line with the needs of practical application scenarios. Most previous methods either assume that the labe...

  • Article
  • Open Access
19 Citations
7,979 Views
16 Pages

1 April 2017

We propose a novel feature selection method based on quadratic mutual information which has its roots in Cauchy–Schwarz divergence and Renyi entropy. The method uses the direct estimation of quadratic mutual information from data samples using Gaussi...

  • Article
  • Open Access
12 Citations
4,575 Views
16 Pages

A Feature Selection Algorithm Performance Metric for Comparative Analysis

  • Werner Mostert,
  • Katherine M. Malan and
  • Andries P. Engelbrecht

22 March 2021

This study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value...

  • Article
  • Open Access
1,979 Views
20 Pages

A Model for Feature Selection with Binary Particle Swarm Optimisation and Synthetic Features

  • Samuel Olusegun Ojo,
  • Juliana Adeola Adisa,
  • Pius Adewale Owolawi and
  • Chunling Tu

25 July 2024

Recognising patterns and inferring nonlinearities between data that are seemingly random and stochastic in nature is one of the strong suites of machine learning models. Given a set of features, the ability to distinguish between useful features and...

  • Article
  • Open Access
10 Citations
3,511 Views
24 Pages

Online Streaming Feature Selection via Conditional Independence

  • Dianlong You,
  • Xindong Wu,
  • Limin Shen,
  • Yi He,
  • Xu Yuan,
  • Zhen Chen,
  • Song Deng and
  • Chuan Ma

8 December 2018

Online feature selection is a challenging topic in data mining. It aims to reduce the dimensionality of streaming features by removing irrelevant and redundant features in real time. Existing works, such as Alpha-investing and Online Streaming Featur...

  • Article
  • Open Access
14 Citations
4,053 Views
18 Pages

29 August 2020

This paper proposes a feature selection (FS) approach, namely, correlation and fitness value-based feature selection (CFFS). CFFS is an improvement feature selection approach of correlation-based feature selection (CFS) for the common failure cases o...

  • Article
  • Open Access
20 Citations
5,300 Views
20 Pages

Feature Selection on 2D and 3D Geometric Features to Improve Facial Expression Recognition

  • Vianney Perez-Gomez,
  • Homero V. Rios-Figueroa,
  • Ericka Janet Rechy-Ramirez,
  • Efrén Mezura-Montes and
  • Antonio Marin-Hernandez

27 August 2020

An essential aspect in the interaction between people and computers is the recognition of facial expressions. A key issue in this process is to select relevant features to classify facial expressions accurately. This study examines the selection of o...

  • Feature Paper
  • Article
  • Open Access
57 Citations
7,724 Views
22 Pages

Feature Selection for Recommender Systems with Quantum Computing

  • Riccardo Nembrini,
  • Maurizio Ferrari Dacrema and
  • Paolo Cremonesi

28 July 2021

The promise of quantum computing to open new unexplored possibilities in several scientific fields has been long discussed, but until recently the lack of a functional quantum computer has confined this discussion mostly to theoretical algorithmic pa...

  • Article
  • Open Access
24 Citations
5,229 Views
21 Pages

A Bootstrap Framework for Aggregating within and between Feature Selection Methods

  • Reem Salman,
  • Ayman Alzaatreh,
  • Hana Sulieman and
  • Shaimaa Faisal

6 February 2021

In the past decade, big data has become increasingly prevalent in a large number of applications. As a result, datasets suffering from noise and redundancy issues have necessitated the use of feature selection across multiple domains. However, a comm...

  • Article
  • Open Access
9 Citations
3,547 Views
24 Pages

SA-FEM: Combined Feature Selection and Feature Fusion for Students’ Performance Prediction

  • Mingtao Ye,
  • Xin Sheng,
  • Yanjie Lu,
  • Guodao Zhang,
  • Huiling Chen,
  • Bo Jiang,
  • Senhao Zou and
  • Liting Dai

15 November 2022

Around the world, the COVID-19 pandemic has created significant obstacles for education, driving people to discover workarounds to maintain education. Because of the excellent benefit of cheap-cost information distribution brought about by the advent...

  • Article
  • Open Access
27 Citations
7,570 Views
16 Pages

21 March 2022

Insurance is a data-rich sector, hosting large volumes of customer data that is analysed to evaluate risk. Machine learning techniques are increasingly used in the effective management of insurance risk. Insurance datasets by their nature, however, a...

  • Article
  • Open Access
47 Citations
5,784 Views
15 Pages

In the field of machine learning, a considerable amount of research is involved in the interpretability of models and their decisions. The interpretability contradicts the model quality. Random Forests are among the best quality technologies of machi...

  • Article
  • Open Access
41 Citations
8,856 Views
17 Pages

Finger Vein Recognition with Personalized Feature Selection

  • Xiaoming Xi,
  • Gongping Yang,
  • Yilong Yin and
  • Xianjing Meng

22 August 2013

Finger veins are a promising biometric pattern for personalized identification in terms of their advantages over existing biometrics. Based on the spatial pyramid representation and the combination of more effective information such as gray, texture...

  • Article
  • Open Access
6 Citations
3,154 Views
16 Pages

Stability of Feature Selection in Multi-Omics Data Analysis

  • Tomasz Łukaszuk,
  • Jerzy Krawczuk,
  • Kamil Żyła and
  • Jacek Kęsik

28 November 2024

In the rapidly evolving field of multi-omics data analysis, understanding the stability of feature selection is critical for reliable biomarker discovery and clinical applications. This study investigates the stability of feature-selection methods ac...

  • Article
  • Open Access
44 Citations
5,448 Views
13 Pages

An Experimental Comparison of Feature-Selection and Classification Methods for Microarray Datasets

  • Nicole Dalia Cilia,
  • Claudio De Stefano,
  • Francesco Fontanella,
  • Stefano Raimondo and
  • Alessandra Scotto di Freca

10 March 2019

In the last decade, there has been a growing scientific interest in the analysis of DNA microarray datasets, which have been widely used in basic and translational cancer research. The application fields include both the identification of oncological...

  • Feature Paper
  • Article
  • Open Access
9 Citations
5,458 Views
10 Pages

21 January 2019

Feature selection aims to select the smallest feature subset that yields the minimum generalization error. In the rich literature in feature selection, information theory-based approaches seek a subset of features such that the mutual information bet...

  • Article
  • Open Access
32 Citations
4,712 Views
17 Pages

Binary Competitive Swarm Optimizer Approaches for Feature Selection

  • Jingwei Too,
  • Abdul Rahim Abdullah and
  • Norhashimah Mohd Saad

Feature selection is known as an NP-hard combinatorial problem in which the possible feature subsets increase exponentially with the number of features. Due to the increment of the feature size, the exhaustive search has become impractical. In additi...

  • Article
  • Open Access
3 Citations
4,749 Views
14 Pages

7 August 2019

Analysis of high-dimensional data is a challenge in machine learning and data mining. Feature selection plays an important role in dealing with high-dimensional data for improvement of predictive accuracy, as well as better interpretation of the data...

  • Article
  • Open Access
2 Citations
2,535 Views
23 Pages

Enhanced Feature Selection via Hierarchical Concept Modeling

  • Jarunee Saelee,
  • Patsita Wetchapram,
  • Apirat Wanichsombat,
  • Arthit Intarasit,
  • Jirapond Muangprathub,
  • Laor Boongasame and
  • Boonyarit Choopradit

26 November 2024

The objectives of feature selection include simplifying modeling and making the results more understandable, improving data mining efficiency, and providing clean and understandable data preparation. With big data, it also allows us to reduce computa...

  • Article
  • Open Access
1 Citations
2,107 Views
18 Pages

Gain-Loss Evaluation-Based Generic Selection for Steganalysis Feature

  • Ruixia Jin,
  • Yihao Wang,
  • Yuanyuan Ma,
  • Tao Li and
  • Xintao Duan

24 September 2021

Fewer contribution feature components in the image high-dimensional steganalysis feature are able to increase the spatio-temporal complexity of detecting the stego images, and even reduce the detection accuracy. In order to maintain or even improve t...

  • Article
  • Open Access
11 Citations
3,765 Views
18 Pages

An Improved Artificial Bee Colony for Feature Selection in QSAR

  • Yanhong Lin,
  • Jing Wang,
  • Xiaolin Li,
  • Yuanzi Zhang and
  • Shiguo Huang

9 April 2021

Quantitative Structure–Activity Relationship (QSAR) aims to correlate molecular structure properties with corresponding bioactivity. Chance correlations and multicollinearity are two major problems often encountered when generating QSAR models. Featu...

of 440