Next Article in Journal
On the Connection between Spherical Laplace Transform and Non-Euclidean Fourier Analysis
Previous Article in Journal
Solving Robust Variants of the Maximum Weighted Independent Set Problem on Trees
Open AccessArticle

A New K-Nearest Neighbors Classifier for Big Data Based on Efficient Data Pruning

1
Computer Engineering Department, Faculty of Engineering, University of Birjand, Birjand 9717434765, Iran
2
Institute of Structural Mechanics, Bauhaus Universität Weimar, 99423 Weimar, Germany
3
Kalman Kando Faculty of Electrical Engineering, Obuda University, 1034 Budapest, Hungary
4
Faculty of Health, Queensland University of Technology, 130 Victoria Park Road, Kelvin Grove, QLD 4059, Australia
5
Department of Mathematics and Informatics, J. Selye University, 94501 Komarno, Slovakia
6
Department for Management of Science and Technology Development, Ton Duc Thang University, Ho Chi Minh City, Vietnam
7
Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
*
Author to whom correspondence should be addressed.
Mathematics 2020, 8(2), 286; https://doi.org/10.3390/math8020286
Received: 26 November 2019 / Revised: 26 December 2019 / Accepted: 8 January 2020 / Published: 20 February 2020
The K-nearest neighbors (KNN) machine learning algorithm is a well-known non-parametric classification method. However, like other traditional data mining methods, applying it on big data comes with computational challenges. Indeed, KNN determines the class of a new sample based on the class of its nearest neighbors; however, identifying the neighbors in a large amount of data imposes a large computational cost so that it is no longer applicable by a single computing machine. One of the proposed techniques to make classification methods applicable on large datasets is pruning. LC-KNN is an improved KNN method which first clusters the data into some smaller partitions using the K-means clustering method; and then applies the KNN for each new sample on the partition which its center is the nearest one. However, because the clusters have different shapes and densities, selection of the appropriate cluster is a challenge. In this paper, an approach has been proposed to improve the pruning phase of the LC-KNN method by taking into account these factors. The proposed approach helps to choose a more appropriate cluster of data for looking for the neighbors, thus, increasing the classification accuracy. The performance of the proposed approach is evaluated on different real datasets. The experimental results show the effectiveness of the proposed approach and its higher classification accuracy and lower time cost in comparison to other recent relevant methods. View Full-Text
Keywords: K-nearest neighbors; KNN; classifier; machine learning; big data; clustering; cluster shape; cluster density; classification; reinforcement learning; machine learning for big data; data science; computation; artificial intelligence K-nearest neighbors; KNN; classifier; machine learning; big data; clustering; cluster shape; cluster density; classification; reinforcement learning; machine learning for big data; data science; computation; artificial intelligence
Show Figures

Figure 1

MDPI and ACS Style

Saadatfar, H.; Khosravi, S.; Joloudari, J.H.; Mosavi, A.; Shamshirband, S. A New K-Nearest Neighbors Classifier for Big Data Based on Efficient Data Pruning. Mathematics 2020, 8, 286. https://doi.org/10.3390/math8020286

AMA Style

Saadatfar H, Khosravi S, Joloudari JH, Mosavi A, Shamshirband S. A New K-Nearest Neighbors Classifier for Big Data Based on Efficient Data Pruning. Mathematics. 2020; 8(2):286. https://doi.org/10.3390/math8020286

Chicago/Turabian Style

Saadatfar, Hamid; Khosravi, Samiyeh; Joloudari, Javad H.; Mosavi, Amir; Shamshirband, Shahaboddin. 2020. "A New K-Nearest Neighbors Classifier for Big Data Based on Efficient Data Pruning" Mathematics 8, no. 2: 286. https://doi.org/10.3390/math8020286

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Search more from Scilit
 
Search
Back to TopTop