Next Article in Journal
A New Crop Spectral Signatures Database Interactive Tool (CSSIT)
Next Article in Special Issue
Graph Theoretic and Pearson Correlation-Based Discovery of Network Biomarkers for Cancer
Previous Article in Journal
A Novel Hybrid Model for Stock Price Forecasting Based on Metaheuristics and Support Vector Machine
Previous Article in Special Issue
Isolation, Characterization, and Agent-Based Modeling of Mesenchymal Stem Cells in a Bio-construct for Myocardial Regeneration Scaffold Design
Article Menu
Issue 2 (June) cover image

Export Article

Open AccessArticle

Ensemble Based Classification of Sentiments Using Forest Optimization Algorithm

1
Department of Computer Science; National University of Computer and Emerging Sciences, Lahore 54770, Pakistan
2
School of Science and Technology; University of Management and Technology, Lahore 54782, Pakistan
*
Author to whom correspondence should be addressed.
Received: 8 April 2019 / Revised: 20 May 2019 / Accepted: 21 May 2019 / Published: 23 May 2019
  |  
PDF [925 KB, uploaded 26 June 2019]
  |  

Abstract

Feature subset selection is a process to choose a set of relevant features from a high dimensionality dataset to improve the performance of classifiers. The meaningful words extracted from data forms a set of features for sentiment analysis. Many evolutionary algorithms, like the Genetic Algorithm (GA) and Particle Swarm Optimization (PSO), have been applied to feature subset selection problem and computational performance can still be improved. This research presents a solution to feature subset selection problem for classification of sentiments using ensemble-based classifiers. It consists of a hybrid technique of minimum redundancy and maximum relevance (mRMR) and Forest Optimization Algorithm (FOA)-based feature selection. Ensemble-based classification is implemented to optimize the results of individual classifiers. The Forest Optimization Algorithm as a feature selection technique has been applied to various classification datasets from the UCI machine learning repository. The classifiers used for ensemble methods for UCI repository datasets are the k-Nearest Neighbor (k-NN) and Naïve Bayes (NB). For the classification of sentiments, 15–20% improvement has been recorded. The dataset used for classification of sentiments is Blitzer’s dataset consisting of reviews of electronic products. The results are further improved by ensemble of k-NN, NB, and Support Vector Machine (SVM) with an accuracy of 95% for the classification of sentiment tasks. View Full-Text
Keywords: feature subset selection; classification; ensemble; evolutionary algorithms; data mining; sentiment analysis feature subset selection; classification; ensemble; evolutionary algorithms; data mining; sentiment analysis
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Naz, M.; Zafar, K.; Khan, A. Ensemble Based Classification of Sentiments Using Forest Optimization Algorithm. Data 2019, 4, 76.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Data EISSN 2306-5729 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top