Next Article in Journal
A Multi-Modality Deep Network for Cold-Start Recommendation
Previous Article in Journal
A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization
Previous Article in Special Issue
Reimaging Research Methodology as Data Science
Open AccessArticle

A Rule Extraction Study from SVM on Sentiment Analysis

by Guido Bologna 1,2,*,†,‡ and Yoichi Hayashi 3,‡
1
Department of Computer Science, University of Applied Sciences and Arts of Western Switzerland, Rue de la Prairie 4, 1202 Geneva, Switzerland
2
Department of Computer Science, University of Geneva, Route de Drize 7, 1227 Carouge, Switzerland
3
Department of Computer Science, Meiji University, Tama-ku, Kawasaki Kanagawa 214-8571, Japan
*
Author to whom correspondence should be addressed.
Current Address: University of Applied Sciences and Arts of Western Switzerland, Rue de la Prairie 4, 1202 Geneva, Switzerland.
These authors contributed equally to this work.
Big Data Cogn. Comput. 2018, 2(1), 6; https://doi.org/10.3390/bdcc2010006
Received: 15 December 2017 / Revised: 16 February 2018 / Accepted: 28 February 2018 / Published: 2 March 2018
(This article belongs to the Special Issue Big Data Analytic: From Accuracy to Interpretability)
A natural way to determine the knowledge embedded within connectionist models is to generate symbolic rules. Nevertheless, extracting rules from Multi Layer Perceptrons (MLPs) is NP-hard. With the advent of social networks, techniques applied to Sentiment Analysis show a growing interest, but rule extraction from connectionist models in this context has been rarely performed because of the very high dimensionality of the input space. To fill the gap we present a case study on rule extraction from ensembles of Neural Networks and Support Vector Machines (SVMs), the purpose being the characterization of the complexity of the rules on two particular Sentiment Analysis problems. Our rule extraction method is based on a special Multi Layer Perceptron architecture for which axis-parallel hyperplanes are precisely located. Two datasets representing movie reviews are transformed into Bag-of-Words vectors and learned by ensembles of neural networks and SVMs. Generated rules from ensembles of MLPs are less accurate and less complex than those extracted from SVMs. Moreover, a clear trade-off appears between rules’ accuracy, complexity and covering. For instance, if rules are too complex, less complex rules can be re-extracted by sacrificing to some extent their accuracy. Finally, rules can be viewed as feature detectors in which very often only one word must be present and a longer list of words must be absent. View Full-Text
Keywords: rule extraction; Support Vector Machines; ensembles; sentiment analysis rule extraction; Support Vector Machines; ensembles; sentiment analysis
Show Figures

Figure 1

MDPI and ACS Style

Bologna, G.; Hayashi, Y. A Rule Extraction Study from SVM on Sentiment Analysis. Big Data Cogn. Comput. 2018, 2, 6.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop