Long Length Document Classification by Local Convolutional Feature Aggregation
AbstractThe exponential increase in online reviews and recommendations makes document classification and sentiment analysis a hot topic in academic and industrial research. Traditional deep learning based document classification methods require the use of full textual information to extract features. In this paper, in order to tackle long document, we proposed three methods that use local convolutional feature aggregation to implement document classification. The first proposed method randomly draws blocks of continuous words in the full document. Each block is then fed into the convolution neural network to extract features and then are concatenated together to output the classification probability through a classifier. The second model improves the first by capturing the contextual order information of the sampled blocks with a recurrent neural network. The third model is inspired by the recurrent attention model (RAM), in which a reinforcement learning module is introduced to act as a controller for selecting the next block position based on the recurrent state. Experiments on our collected four-class arXiv paper dataset show that the three proposed models all perform well, and the RAM model achieves the best test accuracy with the least information. View Full-Text
Share & Cite This Article
Liu, L.; Liu, K.; Cong, Z.; Zhao, J.; Ji, Y.; He, J. Long Length Document Classification by Local Convolutional Feature Aggregation. Algorithms 2018, 11, 109.
Liu L, Liu K, Cong Z, Zhao J, Ji Y, He J. Long Length Document Classification by Local Convolutional Feature Aggregation. Algorithms. 2018; 11(8):109.Chicago/Turabian Style
Liu, Liu; Liu, Kaile; Cong, Zhenghai; Zhao, Jiali; Ji, Yefei; He, Jun. 2018. "Long Length Document Classification by Local Convolutional Feature Aggregation." Algorithms 11, no. 8: 109.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.