Open Access Article
Information 2018, 9(2), 28

Experimental Analysis of Stemming on Jurisprudential Documents Retrieval

Departamento de Computação, Universidade Federal de Sergipe—UFS, São Cristóvão/SE 49100-000, Brazil
This paper is an extended version of our paper published in the ICEIS 2017: 19th International Conference on Enterprise Information Systems, Porto, Portugal, 26–29 April 2017.
Author to whom correspondence should be addressed.
Received: 3 January 2018 / Revised: 24 January 2018 / Accepted: 25 January 2018 / Published: 27 January 2018
(This article belongs to the Special Issue Information Technology: New Generations (ITNG 2017))


Stemming algorithms are commonly used during the text preprocessing phase to reduce data dimensionality. However, the effectiveness of this reduction varies with the domain to which it is applied. For instance, the literature reports the effect of stemming on dictionaries and on news text collections. We have not, however, found any study analyzing the impact of stemming on Brazilian judicial jurisprudence, which consists of decisions handed down by the judiciary and is a fundamental instrument for legal professionals in their work. This work therefore presents two complete experiments, reporting the results of analyzing and evaluating stemmers applied to real jurisprudential documents from the Court of Justice of the State of Sergipe. In the first experiment, the results showed that, among the analyzed algorithms, RSLP (Removedor de Sufixos da Língua Portuguesa) achieved the greatest dimensionality reduction. In the second, which evaluated the stemming algorithms on legal document retrieval, the less aggressive stemmers RSLP-S (Removedor de Sufixos da Língua Portuguesa Singular) and UniNE (University of Neuchâtel) presented the best cost-benefit ratio, since they reduced the dimensionality of the data and improved the information retrieval evaluation metrics in one of the analyzed collections.
Keywords: experimental software engineering; judicial documents; dimensionality reduction; jurisprudence
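
The notion of dimensionality reduction through stemming mentioned in the abstract can be illustrated with a minimal sketch. It assumes a toy plural-stripping rule (`toy_plural_stemmer`, a hypothetical stand-in that is not the RSLP-S or UniNE rule set) and three invented sample sentences; the helper names `vocabulary` and `reduction_ratio` are likewise illustrative and not taken from the paper. The sketch only shows how the share of distinct terms removed by a stemmer might be measured.

```python
import re


def toy_plural_stemmer(token: str) -> str:
    """Hypothetical light stemmer: drop a final 's' from tokens longer than
    three characters. This is NOT the RSLP-S or UniNE rule set."""
    if len(token) > 3 and token.endswith("s"):
        return token[:-1]
    return token


def vocabulary(docs, stem=None):
    """Return the set of distinct (optionally stemmed) terms in the documents."""
    terms = set()
    for doc in docs:
        for token in re.findall(r"\w+", doc.lower()):
            terms.add(stem(token) if stem else token)
    return terms


def reduction_ratio(docs, stem):
    """Fraction of distinct terms eliminated by applying the stemmer."""
    before = len(vocabulary(docs))
    after = len(vocabulary(docs, stem))
    return 1 - after / before


# Invented sample sentences, used only for illustration.
docs = [
    "os recursos foram julgados",
    "o recurso foi julgado",
    "as sentenças e a sentença",
]

print(f"terms before stemming: {len(vocabulary(docs))}")
print(f"terms after stemming:  {len(vocabulary(docs, toy_plural_stemmer))}")
print(f"reduction ratio:       {reduction_ratio(docs, toy_plural_stemmer):.3f}")
```

A more aggressive stemmer would typically merge more word forms and yield a higher ratio, at the risk of conflating unrelated terms, which is the trade-off the retrieval experiment examines.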


This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

MDPI and ACS Style

N. de Oliveira, R.A.; C. Junior, M. Experimental Analysis of Stemming on Jurisprudential Documents Retrieval. Information 2018, 9, 28.

Information, EISSN 2078-2489, published by MDPI AG, Basel, Switzerland