applsci-logo

Journal Browser

Journal Browser

Text Mining with Information Extraction: Latest Advances and Prospects

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: 20 July 2026 | Viewed by 328

Special Issue Editors


E-Mail Website
Guest Editor
Decision Support Systems Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, Iroon Polytechniou 9, 15780 Athens, Greece
Interests: social media analysis; AI and health; biomedical text mining; computational social science; text mining

E-Mail Website
Guest Editor
Division of Industrial Electric Devices and Decision Systems, National Technical University of Athens, Athens, Greece
Interests: decision support systems; e-government; interoperability; cybersecurity

E-Mail Website
Guest Editor
Decision Support Systems Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, Iroon Polytechniou 9, 15780 Athens, Greece
Interests: artificial intelligence; big data; analytics; decision support systems; data engineering
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

In the era of data deluge, text mining and information extraction have become critical tools for transforming unstructured textual data into actionable knowledge. With the rapid evolution of machine learning, natural language processing (NLP), and large language models (LLMs), the synergy between text mining and information extraction has entered a transformative phase.

This Special Issue aims to showcase cutting-edge research, methodologies, and applications at the intersection of text mining and information extraction. We invite contributions that explore innovative approaches, theoretical foundations, and practical implementations that push the boundaries of what is possible in this fast-evolving field.

From large-scale scientific corpora and biomedical studies to social media, legal documents, and news articles, text mining with information extraction is proving essential across numerous real-world applications. These technologies are driving innovation in both academic research and industry by enabling the discovery of hidden patterns, the automation of knowledge curation, and the extraction of actionable insights from complex and noisy textual data. This Special Issue encourages submissions that explore diverse, domain-specific use cases, as well as those that address the methodological and computational challenges involved in scaling and adapting information extraction systems across various contexts.

Topics of interest include (but are not limited to) the following:

  • Novel algorithms and models for information extraction;
  • Integration of LLMs in text mining pipelines;
  • Named entity recognition and relation extraction;
  • Event detection and temporal information extraction;
  • Multilingual and cross-lingual information extraction;
  • Information extraction from scientific research, legal texts, clinical notes, or social media;
  • Text mining for knowledge base population and ontology learning;
  • Applications of information extraction in domains such as healthcare, finance, security, or education;
  • Benchmarking and evaluation of text mining/information extraction systems;
  • Ethics, bias, and fairness in text mining and information extraction.

Dr. Loukas Ilias
Dr. Dimitrios Askounis
Dr. Spiros Mouzakitis
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 250 words) can be sent to the Editorial Office for assessment.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • text mining
  • information extraction
  • natural language processing (NLP)
  • named entity recognition (NER)
  • relation extraction
  • large language models (LLMs)
  • knowledge discovery
  • unstructured data analysis
  • domain-specific IE applications
  • multilingual text processing

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (1 paper)

Order results
Result details
Select all
Export citation of selected articles as:

Research

25 pages, 2732 KB  
Article
Irony and Sarcasm Detection in Turkish Texts: A Comparative Study of Transformer-Based Models and Ensemble Learning
by Murat Eser and Metin Bilgin
Appl. Sci. 2025, 15(23), 12498; https://doi.org/10.3390/app152312498 - 25 Nov 2025
Viewed by 169
Abstract
Irony and sarcasm are forms of expression that emphasize the inconsistency between what is said and what is meant. Correctly classifying such expressions is an important text mining problem, especially on user-centered platforms such as social media. Due to the increasing prevalence of [...] Read more.
Irony and sarcasm are forms of expression that emphasize the inconsistency between what is said and what is meant. Correctly classifying such expressions is an important text mining problem, especially on user-centered platforms such as social media. Due to the increasing prevalence of implicit expressions, this topic has become a significant area of research in Natural Language Processing (NLP). However, the simultaneous detection of ironic and sarcastic expressions is highly challenging, as both types of implicit sentiments often convey closely related meanings. To address the detection of irony and sarcasm, this study compares the performance of transformer-based models and an ensemble learning method on Turkish texts, using five textual datasets—monogram, bigram, trigram, quadrigram, and omnigram—that share the same textual content but differ in context length. To improve classification performance, an ensemble learning approach based on the Artificial Rabbit Optimization (ARO) algorithm was implemented, combining the outputs of the models to produce final predictions. The experimental results indicate that as the context width of the datasets increases, the models achieve better predictions, leading to improvements across all performance metrics. The ensemble learning method outperformed individual models in all metrics, with performance increasing as the context expanded, achieving the highest success in the omnigram dataset with 76.71% accuracy, 74.64% precision, 73.29% sensitivity, and 73.96% F-Score. This study demonstrates that both model architecture and data structure are decisive factors in text classification performance, showing that community methods can make significant contributions to the effectiveness of deep learning solutions in low-resource languages. Full article
Show Figures

Figure 1

Back to TopTop