
Deep Learning and Its Applications in Natural Language Processing

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: 20 February 2026

Special Issue Editors


Dr. Caren Han
Guest Editor
1. School of Computing and Information Systems, University of Melbourne, Melbourne, Australia
2. School of Computer Science, University of Sydney, Sydney, Australia
Interests: natural language processing; multimodal learning

Dr. Josiah Poon
Guest Editor
School of Computer Science, University of Sydney, Sydney, Australia
Interests: natural language processing; medical text mining

Dr. Siwen Luo
Guest Editor Assistant
School of Physics, Mathematics and Computing, Computer Science and Software Engineering, University of Western Australia, Perth, Australia
Interests: natural language processing; large language models

Dr. Yihao Ding
Guest Editor Assistant
School of Computing and Information Systems, University of Melbourne, Melbourne, Australia
Interests: natural language processing; visually rich document understanding

Special Issue Information

Dear Colleagues,

This Special Issue, entitled “Deep Learning and Its Applications in Natural Language Processing”, showcases the latest advancements and innovative techniques at the intersection of deep learning and NLP. In recent years, deep learning has revolutionized the way machines understand and generate human language, unlocking new capabilities across NLP tasks such as machine translation, text summarization, sentiment analysis, and conversational agents. The Special Issue explores both theoretical developments and practical applications, with an emphasis on emerging models such as transformers, large language models, and multimodal architectures. We welcome contributions that address challenges such as model interpretability, bias mitigation, and multilingual understanding, and we encourage submissions on novel methods for data augmentation, domain adaptation, and task-specific fine-tuning. By gathering insights from leading researchers, this Special Issue aims to provide an in-depth perspective on the evolving role of deep learning in NLP and to contribute to the development of future AI-driven language technologies.

Dr. Caren Han
Dr. Josiah Poon
Guest Editors

Dr. Siwen Luo
Dr. Yihao Ding
Guest Editor Assistants

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles, and short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • deep learning
  • natural language processing (NLP)
  • large language models (LLMs)
  • multimodal models
  • text generation
  • NLP applications
  • financial NLP
  • medical NLP

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (1 paper)


Research

17 pages, 1467 KiB  
Article
Confidence-Based Knowledge Distillation to Reduce Training Costs and Carbon Footprint for Low-Resource Neural Machine Translation
by Maria Zafar, Patrick J. Wall, Souhail Bakkali and Rejwanul Haque
Appl. Sci. 2025, 15(14), 8091; https://doi.org/10.3390/app15148091 - 21 Jul 2025
Abstract
The transformer-based deep learning approach represents the current state of the art in machine translation (MT) research. Large-scale pretrained transformer models produce state-of-the-art performance across a wide range of MT tasks for many languages. However, such deep neural network (NN) models are often data-, compute-, space-, power-, and energy-hungry, typically requiring powerful GPUs or large-scale clusters to train and deploy. As a result, they are often regarded as “non-green” and “unsustainable” technologies. Distilling knowledge from large deep NN models (teachers) to smaller NN models (students) is a widely adopted sustainable development approach in MT as well as in broader areas of natural language processing (NLP), including speech and image processing. However, distilling large pretrained models presents several challenges. First, training time and cost scale with the volume of data used to train a student model, which can pose a challenge for translation service providers (TSPs) with limited training budgets. Moreover, the CO2 emissions generated during model training are typically proportional to the amount of data used, contributing to environmental harm. Second, when querying teacher models, including encoder–decoder models such as NLLB, the translations they produce for low-resource languages may be noisy or of low quality. This can undermine sequence-level knowledge distillation (SKD), as student models may inherit and reinforce errors from inaccurate labels. In this study, the teacher model’s confidence estimation is employed to filter from the distilled training data those instances for which the teacher exhibits low confidence. We tested our methods on a low-resource Urdu-to-English translation task operating within a constrained training budget in an industrial translation setting. Our findings show that confidence estimation-based filtering can significantly reduce the cost and CO2 emissions associated with training a student model without a drop in translation quality, making it a practical and environmentally sustainable solution for TSPs. Full article
(This article belongs to the Special Issue Deep Learning and Its Applications in Natural Language Processing)
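
The filtering idea described in the abstract is compact enough to sketch. Below is a minimal, hypothetical illustration of confidence-based filtering for sequence-level distillation data: the teacher translates each source sentence, the mean token log-probability of its output serves as a confidence score, and low-confidence pairs are discarded before student training. This is not the authors' implementation; the teacher checkpoint, language codes, and threshold value are assumptions made for the sketch.

```python
# Hypothetical sketch of confidence-filtered sequence-level knowledge
# distillation (SKD) data construction. The teacher model, language codes,
# and threshold below are illustrative assumptions, not the paper's setup.
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

MODEL = "facebook/nllb-200-distilled-600M"  # assumed teacher; the paper mentions NLLB
tok = AutoTokenizer.from_pretrained(MODEL, src_lang="urd_Arab")  # Urdu source
teacher = AutoModelForSeq2SeqLM.from_pretrained(MODEL).eval()

def translate_with_confidence(src: str) -> tuple[str, float]:
    """Translate one sentence; return (hypothesis, mean token log-probability)."""
    inputs = tok(src, return_tensors="pt")
    with torch.no_grad():
        out = teacher.generate(
            **inputs,
            forced_bos_token_id=tok.convert_tokens_to_ids("eng_Latn"),  # English target
            return_dict_in_generate=True,
            output_scores=True,
            max_new_tokens=128,
        )
    # Log-probability the teacher assigned to each generated token.
    token_logprobs = teacher.compute_transition_scores(
        out.sequences, out.scores, normalize_logits=True
    )
    confidence = token_logprobs[0].mean().item()
    hyp = tok.decode(out.sequences[0], skip_special_tokens=True)
    return hyp, confidence

def build_distilled_corpus(sources: list[str], threshold: float = -1.0):
    """Keep only (source, teacher translation) pairs above the confidence threshold."""
    kept = []
    for src in sources:
        hyp, conf = translate_with_confidence(src)
        if conf >= threshold:  # threshold is an assumed hyperparameter
            kept.append((src, hyp))
    return kept
```

In practice, the threshold would be tuned on a held-out set against the available training budget; the paper's actual confidence estimation and filtering criteria may differ from this sketch.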