Next Article in Journal
A Review of Non-Fully Supervised Deep Learning for Medical Image Segmentation
Previous Article in Journal / Special Issue
Clinical Applicability and Cross-Dataset Validation of Machine Learning Models for Binary Glaucoma Detection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Multihead Average Pseudo-Margin Learning for Disaster Tweet Classification

by
Iustin Sîrbu
1,*,†,
Robert-Adrian Popovici
1,†,
Traian Rebedea
1,2,* and
Ștefan Trăușan-Matu
1
1
Faculty of Automatic Control and Computer Science, National University of Science and Technology POLITEHNICA Bucharest, Bucharest 060042, Romania
2
NVIDIA, Santa Clara, CA 95051, USA
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Information 2025, 16(6), 434; https://doi.org/10.3390/info16060434 (registering DOI)
Submission received: 7 April 2025 / Revised: 21 May 2025 / Accepted: 22 May 2025 / Published: 24 May 2025
(This article belongs to the Special Issue Machine Learning and Artificial Intelligence with Applications)

Abstract

During natural disasters, social media platforms, such as X (formerly Twitter), become a valuable source of real-time information, with eyewitnesses and affected individuals posting messages about the produced damage and the victims. Although this information can be used to streamline the intervention process of local authorities and to achieve a better distribution of available resources, manually annotating these messages is often infeasible due to time and cost constraints. To address this challenge, we explore the use of semi-supervised learning, a technique that leverages both labeled and unlabeled data, to enhance neural models for disaster tweet classification. Specifically, we investigate state-of-the-art semi-supervised learning models and focus on co-training, a less-explored approach in recent years. Moreover, we propose a novel hybrid co-training architecture, Multihead Average Pseudo-Margin, which obtains state-of-the-art results on several classification tasks. Our approach extends the advantages of the voting mechanism from Multihead Co-Training by using the Average Pseudo-Margin (APM) score to improve the quality of the pseudo-labels and self-adaptive confidence thresholds for improving imbalanced classification. Our method achieves up to 7.98% accuracy improvement in low-data scenarios and 2.84% improvement when using the entire labeled dataset, reaching 89.55% accuracy on the Humanitarian task and 91.23% on the Informative task. These results demonstrate the potential of our approach in addressing the critical need for automated disaster tweet classification. We made our code publicly available for future research.
Keywords: semi-supervised learning; disaster tweet classification; co-training; machine learning; multimodal learning semi-supervised learning; disaster tweet classification; co-training; machine learning; multimodal learning

Share and Cite

MDPI and ACS Style

Sîrbu, I.; Popovici, R.-A.; Rebedea, T.; Trăușan-Matu, Ș. Multihead Average Pseudo-Margin Learning for Disaster Tweet Classification. Information 2025, 16, 434. https://doi.org/10.3390/info16060434

AMA Style

Sîrbu I, Popovici R-A, Rebedea T, Trăușan-Matu Ș. Multihead Average Pseudo-Margin Learning for Disaster Tweet Classification. Information. 2025; 16(6):434. https://doi.org/10.3390/info16060434

Chicago/Turabian Style

Sîrbu, Iustin, Robert-Adrian Popovici, Traian Rebedea, and Ștefan Trăușan-Matu. 2025. "Multihead Average Pseudo-Margin Learning for Disaster Tweet Classification" Information 16, no. 6: 434. https://doi.org/10.3390/info16060434

APA Style

Sîrbu, I., Popovici, R.-A., Rebedea, T., & Trăușan-Matu, Ș. (2025). Multihead Average Pseudo-Margin Learning for Disaster Tweet Classification. Information, 16(6), 434. https://doi.org/10.3390/info16060434

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop