Next Article in Journal
The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control
Previous Article in Journal
Optimization for Sustainability: A Comparative Analysis of Evolutionary Crossover Operators for the Traveling Salesman Problem (TSP) with a Case Study on Croatia
Previous Article in Special Issue
A Novel Hybrid Attention-Based RoBERTa-BiLSTM Model for Cyberbullying Detection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Context-Aware Representation-Learning-Based Model for Detecting Human-Written and AI-Generated Cryptocurrency Tweets Across Large Language Models

1
School of Systems and Technology, University of Management and Technology, Lahore 54770, Pakistan
2
Department of Finance, Bucharest University of Economic Studies, 6 Piata Romana, 010374 Bucharest, Romania
3
Arfa Karim Technology Incubator, Lahore 54000, Pakistan
*
Author to whom correspondence should be addressed.
Math. Comput. Appl. 2025, 30(6), 130; https://doi.org/10.3390/mca30060130
Submission received: 15 October 2025 / Revised: 25 November 2025 / Accepted: 26 November 2025 / Published: 29 November 2025

Abstract

The extensive use of large language models (LLMs), particularly in the finance sector, raises concerns about the authenticity and reliability of generated text. Developing a robust method for distinguishing between human-written and AI-generated financial content is therefore essential. This study addressed this challenge by constructing a dataset based on financial tweets, where original financial tweet texts were regenerated using six LLMs, resulting in seven distinct classes: human-authored text, LLaMA3.2, Phi3.5, Gemma2, Qwen2.5, Mistral, and LLaVA. A context-aware representation-learning-based model, namely DeBERTa, was extensively fine-tuned for this task. Its performance was compared to that of other transformer variants (DistilBERT, BERT Base Uncased, ELECTRA, and ALBERT Base V1) as well as traditional machine learning models (logistic regression, naive Bayes, random forest, decision trees, XGBoost, AdaBoost, and voting (AdaBoost, GradientBoosting, XGBoost)) using Word2Vec embeddings. The proposed DeBERTa-based model achieved an impressive test accuracy, precision, recall, and F1-score, all reaching 94%. In contrast, competing transformer models achieved test accuracies ranging from 0.78 to 0.80, while traditional machine learning models yielded a significantly lower performance (0.39–0.80). These results highlight the effectiveness of context-aware representation learning in distinguishing between human-written and AI-generated financial text, with significant implications for text authentication, authorship verification, and financial information security.
Keywords: large language models; AI-generated text; financial; cryptocurrency; text classification; DeBERTa; context-aware learning; Word2Vec; text authentication; machine learning; transformer large language models; AI-generated text; financial; cryptocurrency; text classification; DeBERTa; context-aware learning; Word2Vec; text authentication; machine learning; transformer

Share and Cite

MDPI and ACS Style

Arshed, M.A.; Gherghina, Ş.C.; Khalil, I.; Muavia, H.; Saleem, A.; Saleem, H. A Context-Aware Representation-Learning-Based Model for Detecting Human-Written and AI-Generated Cryptocurrency Tweets Across Large Language Models. Math. Comput. Appl. 2025, 30, 130. https://doi.org/10.3390/mca30060130

AMA Style

Arshed MA, Gherghina ŞC, Khalil I, Muavia H, Saleem A, Saleem H. A Context-Aware Representation-Learning-Based Model for Detecting Human-Written and AI-Generated Cryptocurrency Tweets Across Large Language Models. Mathematical and Computational Applications. 2025; 30(6):130. https://doi.org/10.3390/mca30060130

Chicago/Turabian Style

Arshed, Muhammad Asad, Ştefan Cristian Gherghina, Iqra Khalil, Hasnain Muavia, Anum Saleem, and Hajran Saleem. 2025. "A Context-Aware Representation-Learning-Based Model for Detecting Human-Written and AI-Generated Cryptocurrency Tweets Across Large Language Models" Mathematical and Computational Applications 30, no. 6: 130. https://doi.org/10.3390/mca30060130

APA Style

Arshed, M. A., Gherghina, Ş. C., Khalil, I., Muavia, H., Saleem, A., & Saleem, H. (2025). A Context-Aware Representation-Learning-Based Model for Detecting Human-Written and AI-Generated Cryptocurrency Tweets Across Large Language Models. Mathematical and Computational Applications, 30(6), 130. https://doi.org/10.3390/mca30060130

Article Metrics

Back to TopTop