AI | March 2026 - Browse Articles

22 pages, 3493 KB

Open AccessArticle

Deepfake Detection Using Multimodal CLIP-Based SigLIP-2 Vision Transformers

by Joe Soundararajan and Dong Xu

AI 2026, 7(3), 115; https://doi.org/10.3390/ai7030115 - 19 Mar 2026

Viewed by 2702

Background: Deepfakes pose a growing threat to the integrity of visual media, motivating detectors that remain reliable as forgeries become increasingly realistic. Methods: We propose a deepfake detection framework built on CLIP-derived SigLIP-2 vision transformers and a multi-task design that jointly performs (i) [...] Read more.

Background: Deepfakes pose a growing threat to the integrity of visual media, motivating detectors that remain reliable as forgeries become increasingly realistic. Methods: We propose a deepfake detection framework built on CLIP-derived SigLIP-2 vision transformers and a multi-task design that jointly performs (i) classification and (ii) manipulated-region localization when pixel-level supervision is available. We evaluated the approach on three public benchmarks of increasing complexity—HiDF, SID_Set (SIDA), and CiFake—using each dataset’s official partitions where provided (SID_Set uses the predefined train/validation split) and a standardized preprocessing and training pipeline across experiments. Results: On HiDF, our model achieved strong performance on both video and image tracks (AUC up to 0.931 on video and 0.968 on images), yielding large gains relative to previously reported HiDF baselines under their published settings. On SID_Set, the model achieved 99.1% three-class accuracy (real/synthetic/tampered) and produced accurate localization masks for many tampered regions, while we explicitly documented the split protocol and leakage checks to support the validity of the evaluation. On CiFake, the model exceeded 95% accuracy and attained an AUC of 0.986. Conclusions: Overall, the results indicate that SigLIP-2 representations combined with multi-task training can deliver high detection accuracy and interpretable localization on challenging, realistic forgeries, while highlighting the importance of clearly stated evaluation protocols for fair comparison. Full article

(This article belongs to the Section AI Systems: Theory and Applications)

► Show Figures

Figure 1

29 pages, 3025 KB

Open AccessArticle

Trust Triangle: A Reliability-Validity-Generation Framework for Explainable Credit Card Fraud Detection with RAG-Enhanced LLMs Reasoning

by Jin-Ching Shen, Nai-Ching Su and Yi-Bing Lin

AI 2026, 7(3), 114; https://doi.org/10.3390/ai7030114 - 19 Mar 2026

Viewed by 938

Journal Menu

Journal Browser

AI, Volume 7, Issue 3 (March 2026) – 36 articles

Further Information

Guidelines

MDPI Initiatives

Follow MDPI