This is an early access version; the complete PDF, HTML, and XML versions will be available soon.
Data Descriptor

VaxiGen Database of Tumor Immunogens

by Stanislav Sotirov 1,2,*, Ivan Dimitrov 1,2 and Irini Doytchinova 1,2
1 Drug Design and Bioinformatics Lab, Faculty of Pharmacy, Medical University of Sofia, 2, Dunav Str., 1000 Sofia, Bulgaria
2 Centre of Excellence in Informatics and Information and Communication Technologies, 1113 Sofia, Bulgaria
* Author to whom correspondence should be addressed.
Data 2026, 11(5), 123; https://doi.org/10.3390/data11050123
Submission received: 13 March 2026 / Revised: 13 May 2026 / Accepted: 18 May 2026 / Published: 20 May 2026

Abstract

Peptide-based cancer vaccines have become a prominent focus of oncological research as the search for new cancer treatment modalities gains momentum. A pivotal step in their development is the precise identification and characterization of immunogenic tumor antigens. In this context, VaxiJen is one of the most widely used and cited computational servers for predicting immunogenicity, making it a valuable tool for in silico antigen prediction. However, the database underpinning VaxiJen’s predictions has not been comprehensively updated in over fifteen years. To address this, we conducted a systematic search of PubMed to identify articles reporting data on novel immunogenic proteins and peptides undergoing human testing, and curated the corresponding sequences from UniProtKB. Here, we introduce an updated dataset of tumor immunogens, comprising 546 full-length human proteins and 212 human tumor peptides, together with tumor non-immunogens, comprising 548 full-length human proteins and 181 human tumor peptides. The newly compiled VaxiGen tumor dataset is openly accessible and can be downloaded, searched, and processed. Paired with a suitable negative dataset, it can serve as a training set for improving predictions of the immunogenicity of uncharacterized protein or peptide sequences.
Keywords: tumor; immunogenicity; bioinformatics; database; online
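
As a quick check on the dataset composition reported in the abstract, the following minimal sketch tallies the four reported counts by class and by sequence type. The dictionary layout and the helper name `totals_by` are illustrative assumptions, not part of the published dataset or its file format; only the four counts come from the abstract.

```python
# Entry counts as reported in the abstract.
# The (class, sequence type) keys are a hypothetical layout for illustration.
COUNTS = {
    ("immunogen", "protein"): 546,
    ("immunogen", "peptide"): 212,
    ("non-immunogen", "protein"): 548,
    ("non-immunogen", "peptide"): 181,
}


def totals_by(index):
    """Sum the counts grouped by one key component (0 = class, 1 = sequence type)."""
    out = {}
    for key, n in COUNTS.items():
        out[key[index]] = out.get(key[index], 0) + n
    return out


by_class = totals_by(0)
by_type = totals_by(1)
total = sum(COUNTS.values())

# Fraction of all entries labeled as immunogens (class balance).
positive_fraction = by_class["immunogen"] / total

print(by_class)
print(by_type)
print(total, round(positive_fraction, 3))
```

By these counts the two classes are close to balanced overall, while peptides are somewhat more skewed toward immunogens than full-length proteins are.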
