Big Data and Cognitive Computing, Volume 10, Issue 2
2026 February - 28 articles
Cover Story: News data powers research in economics, social science, and NLP, yet full-text corpora are often expensive or hard to access. We introduce gdeltnews (https://github.com/iandreafc/gdeltnews), an open-source Python package that reconstructs near-complete online news articles from the GDELT Web News NGrams 3.0 dataset by assembling overlapping fragments with positional constraints. Validated on 2211 URL-matched articles from major U.S. outlets, the reconstructions achieve up to ~95% similarity. The tool enables scalable, reproducible, near-zero-cost access to global news text for custom analysis. View this paper - Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
- You may sign up for email alerts to receive table of contents of newly released issues.
- PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.