Skip to Content

Big Data and Cognitive Computing, Volume 10, Issue 2

2026 February - 28 articles

Cover Story: News data powers research in economics, social science, and NLP, yet full-text corpora are often expensive or hard to access. We introduce gdeltnews (https://github.com/iandreafc/gdeltnews), an open-source Python package that reconstructs near-complete online news articles from the GDELT Web News NGrams 3.0 dataset by assembling overlapping fragments with positional constraints. Validated on 2211 URL-matched articles from major U.S. outlets, the reconstructions achieve up to ~95% similarity. The tool enables scalable, reproducible, near-zero-cost access to global news text for custom analysis. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (28)

  • Article
  • Open Access
435 Views
23 Pages

This paper presents a comparative analysis of read performance for PostgreSQL and MongoDB in e-commerce scenarios, using identical datasets in a resource-constrained single-host environment. The results demonstrate that PostgreSQL executes complex an...

  • Perspective
  • Open Access
367 Views
74 Pages

The convergence of Lean manufacturing principles with Industry 4.0 has yielded significant operational improvements, yet the emerging paradigm of Industry 6.0—characterized by antifragile, autonomous, and sustainable systems—demands a fun...

  • Article
  • Open Access
286 Views
38 Pages

Visual exploration of time series data is essential for uncovering meaningful insights in domains such as healthcare monitoring and financial analysis, yet it remains computationally challenging due to the combinatorial explosion of potential subsequ...

  • Article
  • Open Access
382 Views
17 Pages

Skill Classification of Youth Table Tennis Players Using Sensor Fusion and the Random Forest Algorithm

  • Yung-Hoh Sheu,
  • Cheng-Yu Huang,
  • Li-Wei Tai,
  • Tzu-Hsuan Tai and
  • Sheng K. Wu

This study addresses the issue of inaccurate results in traditional table tennis player classification, which is often influenced by subjective judgment and environmental factors, by proposing a youth table tennis player classification system based o...

  • Article
  • Open Access
269 Views
25 Pages

To meet the requirements for the automatic alignment, insertion, and inspection of guide-tube opening pins on the upper core plate in a component pool during refueling outages of nuclear power units, this paper proposes a cognition-enhanced visual-se...

  • Article
  • Open Access
325 Views
17 Pages

Large Language Model (LLM) inference engines are becoming critical system infrastructure, yet their increasing architectural complexity makes defects difficult to be diagnosed and repaired. Existing reliability studies predominantly focus on model be...

  • Article
  • Open Access
456 Views
14 Pages

Financial sentiment analysis leverages natural language processing techniques to quantitatively assess sentiment polarity and emotional tendencies in financial texts. Its practical application in investment decision-making and risk management faces t...

  • Article
  • Open Access
426 Views
22 Pages

Bias Correction and Explainability Framework for Large Language Models: A Knowledge-Driven Approach

  • Xianming Yang,
  • Qi Li,
  • Chengdong Qian,
  • Haitao Wang,
  • Yonghui Wu and
  • Wei Wang

Large Language Models (LLMs) have demonstrated extraordinary capabilities in natural language generation; however, their real-world deployment is frequently hindered by the generation of factually incorrect or biased content, along with an inherent d...

  • Article
  • Open Access
256 Views
24 Pages

Metaheuristic optimization algorithms have demonstrated their effectiveness in solving complex optimization tasks, such as those related to Intrusion Detection Systems (IDSs). It was widely used to enhance the detection rate of various types of cyber...

  • Article
  • Open Access
337 Views
20 Pages

Although retrieval-augmented generation (RAG) technology mitigates the hallucination issue in large language models (LLMs) by incorporating external knowledge, and combining reasoning models can further enhance RAG system performance, retrieval noise...

  • Review
  • Open Access
1,019 Views
17 Pages

Large language models (LLMs) are now routine writing tools across various domains, intensifying questions about when text should be treated as human-authored, artificial intelligence (AI)-generated, or collaboratively produced. This rapid review aims...

  • Article
  • Open Access
266 Views
32 Pages

The enhanced long-range navigation (eLoran) system serves as an important backup method for the global navigation satellite system (GNSS) system. In long-distance transmission scenarios, the signal propagation delay of the eLoran system is affected b...

  • Article
  • Open Access
573 Views
28 Pages

Modern ICT Tools and Video Content in Athletes’ Education—Inspiration from Corporate Learning and Development

  • Martin Mičiak,
  • Dominika Toman,
  • Milan Kubina,
  • Tatiana Poljaková,
  • Klaudia Ivanovič,
  • Kvetoslava Šimová,
  • Anna Majchráková,
  • Ivana Bystrická,
  • Linda Kováčik and
  • Tibor Furmánek

Active athletes represent a specific target for learning and development. Their schedules, including training sessions and competitions, leave little time for education. However, athletes still need skills beyond sports to ensure they are prepared fo...

  • Article
  • Open Access
289 Views
16 Pages

CCTD-MARL: Coupled Communication-Task Decoupling Framework for Multi-Agent Systems Under Partial Observability

  • Kehan Li,
  • Zhenya Wang,
  • Xin Tang,
  • Heng You,
  • Long Hu,
  • Haidong Xie and
  • Min Chen

Although multi-agent reinforcement learning (MARL) has achieved significant success in various domains, its deployment in real-world scenarios remains challenging, particularly in communication-constrained environments involving multi-task coupling....

  • Article
  • Open Access
344 Views
27 Pages

Hybrid Method of Organizing Information Search in Logistics Systems Based on Vector-Graph Structure and Large Language Models

  • Vadim Voloshchuk,
  • Yaroslav Melnik,
  • Irina Safronenkova,
  • Egor Lishchenko,
  • Oleg Kartashov and
  • Alexander Kozlovskiy

In logistics systems, the organization of information retrieval plays a key role in human interaction with technical systems to ensure decision-making speed, route optimization, planning, and resource allocation. At the same time, the efficiency of t...

  • Article
  • Open Access
297 Views
14 Pages

Improving Transferability of Adversarial Attacks via Maximization and Targeting from Image to Video Quality Assessment

  • Georgii Gotin,
  • Ekaterina Shumitskaya,
  • Dmitriy Vatolin and
  • Anastasia Antsiferova

This paper proposes a novel method for transferable adversarial attacks from Image Quality Assessment (IQA) to Video Quality Assessment (VQA) models. Attacking modern VQA models is challenging due to their high complexity and the temporal nature of v...

  • Article
  • Open Access
419 Views
25 Pages

SiAraSent: From Features to Deep Transformers for Large-Scale Arabic Sentiment Analysis

  • Omar Almousa,
  • Yahya Tashtoush,
  • Anas AlSobeh,
  • Plamen Zahariev and
  • Omar Darwish

Sentiment analysis of Arabic text, particularly on social media platforms, presents a formidable set of unique challenges that stem from the language’s complex morphology, its numerous dialectal variations, and the frequent and nuanced use of e...

  • Article
  • Open Access
345 Views
26 Pages

To overcome the limitations of current link prediction methods in effectively leveraging topological information and node importance, this paper introduces a new model called AMPS (Adaptive Multi-scale Potential-enhanced Path Similarity). The model i...

  • Article
  • Open Access
417 Views
20 Pages

In practical well-logging datasets, severe missing values, anomalous disturbances, and highly imbalanced lithology classes are pervasive. To address these challenges, this study proposes a well-logging lithology identification framework that combines...

  • Review
  • Open Access
1,778 Views
34 Pages

In the big data era, understanding and influencing consumer behavior in digital marketing increasingly relies on large-scale data and AI-driven analytics. This narrative, concept-driven review examines how big data technologies and machine learning r...

  • Article
  • Open Access
750 Views
18 Pages

News data have become essential resources across various disciplines. Still, access to full-text news corpora remains challenging due to high costs and the limited availability of free alternatives. This paper presents a novel Python package (gdeltne...

  • Article
  • Open Access
373 Views
39 Pages

In this paper, an explainable decision-making and guidance integration method is developed based on dynamic Bayesian network and the optimized control method. The proposed method can be applied for the autonomous decision-making and guidance in the g...

  • Article
  • Open Access
343 Views
24 Pages

Speaker anonymization effectively conceals speaker identity in speech signals to protect privacy. To address issues in existing anonymization systems, including reduced voice distinguishability, limited anonymized voices, reliance on an external spea...

  • Article
  • Open Access
398 Views
15 Pages

Cybersecurity has become one of the top priorities in Saudi Arabia, playing a key role in achieving Vision 2030 and advancing the kingdom’s position in digital transformation. This study investigates how cybersecurity knowledge, attitudes, and...

  • Article
  • Open Access
267 Views
16 Pages

LLM4ATS: Applying Large Language Models for Auto-Testing Scripts in Automobiles

  • Zeyuan Li,
  • Wei Li,
  • Yuezhao Liu,
  • Wenhao Li and
  • Min Chen

This paper introduces LLM4ATS, a framework integrating large language models, RAG, and closed-loop verification to automatically generate highly reliable automotive automated test scripts from natural language descriptions. Addressing the complex lin...

  • Article
  • Open Access
456 Views
19 Pages

In the context of a new wave of scientific and technological revolution and industrial transformation, this study proposes an emerging technology identification framework that integrates a High-Value Patent Knowledge Graph with Social Network Analysi...

  • Article
  • Open Access
456 Views
20 Pages

Renewable energy sources like wind turbines and solar panels are integrated into modern power grids as Distributed Energy Resources (DERs). These DERs can operate independently or as part of microgrids. Interconnecting multiple microgrids creates Net...

XFacebookLinkedIn
Big Data Cogn. Comput. - ISSN 2504-2289