24,071 Results Found

  • Article
  • Open Access
3,872 Views
18 Pages

DefAn: Definitive Answer Dataset for LLM Hallucination Evaluation

  • A. B. M. Ashikur Rahman,
  • Saeed Anwar,
  • Muhammad Usman,
  • Irfan Ahmad and
  • Ajmal Mian

28 October 2025

Large Language Models (LLMs) represent a major step in AI development and are increasingly used in daily applications. However, they are prone to hallucinations, generating claims that contradict established facts, deviating from prompts, and produci...

  • Article
  • Open Access
1 Citation
2,120 Views
19 Pages

Applicability Evaluation of the Global Synthetic Tropical Cyclone Hazard Dataset in Coastal China

  • Xiaomin Li,
  • Qi Hou,
  • Jie Zhang,
  • Suming Zhang,
  • Xuexue Du and
  • Tangqi Zhao

A tropical cyclone dataset is an important data source for tropical cyclone disaster research, and the evaluation of its applicability is a necessary prerequisite. The Global Synthetic Tropical Cyclone Hazard (GSTCH) dataset is a dataset of global tr...

  • Article
  • Open Access
2 Citations
1,526 Views
22 Pages

Educational Evaluation with MLLMs: Framework, Dataset, and Comprehensive Assessment

  • Yuqing Chen,
  • Yixin Li,
  • Yupei Ren,
  • Yixin Liu and
  • Yiping Ma

19 September 2025

With the rapid development of Multimodal Large Language Models (MLLMs) in education, their applications have mainly focused on content generation tasks such as text writing and courseware production. However, automated assessment of non-exam learning...

  • Article
  • Open Access
975 Views
14 Pages

A Dataset and Experimental Evaluation of a Parallel Conflict Detection Solution for Model-Based Diagnosis

  • Jessica Janina Cabezas-Quinto,
  • Cristian Vidal-Silva,
  • Jorge Serrano-Malebrán and
  • Nicolás Márquez

29 August 2025

This article presents a dataset and experimental evaluation of a parallelized variant of Junker’s QuickXPlain algorithm, designed to efficiently compute minimal conflict sets in constraint-based diagnosis tasks. The dataset includes performance...

  • Article
  • Open Access
14 Citations
4,911 Views
17 Pages

Statistical Evaluation and Analysis of Road Extraction Methodologies Using a Unique Dataset from Remote Sensing

  • Guilherme Pina Cardim,
  • Erivaldo Antônio da Silva,
  • Mauricio Araújo Dias,
  • Ignácio Bravo and
  • Alfredo Gardel

18 April 2018

In the scientific literature, multiple studies address the application of road extraction methodologies to a particular cartographic dataset. However, it is difficult for any study to perform a more reliable comparison among road extraction methodolo...

  • Data Descriptor
  • Open Access
9 Citations
4,769 Views
9 Pages

24 August 2019

Datasets are important for researchers to build models and test how these perform, as well as to reproduce research experiments from others. This data paper presents the NILM Performance Evaluation dataset (NILMPEds), which is aimed primarily at rese...

  • Article
  • Open Access
20 Citations
9,488 Views
23 Pages

Epitope Prediction Based on Random Peptide Library Screening: Benchmark Dataset and Prediction Tools Evaluation

  • Pingping Sun,
  • Wenhan Chen,
  • Yanxin Huang,
  • Hongyan Wang,
  • Zhiqiang Ma and
  • Yinghua Lv

16 June 2011

Epitope prediction based on random peptide library screening has become a focus as a promising method in immunoinformatics research. Some novel software and web-based servers have been proposed in recent years and have succeeded in given test cases....

  • Data Descriptor
  • Open Access
4 Citations
5,113 Views
10 Pages

Organ-On-A-Chip (OOC) Image Dataset for Machine Learning and Tissue Model Evaluation

  • Valērija Movčana,
  • Arnis Strods,
  • Karīna Narbute,
  • Fēlikss Rūmnieks,
  • Roberts Rimša,
  • Gatis Mozoļevskis,
  • Maksims Ivanovs,
  • Roberts Kadiķis,
  • Kārlis Gustavs Zviedris and
  • Arturs Abols
  • + 3 authors

1 February 2024

Organ-on-a-chip (OOC) technology has emerged as a groundbreaking approach for emulating the physiological environment, revolutionizing biomedical research, drug development, and personalized medicine. OOC platforms offer more physiologically relevant...

  • Article
  • Open Access
680 Views
48 Pages

AutoML-Based Prediction of Unconfined Compressive Strength of Stabilized Soils: A Multi-Dataset Evaluation on Worldwide Experimental Data

  • Romulo Murucci Oliveira,
  • Deivid Campos,
  • Katia Vanessa Bicalho,
  • Bruno da S. Macêdo,
  • Matteo Bodini,
  • Camila Martins Saporetti and
  • Leonardo Goliatt

18 December 2025

Unconfined Compressive Strength (UCS) of stabilized soils is commonly used for evaluating the effectiveness of soil improvement techniques. Achieving target UCS values through conventional trial-and-error approaches requires extensive laboratory expe...

  • Article
  • Open Access
1 Citation
1,615 Views
17 Pages

Applicability Evaluation of Antarctic Ozone Reanalysis and Merged Satellite Datasets

  • Junzhe Chen,
  • Yu Zhang,
  • Houxiang Shi,
  • Hao Hu and
  • Jianjun Xu

10 June 2025

In this study, based on total column ozone observations from eight Antarctic stations, we evaluate the applicability of ERA5, C3S-MSR, MERRA-2, and JRA-55 reanalysis datasets and the NIWA-BS merged satellite dataset, in terms of interannual variation...

  • Article
  • Open Access
4 Citations
3,962 Views
26 Pages

27 May 2025

The integration of dynamic hand gesture recognition in computer vision-based systems promises enhanced human–computer interaction, providing a natural and intuitive way of communicating. However, achieving real-time performance efficiency is a...

  • Article
  • Open Access
47 Citations
5,810 Views
20 Pages

Evaluation of Eight Global Precipitation Datasets in Hydrological Modeling

  • Yiheng Xiang,
  • Jie Chen,
  • Lu Li,
  • Tao Peng and
  • Zhiyuan Yin

19 July 2021

The number of global precipitation datasets (PPs) is on the rise and they are commonly used for hydrological applications. A comprehensive evaluation on their performance in hydrological modeling is required to improve their performance. This study c...

  • Article
  • Open Access
3 Citations
2,083 Views
24 Pages

Multiscale Evaluation of Gridded Precipitation Datasets across Varied Elevation Zones in Central Asia’s Hilly Region

  • Manuchekhr Gulakhmadov,
  • Xi Chen,
  • Aminjon Gulakhmadov,
  • Muhammad Umar Nadeem,
  • Nekruz Gulahmadov and
  • Tie Liu

17 October 2023

The lack of observed data makes research on the cryosphere and ecology extremely difficult, especially in Central Asia’s hilly regions. Before their direct hydroclimatic uses, the performance study of gridded precipitation datasets (GPDS) is of...

  • Article
  • Open Access
7 Citations
2,728 Views
19 Pages

27 January 2024

Due to the scarcity of meteorological stations on the Tibetan Plateau (TP), owing to the high altitude and harsh climate, studies often resort to satellite, reanalysis, and merged multi-source precipitation data. This necessitates an evaluation of TP...

  • Article
  • Open Access
1 Citation
792 Views
24 Pages

26 September 2025

As the Pamir Plateau is known as the “Water Tower of Central Asia”, an accurate precipitation dataset is essential for the study of climate and hydrology in this region. Based on the monthly precipitation observations from 268 meteorological...

  • Article
  • Open Access
36 Citations
4,924 Views
19 Pages

Evaluation of Sixteen Gridded Precipitation Datasets over the Caribbean Region Using Gauge Observations

  • Abel Centella-Artola,
  • Arnoldo Bezanilla-Morlot,
  • Michael A. Taylor,
  • Dimitris A. Herrera,
  • Daniel Martinez-Castro,
  • Isabelle Gouirand,
  • Maibys Sierra-Lorenzo,
  • Alejandro Vichot-Llano,
  • Tannecia Stephenson and
  • Milena Alpizar
  • + 2 authors

9 December 2020

The existence of several gridded precipitation products (GPP) has facilitated studies related to climate change, climate modeling, as well as a better understanding of the physical processes underpinning this key variable. Due to complexities in esti...

  • Article
  • Open Access
34 Citations
5,539 Views
24 Pages

Satellite-Based Precipitation Datasets Evaluation Using Gauge Observation and Hydrological Modeling in a Typical Arid Land Watershed of Central Asia

  • Jiabin Peng,
  • Tie Liu,
  • Yue Huang,
  • Yunan Ling,
  • Zhengyang Li,
  • Anming Bao,
  • Xi Chen,
  • Alishir Kurban and
  • Philippe De Maeyer

11 January 2021

Hydrological modeling has always been a challenge in the data-scarce watershed, especially in the areas with complex terrain conditions like the inland river basin in Central Asia. Taking Bosten Lake Basin in Northwest China as an example, the accura...

  • Article
  • Open Access
5 Citations
2,718 Views
18 Pages

27 August 2022

Global land use/cover change (LUCC) datasets are essential for quantitatively assessing the impacts of LUCC on global change, but many uncertainties in existing global datasets seriously hamper climate modeling. Evaluating the reliability of existing...

  • Article
  • Open Access
16 Citations
3,739 Views
20 Pages

21 September 2020

Air temperature and precipitation are two important meteorological factors affecting the earth’s energy exchange and hydrological process. High quality temperature and precipitation forcing datasets are of great significance to agro-meteorology...

  • Article
  • Open Access
26 Citations
3,833 Views
26 Pages

19 March 2021

Multi-source soil moisture (SM) products provide a vigorous tool for the estimation of soil moisture on a large scale, but it is crucial to carry out the evaluation of those products before further application. In the present work, an evaluation fram...

  • Data Descriptor
  • Open Access
10 Citations
7,851 Views
18 Pages

Thailand Raw Water Quality Dataset Analysis and Evaluation

  • Jaturapith Krohkaew,
  • Pongpon Nilaphruek,
  • Niti Witthayawiroj,
  • Sakchai Uapipatanakul,
  • Yamin Thwe and
  • Padma Nyoman Crisnapati

4 September 2023

Sustainable water quality data are important for understanding historical variability and trends in river regimes, as well as the impact of industrial waste on the health of aquatic ecosystems. Sustainable water management practices heavily depend on...

  • Article
  • Open Access
3,048 Views
17 Pages

16 February 2025

Developing robust and reliable models for Named Entity Recognition (NER) in the Russian language presents significant challenges due to the linguistic complexity of Russian and the limited availability of suitable training datasets. This study introd...

  • Article
  • Open Access
1 Citation
1,981 Views
18 Pages

An Evaluation of Large Language Models for Supplementing a Food Extrusion Dataset

  • Necva Bölücü,
  • Jordan Pennells,
  • Huichen Yang,
  • Maciej Rybinski and
  • Stephen Wan

15 April 2025

Food extrusion is a widely used processing technique that transforms raw ingredients into structured food products—foods with well-defined textures, shapes, and functionalities—through mechanical shear and thermal energy. Despite its broa...

  • Article
  • Open Access
8 Citations
4,015 Views
16 Pages

11 July 2023

Steady-state visual evoked potential (SSVEP)-based brain–computer interface (BCI) systems have been extensively researched over the past two decades, and multiple sets of standard datasets have been published and widely used. However, there are...

  • Article
  • Open Access
42 Citations
5,672 Views
20 Pages

The Proposition and Evaluation of the RoEduNet-SIMARGL2021 Network Intrusion Detection Dataset

  • Maria-Elena Mihailescu,
  • Darius Mihai,
  • Mihai Carabas,
  • Mikołaj Komisarek,
  • Marek Pawlicki,
  • Witold Hołubowicz and
  • Rafał Kozik

24 June 2021

Cybersecurity is an arms race, with both the security and the adversaries attempting to outsmart one another, coming up with new attacks, new ways to defend against those attacks, and again with new ways to circumvent those defences. This situation c...

  • Review
  • Open Access
2 Citations
1,944 Views
34 Pages

A Comprehensive Benchmarking Framework for Sentinel-2 Sharpening: Methods, Dataset, and Evaluation Metrics

  • Matteo Ciotola,
  • Giuseppe Guarino,
  • Antonio Mazza,
  • Giovanni Poggi and
  • Giuseppe Scarpa

7 June 2025

The advancement of super-resolution and sharpening algorithms for satellite images has significantly expanded the potential applications of remote sensing data. In the case of Sentinel-2, despite significant progress, the lack of standardized dataset...

  • Article
  • Open Access
21 Citations
10,300 Views
16 Pages

Evidence from recent research shows that automatic visual evaluation (AVE) of photographic images of the uterine cervix using deep learning-based algorithms presents a viable solution for improving cervical cancer screening by visual inspection with...

  • Article
  • Open Access
32 Citations
5,326 Views
17 Pages

Evaluation of Potential Evapotranspiration Based on CMADS Reanalysis Dataset over China

  • Ye Tian,
  • Kejun Zhang,
  • Yue-Ping Xu,
  • Xichao Gao and
  • Jie Wang

23 August 2018

Potential evapotranspiration (PET) is used in many hydrological models to estimate actual evapotranspiration. The calculation of PET by the Food and Agriculture Organization of the United Nations (FAO) Penman–Monteith method requires data for s...

  • Article
  • Open Access
7 Citations
4,058 Views
21 Pages

19 February 2025

With the rapid development of large visual language models (LVLMs) and multimodal large language models (MLLMs), these models have demonstrated strong performance in various multimodal tasks. However, alleviating the generation of hallucinations rema...

  • Article
  • Open Access
20 Citations
5,476 Views
31 Pages

13 June 2023

Cybersecurity has become one of the focuses of organisations. The number of cyberattacks keeps increasing as Internet usage continues to grow. As new types of cyberattacks continue to emerge, researchers focus on developing machine learning (ML)-base...

  • Proceeding Paper
  • Open Access
2,224 Views
3 Pages

Building High-Quality Datasets for Information Retrieval Evaluation at a Reduced Cost

  • David Otero,
  • Daniel Valcarce,
  • Javier Parapar and
  • Álvaro Barreiro

Information Retrieval is not any more exclusively about document ranking. Continuously new tasks are proposed on this and sibling fields. With this proliferation of tasks, it becomes crucial to have a cheap way of constructing test collections to eva...

  • Article
  • Open Access
3 Citations
3,165 Views
21 Pages

Evaluating the Accuracy of a Gridded Near-Surface Temperature Dataset over Mainland China

  • Meijuan Qiu,
  • Buchun Liu,
  • Yuan Liu,
  • Yueying Zhang and
  • Shuai Han

High-resolution meteorological data products are crucial for agrometeorological studies. Here, we study the accuracy of an important gridded dataset, the near-surface temperature dataset from the 5 km × 5 km resolution China dataset of meteorol...

  • Article
  • Open Access
4 Citations
1,581 Views
28 Pages

22 January 2025

With the accelerating pace of global warming, the imperative of selecting robust, long-term drought monitoring tools is becoming increasingly pronounced. In this study, we computed the Standardized Precipitation Evapotranspiration Index (SPEI) at bot...

  • Article
  • Open Access
13 Citations
3,397 Views
23 Pages

14 March 2024

Structural health monitoring and condition assessment of existing bridge decks is a growing challenge. Conventional manned inspections are costly, labor-intensive, and often risky to execute. Sub-surface delamination, a leading cause of deck replacem...

  • Article
  • Open Access
8 Citations
7,706 Views
23 Pages

Bird Object Detection: Dataset Construction, Model Performance Evaluation, and Model Lightweighting

  • Yang Wang,
  • Jiaogen Zhou,
  • Caiyun Zhang,
  • Zhaopeng Luo,
  • Xuexue Han,
  • Yanzhu Ji and
  • Jihong Guan

14 September 2023

The application of object detection technology has a positive auxiliary role in advancing the intelligence of bird recognition and enhancing the convenience of bird field surveys. However, challenges arise due to the absence of dedicated bird dataset...

  • Systematic Review
  • Open Access
340 Views
27 Pages

Advances in Face Recognition: A Comprehensive Review of Feature Extraction and Dataset Evaluation

  • Syed Murtaza Hussain Abidi,
  • Syed Ali Hassan,
  • Syed Muhammad Raza and
  • Michail J. Beliatis

Face recognition has become a major research area due to the rapid growth of intelligent software applications. However, reliable face identification remains challenging because human facial features vary significantly under different conditions. Ori...

  • Article
  • Open Access
2 Citations
2,235 Views
19 Pages

A Bilingual Basque–Spanish Dataset of Parliamentary Sessions for the Development and Evaluation of Speech Technology

  • Amparo Varona,
  • Mikel Penagarikano,
  • Germán Bordel and
  • Luis Javier Rodriguez-Fuentes

27 February 2024

The development of speech technology requires large amounts of data to estimate the underlying models. Even when relying on large multilingual pre-trained models, some amount of task-specific data on the target language is needed to fine-tune those m...

  • Data Descriptor
  • Open Access
1 Citation
2,584 Views
14 Pages

25 June 2024

In the age of abundant digital content, children and adolescents face the challenge of developing new information literacy competencies, particularly those pertaining to online inquiry, in order to thrive academically and personally. This article add...

  • Article
  • Open Access
74 Citations
10,381 Views
21 Pages

3DRIED: A High-Resolution 3-D Millimeter-Wave Radar Dataset Dedicated to Imaging and Evaluation

  • Shunjun Wei,
  • Zichen Zhou,
  • Mou Wang,
  • Jinshan Wei,
  • Shan Liu,
  • Jun Shi,
  • Xiaoling Zhang and
  • Fan Fan

25 August 2021

Millimeter-wave (MMW) 3-D imaging technology is becoming a research hotspot in the field of safety inspection, intelligent driving, etc., due to its all-day, all-weather, high-resolution and non-destruction feature. Unfortunately, due to the lack of...

  • Article
  • Open Access
18 Citations
5,689 Views
32 Pages

22 November 2024

This study focuses on the construction and evaluation of a high-quality Chinese Manchu music dataset designed to facilitate Artificial Intelligence (AI) research and applications within cultural heritage and ethnomusicology. Through a systematic coll...

  • Article
  • Open Access
1 Citation
3,017 Views
20 Pages

We introduce an emotional stimuli detection task that targets extracting emotional regions that evoke people’s emotions (i.e., emotional stimuli) in artworks. This task offers new challenges to the community because of the diversity of artwork...

  • Article
  • Open Access
7 Citations
2,909 Views
16 Pages

31 July 2024

Site-specific weed management employs image data to generate maps through various methodologies that classify pixels corresponding to crop, soil, and weed. Further, many studies have focused on identifying specific weed species using spectral data. N...

  • Article
  • Open Access
5 Citations
4,436 Views
15 Pages

A Benchmark for the Evaluation of Corner Detectors

  • Yang Zhang,
  • Baojiang Zhong and
  • Xun Sun

23 November 2022

Corners are an important kind of image feature and play a crucial role in solving various tasks. Over the past few decades, a great number of corner detectors have been proposed. However, there is no benchmark dataset with labeled ground-truth corner...

  • Article
  • Open Access
13 Citations
6,701 Views
21 Pages

Artificial Intelligence for Text-Based Vehicle Search, Recognition, and Continuous Localization in Traffic Videos

  • Karen Panetta,
  • Landry Kezebou,
  • Victor Oludare,
  • James Intriligator and
  • Sos Agaian

6 December 2021

The concept of searching and localizing vehicles from live traffic videos based on descriptive textual input has yet to be explored in the scholarly literature. Endowing Intelligent Transportation Systems (ITS) with such a capability could help solve...

  • Feature Paper
  • Article
  • Open Access
26 Citations
8,020 Views
19 Pages

A New Dataset and Performance Evaluation of a Region-Based CNN for Urban Object Detection

  • Alex Dominguez-Sanchez,
  • Miguel Cazorla and
  • Sergio Orts-Escolano

In recent years, we have seen a large growth in the number of applications which use deep learning-based object detectors. Autonomous driving assistance systems (ADAS) are one of the areas where they have the most impact. This work presents a novel s...

  • Article
  • Open Access
12 Citations
4,258 Views
16 Pages

Deep 3D Convolutional Neural Network for Facial Micro-Expression Analysis from Video Images

  • Kranthi Kumar Talluri,
  • Marc-André Fiedler and
  • Ayoub Al-Hamadi

1 November 2022

Micro-expression is the involuntary emotion of the human that reflects the genuine feelings that cannot be hidden. Micro-expression is exhibited by facial expressions that last for a short duration and have very low intensity. Because of these reason...

  • Article
  • Open Access
2 Citations
1,811 Views
17 Pages

Evaluation and Comparison of Five Long-Term Precipitation Datasets in the Hang-Jia-Hu Plain of Eastern China

  • Kunxin Wang,
  • Yaohui Qiang,
  • Wei Nie,
  • Peng Gou,
  • Feng Wang,
  • Yang Liu,
  • Xuepeng Zhang,
  • Tianyu Zhou and
  • Siyu Wang

15 July 2024

This study analyzed the applicability of five long-term precipitation datasets in the Hang-Jia-Hu Plain of eastern China based on meteorological observation data. The accuracy of each dataset at different time scales (yearly, monthly) was analyzed. B...

  • Article
  • Open Access
467 Views
16 Pages

xScore: A Simple Metric for Cross-Domain Robustness in Lightweight Vision Models

  • Weidong Zhang,
  • Pak Lun Kevin Ding,
  • Baoxin Li and
  • Huan Liu

23 December 2025

Lightweight vision models are widely deployed in mobile and embedded systems, where strict computational and memory budgets demand compact architectures. However, their evaluation remains dominated by ImageNet—a single, large natural-image data...

  • Article
  • Open Access
1 Citation
2,240 Views
29 Pages

30 June 2023

In social interactions, people who are perceived as competent win more chances, tend to have more opportunities, and perform better in both personal and professional aspects of their lives. However, the process of evaluating competence is still poorl...

  • Article
  • Open Access
1,254 Views
21 Pages

Clinical Application of Vision Transformers for Melanoma Classification: A Multi-Dataset Evaluation Study

  • Antony Garcia,
  • Jixing Zhou,
  • Gabriela Pinero-Crespo,
  • Thomas Beachkofsky and
  • Xinming Huang

28 October 2025

Background: Melanoma is one of the most lethal skin cancers, with survival rates largely dependent on early detection, yet diagnosis remains difficult because of its visual similarity to benign nevi. Convolutional neural networks have achieved strong...
