MDPI - Publisher of Open Access Journals

20 pages, 45555 KB

Open AccessArticle

FAIRHiveFrames-1K: A Public FAIR Dataset of 1265 Annotated Hive Frame Images with Preliminary YOLOv8 and YOLOv11 Baselines

by Vladimir Kulyukin, Reagan Hill and Aleksey Kulyukin

Sensors 2026, 26(8), 2518; https://doi.org/10.3390/s26082518 - 19 Apr 2026

Viewed by 391

Abstract

In precision apiculture, the portable digital camera is a cost-effective sensor for capturing hive images or videos used to quantify different colony variables. Openly accessible, well-annotated, interoperable cell-level image datasets are still the exception rather than the norm. This shortage constitutes a major [...] Read more.

In precision apiculture, the portable digital camera is a cost-effective sensor for capturing hive images or videos used to quantify different colony variables. Openly accessible, well-annotated, interoperable cell-level image datasets are still the exception rather than the norm. This shortage constitutes a major barrier to AI-driven approaches aimed at automating image-based comb analysis. In this article, we present FAIRHiveFrames-1K, a publicly available dataset of 1265 annotated hive frame images (1920 × 1080 PNG) designed to facilitate research in AI-intensive image-based comb analysis automation. The dataset, derived from a 2013–2022 U.S. Department of Agriculture–Agricultural Research Service multi-sensor research reservoir, includes 124,669 annotated regions of interest for seven biologically meaningful categories consistent with comb analysis literature and standard hive inspection protocols. FAIRHiveFrames-1K is curated according to FAIR principles (Findable, Accessible, Interoperable, Reusable) and distributed under CC-BY 4.0 with standard annotation formats, fixed training and validation splits, and reproducible benchmarking artifacts. To establish preliminary baseline performance, we iteratively tuned four YOLO architectures (YOLOv8n, YOLOv8s, YOLOv11n, YOLOv11s) under a shared tuning protocol over the period of dataset growth. Full article

(This article belongs to the Special Issue Advanced Wireless Sensor Network Deployment in Smart Cities, Industry 4.0, and Agriculture 4.0 (2nd Edition))

► Show Figures

Figure 1

12 pages, 1134 KB

Open AccessArticle

AI-Assisted Preoperative Diagnosis of Wilms Tumor

by Mustafa Alper Akay, Ozan Can Tatar, Elif Tatar, Uğur Demirsoy, Yonca Anık, Gülşen Ekingen Yıldız and Onursal Varlıklı

Life 2026, 16(4), 659; https://doi.org/10.3390/life16040659 - 13 Apr 2026

Viewed by 512

Abstract

Preoperative differentiation of Wilms tumor and neuroblastoma on pediatric abdominal computed tomography (CT) images may be challenging because of overlapping imaging features. We aimed to develop an artificial intelligence-assisted lesion-localization model for exploratory diagnostic support in this differential setting. In this single-center, retrospective, [...] Read more.

Preoperative differentiation of Wilms tumor and neuroblastoma on pediatric abdominal computed tomography (CT) images may be challenging because of overlapping imaging features. We aimed to develop an artificial intelligence-assisted lesion-localization model for exploratory diagnostic support in this differential setting. In this single-center, retrospective, image-level study, a YOLO26s detector was trained on preoperative contrast-enhanced CT PNG images with histopathology-anchored labels. The dataset comprised 3553 images, including 2103 lesion-positive images and 1450 background-negative images, partitioned into training, validation, and test subsets. On the held-out test set, the model achieved a precision of 0.954, a recall of 0.951, an mAP@0.5 of 0.977, and an mAP@0.5:0.95 of 0.732. Class-specific mAP@0.5:0.95 values were 0.734 for neuroblastoma and 0.730 for Wilms tumor. At the image level, tumor-present versus background-negative discrimination yielded 99.5% sensitivity, 89.0% specificity, a 93.0% positive predictive value, a 99.2% negative predictive value, and 95.3% accuracy. YOLO26s showed strong within-dataset performance for lesion localization and differential support between Wilms tumor and neuroblastoma. Full article

(This article belongs to the Special Issue Advancing Cancer Radiodiagnostics Through Artificial Intelligence: Techniques, Applications, and Challenges)

► Show Figures

Figure 1

16 pages, 1873 KB

Open AccessArticle

Prompt-Guided Structured Multimodal NER with SVG and ChatGPT

by Yuzhou Ma, Haolong Qian, Shujun Xia and Wei Li

Electronics 2026, 15(6), 1276; https://doi.org/10.3390/electronics15061276 - 18 Mar 2026

Viewed by 451

Abstract

Multimodal named entity recognition (MNER) leverages both textual and visual information to improve entity recognition, particularly in unstructured scenarios such as social media. While existing approaches predominantly rely on raster images (e.g., JPEG, PNG), scalable vector graphics (SVG) offer unique advantages in resolution [...] Read more.

Multimodal named entity recognition (MNER) leverages both textual and visual information to improve entity recognition, particularly in unstructured scenarios such as social media. While existing approaches predominantly rely on raster images (e.g., JPEG, PNG), scalable vector graphics (SVG) offer unique advantages in resolution independence and structured semantic representation—an underexplored potential in multimodal learning. To fill this gap, we propose MNER-SVG, the first framework that incorporates SVG as a visual modality and enhances it with ChatGPT-generated auxiliary knowledge. Specifically, we introduce a Multimodal Similar Instance Perception Module that retrieves semantically relevant examples and prompts ChatGPT to generate contextual explanations. We further construct a Full-Text Graph and a Multimodal Interaction Graph, which are processed via Graph Attention Networks (GATs) to achieve fine-grained cross-modal alignment and feature fusion. Finally, a Conditional Random Field (CRF) layer is employed for structured decoding. To support evaluation, we present SvgNER, the first MNER dataset annotated with SVG-specific visual content. Extensive experiments demonstrate that MNER-SVG achieves state-of-the-art performance with an F1 score of 82.23%, significantly outperforming both text-only and existing multimodal baselines. This work validates the feasibility and potential of integrating vector graphics and large language model-generated knowledge into multimodal NER, opening a new research direction for structured visual semantics in fine-grained multimodal understanding. Full article

(This article belongs to the Section Artificial Intelligence)

► Show Figures

Figure 1

28 pages, 14615 KB

Open AccessFeature PaperArticle

Anatomic Interactive Atlas of the Loggerhead Sea Turtle (Caretta caretta) Coelomic Cavity

by Alberto Arencibia, Aday Melián and Jorge Orós

Animals 2026, 16(5), 754; https://doi.org/10.3390/ani16050754 - 28 Feb 2026

Viewed by 2049

Abstract

The coelomic cavity of sea turtles is affected by congenital, developmental, traumatic, infectious, and organ- or system-specific disorders, making accurate anatomical knowledge essential for veterinary practice. This study presents an open-access, interactive two-dimensional (2D) anatomical atlas of the coelomic cavity of the loggerhead [...] Read more.

The coelomic cavity of sea turtles is affected by congenital, developmental, traumatic, infectious, and organ- or system-specific disorders, making accurate anatomical knowledge essential for veterinary practice. This study presents an open-access, interactive two-dimensional (2D) anatomical atlas of the coelomic cavity of the loggerhead sea turtle (Caretta caretta), developed using images obtained from osteology, gross anatomical dissections, computed tomography (CT), and magnetic resonance imaging (MRI). The atlas comprises six osteology images, sixteen anatomical dissection images, eight transverse CT images acquired using bone and soft-tissue windows, six three-dimensional (3D) volume-rendered CT images, and fourteen MRI images (four transverse, five dorsal, and five sagittal), all provided in PNG format. Relevant anatomical structures were segmented and colour-coded for each figure using manual layer-based segmentation software. The Unity 3D platform was employed for image visualisation and assessment, supporting the development of interactive two-dimensional content. This atlas serves as a useful interactive tool for anatomical learning and clinical reference for professionals and students engaged in the conservation of loggerhead sea turtles. Full article

(This article belongs to the Section Herpetology)

► Show Figures

Figure 1

17 pages, 6883 KB

Open AccessArticle

A Comparative Evaluation of Super-Resolution Methods for Spectral Images Using Pretrained RGB Models

by Navid Shokoohi, Abdelhamid N. Fsian, Jean-Baptiste Thomas and Pierre Gouton

Sensors 2026, 26(2), 683; https://doi.org/10.3390/s26020683 - 20 Jan 2026

Viewed by 820

Abstract

The spatial resolution of spectral imaging systems is fundamentally constrained by hardware trade-offs, and the availability of large-scale annotated spectral datasets remains limited. This study presents a comprehensive evaluation of super-resolution (SR) methods across interpolation-based, CNN-based, GAN-based, and diffusion-based approaches. Using a synthetic [...] Read more.

The spatial resolution of spectral imaging systems is fundamentally constrained by hardware trade-offs, and the availability of large-scale annotated spectral datasets remains limited. This study presents a comprehensive evaluation of super-resolution (SR) methods across interpolation-based, CNN-based, GAN-based, and diffusion-based approaches. Using a synthetic 30-band spectral representation reconstructed from RGB with the MST++ model as a proxy ground truth, we arrange non-adjacent triplets as three-channel PNG inputs to ensure compatibility with existing SR architectures. A unified pipeline enables reproducible evaluation at ×2, ×4, and ×8 scales on 50 unseen images, with performance assessed using PSNR, SSIM, and SAM. Results confirm that bicubic interpolation remains a spectrally reliable baseline; shallow CNNs (SRCNN, FSRCNN) generalize well without fine-tuning; and ESRGAN improves spatial detail at the expense of spectral accuracy. Diffusion models (SR3, ResShift, SinSR), evaluated in a zero-shot setting without spectral-domain adaptation, exhibit unstable performance and require spectrum-aware training to preserve spectral structure effectively. The findings underscore a persistent trade-off between perceptual sharpness and spectral fidelity, highlighting the importance of domain-aware objectives when applying generative SR models to spectral data. This work provides reproducible baselines and a flexible evaluation framework to support future research in spectral image restoration. Full article

(This article belongs to the Special Issue Feature Papers in "Sensing and Imaging" Section 2025&2026)

► Show Figures

Figure 1

23 pages, 2069 KB

Open AccessArticle

Early Lung Cancer Detection via AI-Enhanced CT Image Processing Software

by Joel Silos-Sánchez, Jorge A. Ruiz-Vanoye, Francisco R. Trejo-Macotela, Marco A. Márquez-Vera, Ocotlán Diaz-Parra, Josué R. Martínez-Mireles, Miguel A. Ruiz-Jaimes and Marco A. Vera-Jiménez

Diagnostics 2025, 15(21), 2691; https://doi.org/10.3390/diagnostics15212691 - 24 Oct 2025

Cited by 1 | Viewed by 3789

Abstract

Background/Objectives: Lung cancer remains the leading cause of cancer-related mortality worldwide among both men and women. Early and accurate detection is essential to improve patient outcomes. This study explores the use of artificial intelligence (AI)-based software for the diagnosis of lung cancer through [...] Read more.

Background/Objectives: Lung cancer remains the leading cause of cancer-related mortality worldwide among both men and women. Early and accurate detection is essential to improve patient outcomes. This study explores the use of artificial intelligence (AI)-based software for the diagnosis of lung cancer through the analysis of medical images in DICOM format, aiming to enhance image visualization, preprocessing, and diagnostic precision in chest computed tomography (CT) scans. Methods: The proposed system processes DICOM medical images converted to standard formats (JPG or PNG) for preprocessing and analysis. An ensemble of classical machine learning algorithms—including Random Forest, Gradient Boosting, Support Vector Machine, and K-Nearest Neighbors—was implemented to classify pulmonary images and predict the likelihood of malignancy. Image normalization, denoising, segmentation, and feature extraction were performed to improve model reliability and reproducibility. Results: The AI-enhanced system demonstrated substantial improvements in diagnostic accuracy and robustness compared with individual classifiers. The ensemble model achieved a classification accuracy exceeding 90%, highlighting its effectiveness in identifying malignant and non-malignant lung nodules. Conclusions: The findings indicate that AI-assisted CT image processing can significantly contribute to the early detection of lung cancer. The proposed methodology enhances diagnostic confidence, supports clinical decision-making, and represents a viable step toward integrating AI into radiological workflows for early cancer screening. Full article

(This article belongs to the Special Issue Application of Machine Learning in Disease Screening, Diagnosis and Prognosis)

► Show Figures

Figure 1

13 pages, 317 KB

Open AccessArticle

Enhancing JPEG XL’s Weighted Average Predictor: Genetic Algorithm Optimization of Expanded Sub-Predictor Ensemble

by Xavier Hill Roy and Mahmoud R. El-Sakka

Electronics 2025, 14(20), 4116; https://doi.org/10.3390/electronics14204116 - 21 Oct 2025

Viewed by 1064

Abstract

Lossless image compression relies heavily on prediction algorithms to reduce spatial redundancy before entropy coding. The JPEG XL standard employs a weighted average predictor that combines four sub-predictors with adaptive weighting; however, it uses fixed initial scaling factors regardless of the image content. [...] Read more.

Lossless image compression relies heavily on prediction algorithms to reduce spatial redundancy before entropy coding. The JPEG XL standard employs a weighted average predictor that combines four sub-predictors with adaptive weighting; however, it uses fixed initial scaling factors regardless of the image content. This study introduces WOP8 (weighted optimization predictor for 8 sub-predictors), which extends the predictor diversity and optimizes initial weights using a genetic algorithm. Four additional predictors were incorporated—adaptive MED (JPEG-LS), enhanced adaptive median, Paeth (PNG), and GAP-based (CALIC)—forming an eight-predictor ensemble. A genetic algorithm with a population of 30 and 24 generations optimized the weight configurations by minimizing the compressed file size of the training data. Experiments were conducted on the Kodak and Tecnick datasets to evaluate performance and generalizability. The Kodak color dataset showed notable gains: with the weighted average predictor in isolation, WOP8 achieved a 0.24 BPP reduction (2.7% improvement) at high effort levels. Under standard JPEG XL operation mode, improvements were minor but consistent. These results confirm the value of targeted predictor optimization and demonstrate that genetic algorithms can effectively discover dataset-specific weighting patterns, offering a foundation for future component-level enhancements in JPEG XL. Full article

(This article belongs to the Special Issue Artificial Intelligence in Graphics and Images)

► Show Figures

Figure 1

11 pages, 957 KB

Open AccessTechnical Note

vvv2_align_SE, vvv2_align_PE/vvv2_display: Galaxy-Based Workflows and Tool Designed to Perform, Summarize and Visualize Variant Calling and Annotation in Viral Genome Assemblies

by Alexandre Flageul, Edouard Hirchaud, Céline Courtillon, Flora Carnet, Paul Brown, Béatrice Grasland and Fabrice Touzain

Viruses 2025, 17(10), 1385; https://doi.org/10.3390/v17101385 - 17 Oct 2025

Viewed by 857

Abstract

Background: Next-generation sequencing (NGS) analysis of viral samples generates results dispersed across multiple files—genome assembly, variant calling, and functional annotations—making integrated interpretation challenging. Variants often yield numerous low-frequency or non-significant variants, yet only a small fraction are biologically relevant. Virologists must manually [...] Read more.

Background: Next-generation sequencing (NGS) analysis of viral samples generates results dispersed across multiple files—genome assembly, variant calling, and functional annotations—making integrated interpretation challenging. Variants often yield numerous low-frequency or non-significant variants, yet only a small fraction are biologically relevant. Virologists must manually sift through extensive data to identify meaningful mutations, a time-consuming and error-prone process. To address these practical challenges, we developed vvv2_display, a dedicated summarization and visualization tool, integrated within comprehensive Galaxy workflows. Results: vvv2_display streamlines variant interpretation by consolidating key results into two concise and interoperable outputs. The first output is a PNG image showing alignment coverage depth and genomic annotations, with significant variants displayed along the genome as symbols whose height reflects frequency and shape indicates the affected protein. At a glance, this enables virologists to identify all deviations from a reference viral genome. Each significant variant is assigned a unique identifier that directly links to the second output: a tab-separated (TSV) text file listing only high-confidence variants, with frequencies, flanking nucleotides, and impacted genes and proteins. This cross-referenced design supports rapid, accurate, and intuitive data exploration. Availability: vvv2_display is open source, available on Github and installable via Mamba. Full article

(This article belongs to the Section Animal Viruses)

► Show Figures

Figure 1

24 pages, 15799 KB

Open AccessArticle

Performance Comparison of Embedded AI Solutions for Classification and Detection in Lung Disease Diagnosis

by Md Sabbir Ahmed, Stefano Giordano and Davide Adami

Appl. Sci. 2025, 15(17), 9345; https://doi.org/10.3390/app15179345 - 26 Aug 2025

Cited by 1 | Viewed by 2140

Abstract

Lung disease diagnosis from chest X-ray images is a critical task in clinical care, especially in resource-constrained settings where access to radiology expertise and computational infrastructure is limited. Recent advances in deep learning have shown promise, yet most studies focus solely on either [...] Read more.

Lung disease diagnosis from chest X-ray images is a critical task in clinical care, especially in resource-constrained settings where access to radiology expertise and computational infrastructure is limited. Recent advances in deep learning have shown promise, yet most studies focus solely on either classification or detection in isolation, rarely exploring their combined potential in an embedded, real-world setting. To address this, we present a dual deep learning approach that combines five-class disease classification and multi-label thoracic abnormality detection, optimized for embedded edge deployment. Specifically, we evaluate six state-of-the-art CNN architectures—ResNet101, DenseNet201, MobileNetV3-Large, EfficientNetV2-B0, InceptionResNetV2, and Xception—on both base (2020 images) and augmented (9875 images) datasets. Validation accuracies ranged from 55.3% to 70.7% on the base dataset and improved to 58.4% to 72.0% with augmentation, with MobileNetV3-Large achieving the highest accuracy on both. In parallel, we trained a YOLOv8n model for multi-label detection of 14 thoracic diseases. While not deployed in this work, its lightweight architecture makes it suitable for future use on embedded platforms. All classification models were evaluated for end-to-end inference on a Raspberry Pi 4 using a high-resolution chest X-ray image (2566 × 2566, PNG). MobileNetV3-Large demonstrated the fastest latency at 429.6 ms, and all models completed inference in under 2.4 s. These results demonstrate the feasibility of combining classification for rapid triage and detection for spatial interpretability in real-time, embedded clinical environments—paving the way for practical, low-cost AI-based decision support systems for surgery rooms and mobile clinical environments. Full article

► Show Figures

Figure 1

31 pages, 4412 KB

Open AccessArticle

Detection of Trees and Objects in Apple Orchard from LiDAR Point Cloud Data Using a YOLOv5 Framework

by Md Rejaul Karim, Md Nasim Reza, Shahriar Ahmed, Kyu-Ho Lee, Joonjea Sung and Sun-Ok Chung

Electronics 2025, 14(13), 2545; https://doi.org/10.3390/electronics14132545 - 24 Jun 2025

Cited by 2 | Viewed by 2919

Abstract

Object detection is crucial for smart apple orchard management using agricultural machinery to avoid obstacles. The objective of this study was to detect apple trees and other objects in an apple orchard using LiDAR and the YOLOv5 algorithm. A commercial LiDAR was attached [...] Read more.

Object detection is crucial for smart apple orchard management using agricultural machinery to avoid obstacles. The objective of this study was to detect apple trees and other objects in an apple orchard using LiDAR and the YOLOv5 algorithm. A commercial LiDAR was attached to a tripod to collect apple tree trunk data, which were then pre-processed and converted into PNG images. A pre-processed set of 1500 images was manually annotated with bounding boxes and class labels (trees, water tanks, and others) to train and validate the YOLOv5 object detection algorithm. The model, trained over 100 epochs, resulted in 90% precision, 87% recall, mAP@0.5 of 0.89, and mAP@0.5:0.95 of 0.48. The accuracy reached 89% with a low classification loss of 0.001. Class-wise accuracy was high for water tanks (96%) and trees (95%), while the “others” category had lower accuracy (82%) due to inter-class similarity. Accurate object detection is challenging since the apple orchard environment is complex and unstructured. Background misclassifications highlight the need for improved dataset balance, better feature discrimination, and refinement in detecting ambiguous objects. Full article

(This article belongs to the Special Issue Innovations in Intelligent Agriculture: Advanced AI and Robotics for Modern Farming)

► Show Figures

Figure 1

15 pages, 29428 KB

Open AccessArticle

Color as a High-Value Quantitative Tool for PET/CT Imaging

by Michail Marinis, Sofia Chatziioannou and Maria Kallergi

Information 2025, 16(5), 352; https://doi.org/10.3390/info16050352 - 27 Apr 2025

Cited by 1 | Viewed by 2714

Abstract

The successful application of artificial intelligence (AI) techniques for the quantitative analysis of hybrid medical imaging data such as PET/CT is challenged by the differences in the type of information and image quality between the two modalities. The purpose of this work was [...] Read more.

The successful application of artificial intelligence (AI) techniques for the quantitative analysis of hybrid medical imaging data such as PET/CT is challenged by the differences in the type of information and image quality between the two modalities. The purpose of this work was to develop color-based, pre-processing methodologies for PET/CT data that could yield a better starting point for subsequent diagnosis and image processing and analysis. Two methods are proposed that are based on the encoding of Hounsfield Units (HU) and Standardized Uptake Values (SUVs) in separate transformed .png files as reversible color information in combination with .png basic information metadata based on DICOM attributes. Linux Ubuntu using Python was used for the implementation and pilot testing of the proposed methodologies on brain ¹⁸F-FDG PET/CT scans acquired with different PET/CT systems. The range of HUs and SUVs was mapped using novel weighted color distribution functions that allowed for a balanced representation of the data and an improved visualization of anatomic and metabolic differences. The pilot application of the proposed mapping codes yielded CT and PET images where it was easier to pinpoint variations in anatomy and metabolic activity and offered a potentially better starting point for the subsequent fully automated quantitative analysis of specific regions of interest or observer evaluation. It should be noted that the output .png files contained all the raw values and may be treated as raw DICOM input data. Full article

(This article belongs to the Special Issue Deep Learning in Medical Image Analysis: Foundations, Techniques, and Applications)

► Show Figures

Figure 1

22 pages, 27292 KB

Open AccessArticle

Adversarial Robustness for Deep Learning-Based Wildfire Prediction Models

by Ryo Ide and Lei Yang

Fire 2025, 8(2), 50; https://doi.org/10.3390/fire8020050 - 26 Jan 2025

Cited by 2 | Viewed by 2949

Abstract

Rapidly growing wildfires have recently devastated societal assets, exposing a critical need for early warning systems to expedite relief efforts. Smoke detection using camera-based Deep Neural Networks (DNNs) offers a promising solution for wildfire prediction. However, the rarity of smoke across time and [...] Read more.

Rapidly growing wildfires have recently devastated societal assets, exposing a critical need for early warning systems to expedite relief efforts. Smoke detection using camera-based Deep Neural Networks (DNNs) offers a promising solution for wildfire prediction. However, the rarity of smoke across time and space limits training data, raising model overfitting and bias concerns. Current DNNs, primarily Convolutional Neural Networks (CNNs) and transformers, complicate robustness evaluation due to architectural differences. To address these challenges, we introduce WARP (Wildfire Adversarial Robustness Procedure), the first model-agnostic framework for evaluating wildfire detection models’ adversarial robustness. WARP addresses inherent limitations in data diversity by generating adversarial examples through image-global and -local perturbations. Global and local attacks superimpose Gaussian noise and PNG patches onto image inputs, respectively; this suits both CNNs and transformers while generating realistic adversarial scenarios. Using WARP, we assessed real-time CNNs and Transformers, uncovering key vulnerabilities. At times, transformers exhibited over 70% precision degradation under global attacks, while both models generally struggled to differentiate cloud-like PNG patches from real smoke during local attacks. To enhance model robustness, we proposed four wildfire-oriented data augmentation techniques based on WARP’s methodology and results, which diversify smoke image data and improve model precision and robustness. These advancements represent a substantial step toward developing a reliable early wildfire warning system, which may be our first safeguard against wildfire destruction. Full article

(This article belongs to the Special Issue Advanced Approaches to Wildfire Detection, Monitoring and Surveillance)

► Show Figures

Figure 1

20 pages, 7065 KB

Open AccessArticle

Laser Scan Compression for Rail Inspection

by Jeremiasz Hauck and Piotr Gniado

Sensors 2024, 24(20), 6722; https://doi.org/10.3390/s24206722 - 19 Oct 2024

Cited by 4 | Viewed by 3104

Abstract

The automation of rail track inspection addresses key issues in railway transportation, notably reducing maintenance costs and improving safety. However, it presents numerous technical challenges, including sensor selection, calibration, data acquisition, defect detection, and storage. This paper introduces a compression method tailored for [...] Read more.

The automation of rail track inspection addresses key issues in railway transportation, notably reducing maintenance costs and improving safety. However, it presents numerous technical challenges, including sensor selection, calibration, data acquisition, defect detection, and storage. This paper introduces a compression method tailored for laser triangulation scanners, which are crucial for scanning the entire rail track, including the rails, rail fasteners, sleepers, and ballast, and capturing rail profiles for geometry measurement. The compression technique capitalizes on the regularity of rail track data and the sensors’ limited measurement range and resolution. By transforming scans, they can be stored using widely available image compression formats, such as PNG. This method achieved a compression ratio of 7.5 for rail scans used in the rail geometry computation and maintained rail gauge reproducibility. For the scans employed in defect detection, a compression ratio of 5.6 was attained without visibly compromising the scan quality. Lossless compression resulted in compression ratios of 5.1 for the rail geometry computation scans and 3.8 for the rail track inspection scans. Full article

(This article belongs to the Collection 3D Imaging and Sensing System)

► Show Figures

Figure 1

10 pages, 13509 KB

Open AccessData Descriptor

Dataset of Registered Hematoxylin–Eosin and Ki67 Histopathological Image Pairs Complemented by a Registration Algorithm

by Dominika Petríková, Ivan Cimrák, Katarína Tobiášová and Lukáš Plank

Data 2024, 9(8), 100; https://doi.org/10.3390/data9080100 - 7 Aug 2024

Cited by 5 | Viewed by 5810

Abstract

In this work, we describe a dataset suitable for analyzing the extent to which hematoxylin–eosin (HE)-stained tissue contains information about the expression of Ki67 in immunohistochemistry staining. The dataset provides images of corresponding pairs of HE and Ki67 stainings and is complemented by [...] Read more.

In this work, we describe a dataset suitable for analyzing the extent to which hematoxylin–eosin (HE)-stained tissue contains information about the expression of Ki67 in immunohistochemistry staining. The dataset provides images of corresponding pairs of HE and Ki67 stainings and is complemented by algorithms for computing the Ki67 index. We introduce a dataset of high-resolution histological images of testicular seminoma tissue. The dataset comprises digitized histology slides from 77 conventional testicular seminoma patients, obtained via surgical resection. For each patient, two physically adjacent tissue sections are stained: one with hematoxylin and eosin, and one with Ki67 immunohistochemistry staining. This results in a total of 154 high-resolution images. The images are provided in PNG format, facilitating ease of use for image analysis compared to the original scanner output formats. Each image contains enough tissue to generate thousands of non-overlapping 224 × 224 pixel patches. This shows the potential to generate more than 50,000 pairs of patches, one with HE staining and a corresponding Ki67 patch that depicts a very similar part of the tissue. Finally, we present the results of applying a ResNet neural network for the classification of HE patches into categories according to their Ki67 label. Full article

(This article belongs to the Section Computational Biology, Bioinformatics, and Biomedical Data Science)

► Show Figures

Figure 1

16 pages, 12863 KB

Open AccessArticle

Research on Multi-Step Fruit Color Prediction Model of Tomato in Solar Greenhouse Based on Time Series Data

by Shufeng Liu, Hongrui Yuan, Yanping Zhao, Tianhua Li, Linlu Zu and Siyuan Chang

Agriculture 2024, 14(8), 1211; https://doi.org/10.3390/agriculture14081211 - 24 Jul 2024

Cited by 3 | Viewed by 2638

Abstract

Color change is the most obvious characteristic of the tomato ripening stage and an important indicator of the tomato ripening condition, which directly affects the commodity value of tomato. To visualize the color change of tomato fruit during the mature stage, this paper [...] Read more.

Color change is the most obvious characteristic of the tomato ripening stage and an important indicator of the tomato ripening condition, which directly affects the commodity value of tomato. To visualize the color change of tomato fruit during the mature stage, this paper proposes a gated recurrent unit network with an encoder–decoder structure. This structure dynamically simulates the growth and development of tomatoes using time-dependent lines, incorporating real-time information such as tomato color and shape. Firstly, the .json file was converted into a mask.png file, the tomato mask was extracted, and the tomato was separated from the complex background environment, thus successfully constructing the tomato growth and development dataset. The experimental results showed that for the gated recurrent unit network with the encoder–decoder structure proposed, when the hidden layer number was 1 and hidden layer number was 512, a high consistency and similarity between the model predicted image sequence and the actual growth and development image sequence was realized, and the structural similarity index measure was 0.746. It was proved that when the average temperature was 24.93 °C, the average soil temperature was 24.06 °C, and the average light intensity was 11.26 Klux, the environment was the most suitable for tomato growth. The environmental data-driven tomato growth model was constructed to explore the growth status of tomato under different environmental conditions, and thus, to understand the growth status of tomato in time. This study provides a theoretical foundation for determining the optimal greenhouse environmental conditions to achieve tomato maturity and it offers recommendations for investigating the growth cycle of tomatoes, as well as technical assistance for standardized cultivation in solar greenhouses. Full article

(This article belongs to the Special Issue Machine Vision Solutions and AI-Driven Systems in Agriculture)

► Show Figures

Figure 1

Search Results (43)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (43)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI