Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (132)

Search Parameters:
Keywords = resource acquisition features

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
27 pages, 9151 KB  
Article
A Dynamic Digital Twin Framework for Sustainable Facility Management in a Smart Campus: A Case Study of Chiang Mai University
by Sattaya Manokeaw, Pattaraporn Khuwuthyakorn, Ying-Chieh Chan, Naruephorn Tengtrairat, Manissaward Jintapitak, Orawit Thinnukool, Chinnapat Buachart, Thepparit Sinthamrongruk, Thidarat Kridakorn Na Ayutthaya, Natee Suriyanon, Somjintana Kanangkaew and Damrongsak Rinchumphu
Technologies 2025, 13(10), 439; https://doi.org/10.3390/technologies13100439 - 30 Sep 2025
Viewed by 926
Abstract
This study presents the development and deployment of a modular digital twin system designed to enhance sustainable facility management within a smart campus context. The system was implemented at the Faculty of Engineering, Chiang Mai University, and integrates 3D spatial modeling, real-time environmental [...] Read more.
This study presents the development and deployment of a modular digital twin system designed to enhance sustainable facility management within a smart campus context. The system was implemented at the Faculty of Engineering, Chiang Mai University, and integrates 3D spatial modeling, real-time environmental and energy sensor data, and multiscale dashboard visualization. Grounded in stakeholder-driven requirements, the platform emphasizes energy management, which is the top priority among campus administrators and technicians. The development process followed a four-phase methodology: (1) stakeholder consultation and requirement analysis; (2) physical data acquisition and 3D model generation; (3) sensor deployment using IoT technologies with NB-IoT and LoRaWAN protocols; and (4) real-time data integration via Firebase and standardized APIs. A suite of dashboards was developed to support interactive monitoring across faculty, building, floor, and room levels. System testing with campus users demonstrated high usability, intuitive spatial navigation, and actionable insights for energy consumption analysis. Feedback indicated strong interest in features supporting data export and predictive analytics. The platform’s modular and hardware-agnostic architecture enables future extensions, including occupancy tracking, water monitoring, and automated control systems. Overall, the digital twin system offers a replicable and scalable model for data-driven facility management aligned with sustainability goals. Its real-time, multiscale capabilities contribute to operational transparency, resource optimization, and climate-responsive campus governance, setting the foundation for broader applications in smart cities and built environment innovation. Full article
Show Figures

Figure 1

22 pages, 4596 KB  
Article
Image Super-Resolution Reconstruction Network Based on Structural Reparameterization and Feature Reuse
by Tianyu Li, Xiaoshi Jin, Qiang Liu and Xi Liu
Sensors 2025, 25(19), 5989; https://doi.org/10.3390/s25195989 - 27 Sep 2025
Viewed by 447
Abstract
In the task of integrated circuit micrograph acquisition, image super-resolution reconstruction technology can significantly enhance acquisition efficiency. With the advancement of deep learning techniques, the performance of image super-resolution reconstruction networks has improved markedly, but their demand for inference device memory has also [...] Read more.
In the task of integrated circuit micrograph acquisition, image super-resolution reconstruction technology can significantly enhance acquisition efficiency. With the advancement of deep learning techniques, the performance of image super-resolution reconstruction networks has improved markedly, but their demand for inference device memory has also increased substantially, greatly limiting their practical application in engineering and deployment on resource-constrained devices. Against this backdrop, we designed image super-resolution reconstruction networks based on feature reuse and structural reparameterization techniques, ensuring that the networks maintain reconstruction performance while being more suitable for deployment in resource-limited environments. Traditional image super-resolution reconstruction networks often redundantly compute similar features through standard convolution operations, leading to significant computational resource wastage. By employing low-cost operations, we replaced some redundant features with those generated from the inherent characteristics of the image and designed a reparameterization layer using structural reparameterization techniques. Building upon local feature fusion and local residual learning, we developed two efficient deep feature extraction modules, and forming the image super-resolution reconstruction networks. Compared to performance-oriented image super-resolution reconstruction networks (e.g., DRCT), our network reduces algorithm parameters by 84.5% and shortens inference time by 49.8%. In comparison with lightweight image reconstruction algorithms, our method improves the mean structural similarity index by 3.24%. Experimental results demonstrate that the image super-resolution reconstruction network based on feature reuse and structural reparameterization achieves an excellent balance between network performance and complexity. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

25 pages, 16306 KB  
Article
Mining Prediction Based on the Coupling of Structural-Alteration Anomalies in the Tsagaankhairkhan Copper–Gold Mine in Mongolia Through the Collaboration of Multi-Source Remote Sensing Data
by Jie Lv, Lei Zi, Chengzhuo Lu, Jingya Tong, He Chang, Wei Li and Wenbing Li
Minerals 2025, 15(10), 1005; https://doi.org/10.3390/min15101005 - 23 Sep 2025
Viewed by 328
Abstract
Against the backdrop of the continuous growth in global demand for mineral resources, efficient and accurate mineral exploration technologies are of paramount importance. Therefore, utilizing remote sensing technology, which features wide coverage, a non-contact nature, and multi-source data acquisition, is of great significance [...] Read more.
Against the backdrop of the continuous growth in global demand for mineral resources, efficient and accurate mineral exploration technologies are of paramount importance. Therefore, utilizing remote sensing technology, which features wide coverage, a non-contact nature, and multi-source data acquisition, is of great significance for conducting mineral resource exploration and prospecting research. This study focuses on the Tsagaankhairkhan copper–gold mining area in Mongolia and proposes a structural-alteration anomalies coupling mining prediction method based on the collaboration of multi-source remote sensing data. By comprehensively utilizing multi-source image data from Landsat-8, GF-2, and Sentinel-2, and employing methods such as principal component analysis (PCA), band ratio, and texture analysis, we effectively extracted structural information closely related to mineralization, as well as alteration anomaly information, including hydroxyl alteration anomalies and iron-staining alteration anomalies. Landsat-8 and Sentinel-2 data were employed to extract and mutually validate hydroxyl and iron-staining alteration anomaly information in the study area, thereby delineating alteration anomaly zones. By integrating the results of structural interpretation, the distribution of alteration anomaly information, and their spatial coupling characteristics, we explored the spatial coupling relationship between structural and alteration anomalies, analyzed their mineral control patterns, and identified 7 prospecting target areas. These target areas exhibit abundant mineral anomalies and favorable structural settings, indicating high metallogenic potential. The research findings provide crucial clues for the exploration of the Tsagaankhairkhan copper–gold mine in Mongolia and can guide future mineral exploration and development efforts. Full article
Show Figures

Figure 1

20 pages, 3989 KB  
Article
A2DSC-Net: A Network Based on Multi-Branch Dilated and Dynamic Snake Convolutions for Water Body Extraction
by Shuai Zhang, Chao Zhang, Qichao Zhao, Junjie Ma and Pengpeng Zhang
Water 2025, 17(18), 2760; https://doi.org/10.3390/w17182760 - 18 Sep 2025
Viewed by 346
Abstract
The accurate and efficient acquisition of the spatiotemporal distribution of surface water is of vital importance for water resource utilization, flood monitoring, and environmental protection. However, deep learning models often suffer from two major limitations when applied to high-resolution remote sensing imagery: the [...] Read more.
The accurate and efficient acquisition of the spatiotemporal distribution of surface water is of vital importance for water resource utilization, flood monitoring, and environmental protection. However, deep learning models often suffer from two major limitations when applied to high-resolution remote sensing imagery: the loss of small water body features due to encoder scale differences, and reduced boundary accuracy for narrow water bodies in complex backgrounds. To address these challenges, we introduce the A2DSC-Net, which offers two key innovations. First, a multi-branch dilated convolution (MBDC) module is designed to capture contextual information across multiple spatial scales, thereby enhancing the recognition of small water bodies. Second, a Dynamic Snake Convolution module is introduced to adaptively extract local features and integrate global spatial cues, significantly improving the delineation accuracy of narrow water bodies under complex background conditions. Ablation and comparative experiments were conducted under identical settings using the LandCover.ai and Gaofen Image Dataset (GID). The results show that A2DSC-Net achieves an average precision of 96.34%, average recall of 96.19%, average IoU of 92.8%, and average F1-score of 96.26%, outperforming classical segmentation models such as U-Net, DeepLabv3+, DANet, and PSPNet. These findings demonstrate that A2DSC-Net provides an effective and reliable solution for water body extraction from high-resolution remote sensing imagery. Full article
Show Figures

Figure 1

16 pages, 2720 KB  
Article
Multi-Trait Phenotypic Extraction and Fresh Weight Estimation of Greenhouse Lettuce Based on Inspection Robot
by Xiaodong Zhang, Xiangyu Han, Yixue Zhang, Lian Hu and Tiezhu Li
Agriculture 2025, 15(18), 1929; https://doi.org/10.3390/agriculture15181929 - 11 Sep 2025
Viewed by 461
Abstract
In situ detection of growth information in greenhouse crops is crucial for germplasm resource optimization and intelligent greenhouse management. To address the limitations of poor flexibility and low automation in traditional phenotyping platforms, this study developed a controlled environment inspection robot. By means [...] Read more.
In situ detection of growth information in greenhouse crops is crucial for germplasm resource optimization and intelligent greenhouse management. To address the limitations of poor flexibility and low automation in traditional phenotyping platforms, this study developed a controlled environment inspection robot. By means of a SCARA robotic arm equipped with an information acquisition device consisting of an RGB camera, a depth camera, and an infrared thermal imager, high-throughput and in situ acquisition of lettuce phenotypic information can be achieved. Through semantic segmentation and point cloud reconstruction, 12 phenotypic parameters, such as lettuce plant height and crown width, were extracted from the acquired images as inputs for three machine learning models to predict fresh weight. By analyzing the training results, a Backpropagation Neural Network (BPNN) with an added feature dimension-increasing module (DE-BP) was proposed, achieving improved prediction accuracy. The R2 values for plant height, crown width, and fresh weight predictions were 0.85, 0.93, and 0.84, respectively, with RMSE values of 7 mm, 6 mm, and 8 g, respectively. This study achieved in situ, high-throughput acquisition of lettuce phenotypic information under controlled environmental conditions, providing a lightweight solution for crop phenotypic information analysis algorithms tailored for inspection tasks. Full article
Show Figures

Figure 1

19 pages, 2646 KB  
Article
A Comprehensive Study of MCS-TCL: Multi-Functional Sampling for Trustworthy Compressive Learning
by Fuma Kimishima, Jian Yang and Jinjia Zhou
Information 2025, 16(9), 777; https://doi.org/10.3390/info16090777 - 7 Sep 2025
Viewed by 356
Abstract
Compressive Learning (CL) is an emerging paradigm that allows machine learning models to perform inference directly from compressed measurements, significantly reducing sensing and computational costs. While existing CL approaches have achieved competitive accuracy compared to traditional image-domain methods, they typically rely on reconstruction [...] Read more.
Compressive Learning (CL) is an emerging paradigm that allows machine learning models to perform inference directly from compressed measurements, significantly reducing sensing and computational costs. While existing CL approaches have achieved competitive accuracy compared to traditional image-domain methods, they typically rely on reconstruction to address information loss and often neglect uncertainty arising from ambiguous or insufficient data. In this work, we propose MCS-TCL, a novel and trustworthy CL framework based on Multi-functional Compressive Sensing Sampling. Our approach unifies sampling, compression, and feature extraction into a single operation by leveraging the compatibility between compressive sensing and convolutional feature learning. This joint design enables efficient signal acquisition while preserving discriminative information, leading to feature representations that remain robust across varying sampling ratios. To enhance the model’s reliability, we incorporate evidential deep learning (EDL) during training. EDL estimates the distribution of evidence over output classes, enabling the model to quantify predictive uncertainty and assign higher confidence to well-supported predictions. Extensive experiments on image classification tasks show that MCS-TCL outperforms existing CL methods, achieving state-of-the-art accuracy at a low sampling rate of 6%. Additionally, our framework reduces model size by 85.76% while providing meaningful uncertainty estimates, demonstrating its effectiveness in resource-constrained learning scenarios. Full article
(This article belongs to the Special Issue AI-Based Image Processing and Computer Vision)
Show Figures

Figure 1

27 pages, 6315 KB  
Article
A Method for the Extraction of Apocynum venetum L. Spatial Distribution in Yuli County, Xinjiang, via an Improved SegFormer Network
by Yixuan Wang, Hong Wang and Xinhui Wang
Remote Sens. 2025, 17(17), 3039; https://doi.org/10.3390/rs17173039 - 1 Sep 2025
Viewed by 847
Abstract
Efficient and accurate acquisition of spatial distribution information for Apocynum venetum L. is highly important for the sustainable development of agriculture in Yuli County, Xinjiang. As an important cash crop, Apocynum relies on specific natural conditions for growth, and its survival environment is [...] Read more.
Efficient and accurate acquisition of spatial distribution information for Apocynum venetum L. is highly important for the sustainable development of agriculture in Yuli County, Xinjiang. As an important cash crop, Apocynum relies on specific natural conditions for growth, and its survival environment is currently under severe threat. Therefore, accurately quantifying its spatial distribution information is crucial. This research takes Yuli County in Xinjiang as the study area and proposes an enhanced SegFormer model based on deep learning, aiming to realize the effective identification and extraction of Apocynum. The study indicates the following. (1) The improved SegFormer model adds smaller-scale feature layers in the encoder stage, allowing the improved model’s encoder to extract features at five scales: 1/4, 1/8, 1/16, 1/32, and 1/64; meanwhile, integrating the T2T-ViT backbone network into the encoder significantly enhances the precision and efficiency of Apocynum’s spatial distribution extraction. (2) Compared with Unet, TransUNet, and the original SegFormer, the improved SegFormer model outperforms the other models in terms of the mIoU, OA, and mPA metrics, achieving values of 88.22%, 93.98%, and 89.66%, respectively. (3) Ablation experiments show that the T2T_vit_14 model performs best among all the T2T-ViT configurations, with superior extraction effects on fragmented small plots compared with the other models. Therefore, the T2T_vit_14 model is integrated into the SegFormer model. This work improves the extraction accuracy and efficiency of the spatial distribution of Apocynum via an improved SegFormer model, which has strong stability and robustness and offers scientific evidence for resource protection, restoration planting, and germplasm breeding in Yuli County, Xinjiang. Full article
Show Figures

Graphical abstract

30 pages, 1566 KB  
Article
AHN-BudgetNet: Cost-Aware Multimodal Feature-Acquisition Architecture for Parkinson’s Disease Monitoring
by Moad Hani, Saïd Mahmoudi and Mohammed Benjelloun
Electronics 2025, 14(17), 3502; https://doi.org/10.3390/electronics14173502 - 1 Sep 2025
Viewed by 537
Abstract
Optimizing healthcare resources in neurodegenerative diseases requires balancing diagnostic performance with cost constraints. We introduce AHN-BudgetNet—a tiered, cost-aware assessment framework for Parkinson’s disease motor severity prediction—evaluated on 1387 simulated PPMI subjects via patient-level GroupKFold validation. Our analysis tested seven tier combinations encompassing demographic, [...] Read more.
Optimizing healthcare resources in neurodegenerative diseases requires balancing diagnostic performance with cost constraints. We introduce AHN-BudgetNet—a tiered, cost-aware assessment framework for Parkinson’s disease motor severity prediction—evaluated on 1387 simulated PPMI subjects via patient-level GroupKFold validation. Our analysis tested seven tier combinations encompassing demographic, self-reported, and clinical features. The baseline (T0) yields AUC = 0.65 (95% CI [0.629, 0.681]) at no cost. Self-assessments (T1) alone achieved an AUC = 0.69 (95% CI [0.643, 0.733]) at USD 75, with an efficiency of 1.07. The combined T0 + T1 set reached AUC = 0.75 (95% CI [0.729, 0.772]) at USD 75, with efficiency 1.43. T2 alone obtained AUC = 0.53 (95% CI [0.517, 0.542]) at USD 300 and efficiency 0.07. The full T0 + T1 + T2 set achieved the highest performance—AUC = 0.76 (95% CI [0.735, 0.774])—at USD 375, with efficiency 0.54, reflecting diminishing returns beyond T1. High-cost tiers (T3/T4) could not be empirically validated due to over 88% missing data, emphasizing the value of accessible assessments. Gaussian Mixture on Tier 0 features yielded a silhouette score of 0.54, compared to 0.53 for K-means, confirming that patient-reported outcomes can support clinical stratification. Our results underpin evidence-based resource allocation: budgets USD ≤ 75 prioritize T1, while budgets USD ≤ 375 justify a comprehensive assessment. This confirms that structured tier prioritization supports robust, resource-efficient diagnosis in resource-limited clinical environments. Full article
(This article belongs to the Special Issue Artificial Intelligence Methods for Biomedical Data Processing)
Show Figures

Figure 1

18 pages, 2565 KB  
Article
Rock Joint Segmentation in Drill Core Images via a Boundary-Aware Token-Mixing Network
by Seungjoo Lee, Yongjin Kim, Yongseong Kim, Jongseol Park and Bongjun Ji
Buildings 2025, 15(17), 3022; https://doi.org/10.3390/buildings15173022 - 25 Aug 2025
Viewed by 507
Abstract
The precise mapping of rock joint traces is fundamental to the design and safety assessment of foundations, retaining structures, and underground cavities in building and civil engineering. Existing deep learning approaches either impose prohibitive computational demands for on-site deployment or disrupt the topological [...] Read more.
The precise mapping of rock joint traces is fundamental to the design and safety assessment of foundations, retaining structures, and underground cavities in building and civil engineering. Existing deep learning approaches either impose prohibitive computational demands for on-site deployment or disrupt the topological continuity of subpixel lineaments that govern rock mass behavior. This study presents BATNet-Lite, a lightweight encoder–decoder architecture optimized for joint segmentation on resource-constrained devices. The encoder introduces a Boundary-Aware Token-Mixing (BATM) block that separates feature maps into patch tokens and directionally pooled stripe tokens, and a bidirectional attention mechanism subsequently transfers global context to local descriptors while refining stripe features, thereby capturing long-range connectivity with negligible overhead. A complementary Multi-Scale Line Enhancement (MLE) module combines depth-wise dilated and deformable convolutions to yield scale-invariant responses to joints of varying apertures. In the decoder, a Skeletal-Contrastive Decoder (SCD) employs dual heads to predict segmentation and skeleton maps simultaneously, while an InfoNCE-based contrastive loss enforces their topological consistency without requiring explicit skeleton labels. Training leverages a composite focal Tversky and edge IoU loss under a curriculum-thinning schedule, improving edge adherence and continuity. Ablation experiments confirm that BATM, MLE, and SCD each contribute substantial gains in boundary accuracy and connectivity preservation. By delivering topology-preserving joint maps with small parameters, BATNet-Lite facilitates rapid geological data acquisition for tunnel face mapping, slope inspection, and subsurface digital twin development, thereby supporting safer and more efficient building and underground engineering practice. Full article
Show Figures

Figure 1

18 pages, 2028 KB  
Article
Research on Single-Tree Segmentation Method for Forest 3D Reconstruction Point Cloud Based on Attention Mechanism
by Lishuo Huo, Zhao Chen, Lingnan Dai, Dianchang Wang and Xinrong Zhao
Forests 2025, 16(7), 1192; https://doi.org/10.3390/f16071192 - 19 Jul 2025
Viewed by 560
Abstract
The segmentation of individual trees holds considerable significance in the investigation and management of forest resources. Utilizing smartphone-captured imagery combined with image-based 3D reconstruction techniques to generate corresponding point cloud data can serve as a more accessible and potentially cost-efficient alternative for data [...] Read more.
The segmentation of individual trees holds considerable significance in the investigation and management of forest resources. Utilizing smartphone-captured imagery combined with image-based 3D reconstruction techniques to generate corresponding point cloud data can serve as a more accessible and potentially cost-efficient alternative for data acquisition compared to conventional LiDAR methods. In this study, we present a Sparse 3D U-Net framework for single-tree segmentation which is predicated on a multi-head attention mechanism. The mechanism functions by projecting the input data into multiple subspaces—referred to as “heads”—followed by independent attention computation within each subspace. Subsequently, the outputs are aggregated to form a comprehensive representation. As a result, multi-head attention facilitates the model’s ability to capture diverse contextual information, thereby enhancing performance across a wide range of applications. This framework enables efficient, intelligent, and end-to-end instance segmentation of forest point cloud data through the integration of multi-scale features and global contextual information. The introduction of an iterative mechanism at the attention layer allows the model to learn more compact feature representations, thereby significantly enhancing its convergence speed. In this study, Dongsheng Bajia Country Park and Jiufeng National Forest Park, situated in Haidian District, Beijing, China, were selected as the designated test sites. Eight representative sample plots within these areas were systematically sampled. Forest stand sequential photographs were captured using an iPhone, and these images were processed to generate corresponding point cloud data for the respective sample plots. This methodology was employed to comprehensively assess the model’s capability for single-tree segmentation. Furthermore, the generalization performance of the proposed model was validated using the publicly available dataset TreeLearn. The model’s advantages were demonstrated across multiple aspects, including data processing efficiency, training robustness, and single-tree segmentation speed. The proposed method achieved an F1 score of 91.58% on the customized dataset. On the TreeLearn dataset, the method attained an F1 score of 97.12%. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
Show Figures

Figure 1

20 pages, 2409 KB  
Article
Spatio-Temporal Deep Learning with Adaptive Attention for EEG and sEMG Decoding in Human–Machine Interaction
by Tianhao Fu, Zhiyong Zhou and Wenyu Yuan
Electronics 2025, 14(13), 2670; https://doi.org/10.3390/electronics14132670 - 1 Jul 2025
Viewed by 886
Abstract
Electroencephalography (EEG) and surface electromyography (sEMG) signals are widely used in human–machine interaction (HMI) systems due to their non-invasive acquisition and real-time responsiveness, particularly in neurorehabilitation and prosthetic control. However, existing deep learning approaches often struggle to capture both fine-grained local patterns and [...] Read more.
Electroencephalography (EEG) and surface electromyography (sEMG) signals are widely used in human–machine interaction (HMI) systems due to their non-invasive acquisition and real-time responsiveness, particularly in neurorehabilitation and prosthetic control. However, existing deep learning approaches often struggle to capture both fine-grained local patterns and long-range spatio-temporal dependencies within these signals, which limits classification performance. To address these challenges, we propose a lightweight deep learning framework that integrates adaptive spatial attention with multi-scale temporal feature extraction for end-to-end EEG and sEMG signal decoding. The architecture includes two core components: (1) an adaptive attention mechanism that dynamically reweights multi-channel time-series features based on spatial relevance, and (2) a multi-scale convolutional module that captures diverse temporal patterns through parallel convolutional filters. The proposed method achieves classification accuracies of 79.47% on the BCI-IV 2a EEG dataset (9 subjects, 22 channels) for motor intent decoding and 85.87% on the NinaPro DB2 sEMG dataset (40 subjects, 12 channels) for gesture recognition. Ablation studies confirm the effectiveness of each module, while comparative evaluations demonstrate that the proposed framework outperforms existing state-of-the-art methods across all tested scenarios. Together, these results demonstrate that our model not only achieves strong performance but also maintains a lightweight and resource-efficient design for EEG and sEMG decoding. Full article
Show Figures

Figure 1

24 pages, 7043 KB  
Article
Machine Learning-Based Detection of Archeological Sites Using Satellite and Meteorological Data: A Case Study of Funnel Beaker Culture Tombs in Poland
by Krystian Kozioł, Natalia Borowiec, Urszula Marmol, Mateusz Rzeszutek, Celso Augusto Guimarães Santos and Jerzy Czerniec
Remote Sens. 2025, 17(13), 2225; https://doi.org/10.3390/rs17132225 - 28 Jun 2025
Viewed by 942
Abstract
The detection of archeological sites in satellite imagery is often hindered by environmental constraints such as vegetation cover and variability in meteorological conditions, which affect the visibility of subsurface structures. This study aimed to develop predictive models for assessing archeological site visibility in [...] Read more.
The detection of archeological sites in satellite imagery is often hindered by environmental constraints such as vegetation cover and variability in meteorological conditions, which affect the visibility of subsurface structures. This study aimed to develop predictive models for assessing archeological site visibility in satellite imagery by integrating vegetation indices and meteorological data using machine learning techniques. The research focused on megalithic tombs associated with the Funnel Beaker culture in Poland. The primary objective was to create models capable of detecting archeological features under varying environmental conditions, thereby enhancing the efficiency of field surveys and reducing associated costs. To this end, a combination of vegetation indices and meteorological parameters was employed. Key indices—including the Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), Normalized Difference Moisture Index (NDMI), and Normalized Archeological Index (NAI)—were analyzed alongside meteorological variables such as wind speed, temperature, humidity, and total precipitation. By integrating these datasets, the study evaluated how environmental conditions influence the visibility of archeological sites in satellite imagery. The machine learning models, including logistic regression and decision tree-based algorithms, demonstrated strong potential for predicting site visibility. The highest predictive accuracy was achieved during periods of high soil moisture variability and fluctuating weather conditions. These findings enabled the development of visibility prediction maps, guiding the optimal timing of aerial surveys and minimizing the risk of unsuccessful data acquisition. The results underscore the effectiveness of integrating meteorological data with satellite imagery in archeological research. The proposed approach not only improves site detection but also reduces operational costs by concentrating resources on optimal survey conditions. Furthermore, the methodology is applicable to diverse archeological contexts, enhancing the capacity to locate and document heritage sites across varying environmental settings. Full article
(This article belongs to the Section AI Remote Sensing)
Show Figures

Figure 1

14 pages, 1438 KB  
Article
CDBA-GAN: A Conditional Dual-Branch Attention Generative Adversarial Network for Robust Sonar Image Generation
by Wanzeng Kong, Han Yang, Mingyang Jia and Zhe Chen
Appl. Sci. 2025, 15(13), 7212; https://doi.org/10.3390/app15137212 - 26 Jun 2025
Viewed by 531
Abstract
The acquisition of real-world sonar data necessitates substantial investments of manpower, material resources, and financial capital, rendering it challenging to obtain sufficient authentic samples for sonar-related research tasks. Consequently, sonar image simulation technology has become increasingly vital in the field of sonar data [...] Read more.
The acquisition of real-world sonar data necessitates substantial investments of manpower, material resources, and financial capital, rendering it challenging to obtain sufficient authentic samples for sonar-related research tasks. Consequently, sonar image simulation technology has become increasingly vital in the field of sonar data analysis. Traditional sonar simulation methods predominantly focus on low-level physical modeling, which often suffers from limited image controllability and diminished fidelity in multi-category and multi-background scenarios. To address these limitations, this paper proposes a Conditional Dual-Branch Attention Generative Adversarial Network (CDBA-GAN). The framework comprises three key innovations: The conditional information fusion module, dual-branch attention feature fusion mechanism, and cross-layer feature reuse. By integrating encoded conditional information with the original input data of the generative adversarial network, the fusion module enables precise control over the generation of sonar images under specific conditions. A hierarchical attention mechanism is implemented, sequentially performing channel-level and pixel-level attention operations. This establishes distinct weight matrices at both granularities, thereby enhancing the correlation between corresponding elements. The dual-branch attention features are fused via a skip-connection architecture, facilitating efficient feature reuse across network layers. The experimental results demonstrate that the proposed CDBA-GAN generates condition-specific sonar images with a significantly lower Fréchet inception distance (FID) compared to existing methods. Notably, the framework exhibits robust imaging performance under noisy interference and outperforms state-of-the-art models (e.g., DCGAN, WGAN, SAGAN) in fidelity across four categorical conditions, as quantified by FID metrics. Full article
Show Figures

Figure 1

29 pages, 3799 KB  
Article
Forest Three-Dimensional Reconstruction Method Based on High-Resolution Remote Sensing Image Using Tree Crown Segmentation and Individual Tree Parameter Extraction Model
by Guangsen Ma, Gang Yang, Hao Lu and Xue Zhang
Remote Sens. 2025, 17(13), 2179; https://doi.org/10.3390/rs17132179 - 25 Jun 2025
Viewed by 843
Abstract
Efficient and accurate acquisition of tree distribution and three-dimensional geometric information in forest scenes, along with three-dimensional reconstructions of entire forest environments, hold significant application value in precision forestry and forestry digital twins. However, due to complex vegetation structures, fine geometric details, and [...] Read more.
Efficient and accurate acquisition of tree distribution and three-dimensional geometric information in forest scenes, along with three-dimensional reconstructions of entire forest environments, hold significant application value in precision forestry and forestry digital twins. However, due to complex vegetation structures, fine geometric details, and severe occlusions in forest environments, existing methods—whether vision-based or LiDAR-based—still face challenges such as high data acquisition costs, feature extraction difficulties, and limited reconstruction accuracy. This study focuses on reconstructing tree distribution and extracting key individual tree parameters, and it proposes a forest 3D reconstruction framework based on high-resolution remote sensing images. Firstly, an optimized Mask R-CNN model was employed to segment individual tree crowns and extract distribution information. Then, a Tree Parameter and Reconstruction Network (TPRN) was constructed to directly estimate key structural parameters (height, DBH etc.) from crown images and generate tree 3D models. Subsequently, the 3D forest scene could be reconstructed by combining the distribution information and tree 3D models. In addition, to address the data scarcity, a hybrid training strategy integrating virtual and real data was proposed for crown segmentation and individual tree parameter estimation. Experimental results demonstrated that the proposed method could reconstruct an entire forest scene within seconds while accurately preserving tree distribution and individual tree attributes. In two real-world plots, the tree counting accuracy exceeded 90%, with an average tree localization error under 0.2 m. The TPRN achieved parameter extraction accuracies of 92.7% and 96% for tree height, and 95.4% and 94.1% for DBH. Furthermore, the generated individual tree models achieved average Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM) scores of 11.24 and 0.53, respectively, validating the quality of the reconstruction. This approach enables fast and effective large-scale forest scene reconstruction using only a single remote sensing image as input, demonstrating significant potential for applications in both dynamic forest resource monitoring and forestry-oriented digital twin systems. Full article
(This article belongs to the Special Issue Digital Modeling for Sustainable Forest Management)
Show Figures

Figure 1

33 pages, 2741 KB  
Review
Deep Learning in Multimodal Fusion for Sustainable Plant Care: A Comprehensive Review
by Zhi-Xiang Yang, Yusi Li, Rui-Feng Wang, Pingfan Hu and Wen-Hao Su
Sustainability 2025, 17(12), 5255; https://doi.org/10.3390/su17125255 - 6 Jun 2025
Cited by 13 | Viewed by 2630
Abstract
With the advancement of Agriculture 4.0 and the ongoing transition toward sustainable and intelligent agricultural systems, deep learning-based multimodal fusion technologies have emerged as a driving force for crop monitoring, plant management, and resource conservation. This article systematically reviews research progress from three [...] Read more.
With the advancement of Agriculture 4.0 and the ongoing transition toward sustainable and intelligent agricultural systems, deep learning-based multimodal fusion technologies have emerged as a driving force for crop monitoring, plant management, and resource conservation. This article systematically reviews research progress from three perspectives: technical frameworks, application scenarios, and sustainability-driven challenges. At the technical framework level, it outlines an integrated system encompassing data acquisition, feature fusion, and decision optimization, thereby covering the full pipeline of perception, analysis, and decision making essential for sustainable practices. Regarding application scenarios, it focuses on three major tasks—disease diagnosis, maturity and yield prediction, and weed identification—evaluating how deep learning-driven multisource data integration enhances precision and efficiency in sustainable farming operations. It further discusses the efficient translation of detection outcomes into eco-friendly field practices through agricultural navigation systems, harvesting and plant protection robots, and intelligent resource management strategies based on feedback-driven monitoring. In addressing challenges and future directions, the article highlights key bottlenecks such as data heterogeneity, real-time processing limitations, and insufficient model generalization, and proposes potential solutions including cross-modal generative models and federated learning to support more resilient, sustainable agricultural systems. This work offers a comprehensive three-dimensional analysis across technology, application, and sustainability challenges, providing theoretical insights and practical guidance for the intelligent and sustainable transformation of modern agriculture through multimodal fusion. Full article
Show Figures

Figure 1

Back to TopTop