Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (93)

Search Parameters:
Keywords = face attribute classification

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 1833 KB  
Review
From Fingerprint Spectra to Intelligent Perception: Research Advances in Spectral Techniques for Ginseng Species Identification
by Yuying Jiang, Xi Jin, Guangming Li, Hongyi Ge, Yida Yin, Huifang Zheng, Xing Li and Peng Li
Foods 2026, 15(4), 684; https://doi.org/10.3390/foods15040684 - 13 Feb 2026
Abstract
Owing to the high pharmacological relevance and multidimensional quality attributes of Panax spp., accurate authentication and quality evaluation of Panax-derived herbal materials remain challenging within traditional Chinese medicine (TCM) quality control systems. Conventional approaches often face trade-offs among analysis speed and throughput, non-destructive [...] Read more.
Owing to the high pharmacological relevance and multidimensional quality attributes of Panax spp., accurate authentication and quality evaluation of Panax-derived herbal materials remain challenging within traditional Chinese medicine (TCM) quality control systems. Conventional approaches often face trade-offs among analysis speed and throughput, non-destructive measurement, and analytical accuracy, which can limit their suitability for modern, large-scale quality control. This review summarizes recent advances in vibrational and related analytical techniques—infrared (IR) and near-infrared (NIR) spectroscopy, Raman spectroscopy, terahertz (THz) spectroscopy, hyperspectral imaging (HSI), and nuclear magnetic resonance (NMR)—for authentication and quality evaluation of Panax materials. We compare the capabilities of each modality in supporting key tasks, including species authentication, geographical origin tracing, age/cultivation-stage discrimination, and quantitative assessment of major chemical markers, with emphasis on the underlying measurement principles. In general, NIR and HSI are well suited to rapid, high-throughput screening of bulk samples, whereas Raman and NMR provide higher chemical specificity for molecular and structural characterization. To mitigate limitations of single-modality analysis, this review discusses a methodological shift from conventional spectral fingerprinting and chemometric approaches toward model-driven, data-enabled sensing strategies for robust quality evaluation. Specifically, we highlight multimodal data fusion frameworks combined with interpretable machine-learning/deep-learning methods to build robust classification and regression models for quality assessment. This perspective aims to support standardized and scalable authentication and quality evaluation of Panax herbal materials and to facilitate the digitization of quality control workflows for Chinese herbal medicines. Full article
Show Figures

Figure 1

16 pages, 3272 KB  
Article
Enhancing Fairness Without Demographic Labels via Identifying and Mitigating Potential Biases
by Pilhyeon Lee and Sungho Park
Symmetry 2026, 18(2), 344; https://doi.org/10.3390/sym18020344 - 12 Feb 2026
Abstract
Asymmetries in data distributions and performance across subgroups can induce systematic unfairness in real-world systems. A variety of previous studies have significantly ameliorated the fairness of deep learning models; however, most of them necessarily require additional labels for sensitive attributes, (i.e., ethnicity and [...] Read more.
Asymmetries in data distributions and performance across subgroups can induce systematic unfairness in real-world systems. A variety of previous studies have significantly ameliorated the fairness of deep learning models; however, most of them necessarily require additional labels for sensitive attributes, (i.e., ethnicity and gender). Since sensitive attributes often correspond to personal information, collecting such labels can be restricted and may raise privacy concerns. Although recent work has sought to address these issues by training a model without sensitive attribute labels, we point out that it has limitations, as it assumes specific characteristics of sensitive attributes and is validated in simplistic, constrained environments. Therefore, we propose an Unsupervised Fairness-aware Framework (UFF) that trains a fair classification model without pre-defining the characteristics of the sensitive attributes. It includes branches that capture various types of biases and eliminates them through adversarial training. In various scenarios on benchmark datasets, (i.e., CelebA and UTK Face) for facial attribute classification, the proposed method significantly enhances fairness without assuming specific characteristics of sensitive attributes. Moreover, we introduce g-FAT, which is a new metric to measure generalized trade-off performances between classification accuracy and fairness. For example, on CelebA, ours reduces EO from 11.8 to 7.6 for malignant bias and from 15.6 to 9.6 for benign bias, while improving g-FAT from 80.7 to 84.9 and from 79.0 to 85.2, respectively. In terms of g-FAT, our method achieves the highest trade-off performance among the compared methods on the benchmarks. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Computer Vision and Artificial Intelligence)
Show Figures

Figure 1

43 pages, 6677 KB  
Article
Development of an AI-Driven Computational Framework for Integrated Dietary Pattern Assessment: A Simulation-Based Proof-of-Concept Study
by Mohammad Fazle Rabbi
Nutrients 2026, 18(3), 535; https://doi.org/10.3390/nu18030535 - 5 Feb 2026
Viewed by 280
Abstract
Background/Objectives: Contemporary food systems face dual imperatives of ensuring nutritional adequacy while minimizing environmental resource consumption, yet conventional dietary assessment methodologies inadequately integrate these competing objectives. This simulation-based proof-of-concept study developed an artificial intelligence-driven computational framework synthesizing nutritional evaluation, environmental footprint quantification, [...] Read more.
Background/Objectives: Contemporary food systems face dual imperatives of ensuring nutritional adequacy while minimizing environmental resource consumption, yet conventional dietary assessment methodologies inadequately integrate these competing objectives. This simulation-based proof-of-concept study developed an artificial intelligence-driven computational framework synthesizing nutritional evaluation, environmental footprint quantification, and economic accessibility assessment. Methods: The analytical architecture integrated random forest classification, dimensionality reduction, and scenario-based optimization across a simulated population cohort of 1500 individuals. Food composition data encompassed 55 representative foods across eight categories linked with greenhouse gas emissions, water use, and price parameters. Four dietary patterns (Mediterranean, Western, Plant-based, Mixed) were characterized across nutrient adequacy, greenhouse gas emissions, water consumption, and economic cost. Results: Random forest classification achieved 39.1% accuracy, with cost, greenhouse gas emissions, and water consumption emerging as the most discriminating features. Dietary patterns exhibited convergent macronutrient profiles (protein 108.8–112.8 g per day, 4% variation) despite categorical distinctions, while calcium inadequacy pervaded all patterns (867–927.5 mg per day, 7–13% below requirements). Environmental footprints demonstrated limited differentiation (greenhouse gas 3.73–3.96 kg CO2e per day, 6% range). Bootstrap resampling (n = 1000) confirmed narrow confidence intervals, with NHANES validation revealing substantial energy intake deviations (38–58% above observed means) attributable to adequacy-prioritized design rather than observed consumption patterns. Scenario modeling identified seasonally flexible dietary configurations maintaining micronutrient and protein adequacy while reducing water use to 87% of baseline at modest cost increases. Conclusions: This framework establishes a validated computational infrastructure for integrated dietary assessment benchmarked against sustainability thresholds and epidemiological reference data, demonstrating the feasibility of AI-driven evaluation of dietary patterns across nutritional, environmental, and economic dimensions. Full article
(This article belongs to the Section Nutrition Methodology & Assessment)
Show Figures

Figure 1

22 pages, 8610 KB  
Article
A Unified GNN-CV Framework for Intelligent Aerial Situational Awareness
by Leyan Li, Rennong Yang, Anxin Guo and Zhenxing Zhang
Sensors 2026, 26(1), 119; https://doi.org/10.3390/s26010119 - 24 Dec 2025
Viewed by 316
Abstract
Aerial situational awareness (SA) faces significant challenges due to inherent complexity involving large-scale dynamic entities and intricate spatio-temporal relationships. While deep learning advances SA for specific data modalities (static or time-series), existing approaches often lack the holistic, vision-centric perspective essential for human decision-making. [...] Read more.
Aerial situational awareness (SA) faces significant challenges due to inherent complexity involving large-scale dynamic entities and intricate spatio-temporal relationships. While deep learning advances SA for specific data modalities (static or time-series), existing approaches often lack the holistic, vision-centric perspective essential for human decision-making. To bridge this gap, we propose a unified GNN-CV framework for operational-level SA. This framework leverages mature computer vision (CV) architectures to intelligently process radar-map-like representations, addressing diverse SA tasks within a unified paradigm. Key innovations include methods for sparse entity attribute transformation graph neural networks (SET-GNNs), large-scale radar map reconstruction, integrated feature extraction, specialized two-stage pre-training, and adaptable downstream task networks. We rigorously evaluate the framework on critical operational-level tasks: aerial swarm partitioning and configuration recognition. The framework achieves an impressive end-to-end recognition accuracy exceeding 90.1%. Notably, in specialized tactical scenarios featuring small, large, and irregular flight intervals within formations, configuration recognition accuracy surpasses 85.0%. Even in the presence of significant position and heading disturbances, accuracy remains above 80.4%, with millisecond response cycles. Experimental results highlight the benefits of leveraging mature CV techniques such as image classification, object detection, and image generation, which enhance the efficacy, resilience, and coherence of intelligent situational awareness. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

41 pages, 6158 KB  
Article
Security Audit of IoT Device Networks: A Reproducible Machine Learning Framework for Threat Detection and Performance Benchmarking
by Aigul Shaikhanova, Oleksandr Kuznetsov, Aizhan Tokkuliyeva, Kamil Ayapbergenov, Satiev Olzhas and Tlepov Danir
Sensors 2025, 25(24), 7519; https://doi.org/10.3390/s25247519 - 11 Dec 2025
Viewed by 804
Abstract
Internet of Things deployments face escalating security threats, yet systematic methods for auditing the defensive posture of IoT device networks remain underdeveloped. Current intrusion detection evaluations focus on algorithmic accuracy while neglecting operational requirements—computational efficiency, reproducibility, and interpretable risk assessment—that security audits demand. [...] Read more.
Internet of Things deployments face escalating security threats, yet systematic methods for auditing the defensive posture of IoT device networks remain underdeveloped. Current intrusion detection evaluations focus on algorithmic accuracy while neglecting operational requirements—computational efficiency, reproducibility, and interpretable risk assessment—that security audits demand. This paper introduces a reproducible security audit framework for IoT device networks, demonstrated through systematic evaluation of four machine learning models (Random Forest, LightGBM, XGBoost, Logistic Regression) on the TON_IoT dataset containing nine attack categories targeting smart environments. Our audit methodology enforces strict feature hygiene by excluding identity-revealing attributes, benchmarks both threat detection capability and computational cost, and provides complete reproducibility artifacts including preprocessing pipelines and trained models. The framework evaluates security posture through dual lenses: binary classification (distinguishing compromised from legitimate traffic) and multiclass classification (attributing threats to specific attack types). Binary audit results show ensemble models achieve 99.8–99.9% accuracy with perfect ROC-AUC (100%) and sub-15 ms inference latency per 1000 flows, confirming reliable attack detection. Multiclass auditing reveals more nuanced findings: while overall accuracy reaches 99.0% with macro-F1 near 97%, rare attack types expose critical blind spots—man-in-the-middle threats achieve only 78% F1 despite representing serious security risks. LightGBM provides optimal audit performance, balancing 99.93% detection accuracy with 2.76 MB deployment footprint. We translate audit findings into actionable security recommendations (network segmentation, rate-limiting, TLS metadata collection) and compare against twenty published studies, demonstrating that our framework achieves competitive detection rates while uniquely delivering the transparency, efficiency metrics, and reproducibility required for credible security assessment of production IoT networks. Full article
(This article belongs to the Special Issue Advances in Security for Emerging Intelligent Systems)
Show Figures

Figure 1

26 pages, 951 KB  
Article
Distributed Semi-Supervised Multi-Dimensional Uncertain Data Classification over Networks
by Zhen Xu and Sicong Chen
Electronics 2025, 14(23), 4634; https://doi.org/10.3390/electronics14234634 - 25 Nov 2025
Viewed by 328
Abstract
Distributed multi-dimensional classification, where multiple nodes over a network induce a multi-dimensional classifier based on their own local data and a little information exchanged from neighbors, has received extensive attention in the academic community recently. Nevertheless, we observe that the classical distributed multi-dimensional [...] Read more.
Distributed multi-dimensional classification, where multiple nodes over a network induce a multi-dimensional classifier based on their own local data and a little information exchanged from neighbors, has received extensive attention in the academic community recently. Nevertheless, we observe that the classical distributed multi-dimensional classification formulation requires all training data to have definite feature attributes and complete labels. However, in real-world scenarios, due to measurement errors in distributed networks, the collected data samples consist of attributes with uncertainty. Additionally, a substantial proportion of multi-dimensional data faces challenges in label acquisition. Therefore, the key to achieving satisfactory performance in such a case is designing an effective method to model the input uncertainty and exploit weakly supervised information from the training data. Considering this, in this paper, we design a novel misclassification loss function that extracts effective information from uncertain data by treating it as the integral of misclassification loss over the potential data distribution. Additionally, we propose a new explicit feature mapping for constructing a nonlinear discriminant function. Based on this, we further put forward a novel manifold regularization term to recover multi-dimensional labels and simplify the original objective function to enable it to be optimized. By leveraging the gradient descent method, we optimize the simplified decentralized cost function and obtain the global optimal solution. We evaluate the performance of the proposed distributed semi-supervised multi-dimensional uncertain data classification algorithm, namely the dSMUDC algorithm, on several real datasets. The results of our experiments indicate that, in terms of all metrics, our proposed algorithm outperforms existing approaches to a significant extent. Full article
Show Figures

Figure 1

47 pages, 3926 KB  
Review
AI-Driven Control Strategies for FACTS Devices in Power Quality Management: A Comprehensive Review
by Mahmoud Kiasari and Hamed Aly
Appl. Sci. 2025, 15(22), 12050; https://doi.org/10.3390/app152212050 - 12 Nov 2025
Viewed by 1167
Abstract
Current power systems are facing noticeable power quality (PQ) performance deterioration, which has been attributed to nonlinear loads, distributed generation, and extensive renewable energy infiltration (REI). These conditions cause voltage sags, harmonic distortion, flicker, and disadvantageous power factors. The traditional PI/PID-based scheme of [...] Read more.
Current power systems are facing noticeable power quality (PQ) performance deterioration, which has been attributed to nonlinear loads, distributed generation, and extensive renewable energy infiltration (REI). These conditions cause voltage sags, harmonic distortion, flicker, and disadvantageous power factors. The traditional PI/PID-based scheme of control, when applied to Flexible AC Transmission Systems (FACTSs), demonstrates low adaptability and low anticipatory functions, which are required to operate a grid in real-time and dynamic conditions. Artificial Intelligence (AI) opens proactive, reactive, or adaptive and self-optimizing control schemes, which reformulate FACTS to thoughtful, data-intensive power-system objects. This literature review systematically studies the convergence of AI and FACTS technology, with an emphasis on how AI can improve voltage stability, harmonic control, flicker control, and reactive power control in the grid formation of various types of grids. A new classification is proposed for the identification of AI methodologies, including deep learning, reinforcement learning, fuzzy logic, and graph neural networks, according to specific FQ goals and FACTS device categories. This study quantitatively compares AI-enhanced and traditional controllers and uses key performance indicators such as response time, total harmonic distortion (THD), precision of voltage regulation, and reactive power compensation effectiveness. In addition, the analysis discusses the main implementation obstacles, such as data shortages, computational time, readability, and regulatory limitations, and suggests mitigation measures for these issues. The conclusion outlines a clear future research direction towards physics-informed neural networks, federated learning, which facilitates decentralized control, digital twins, which facilitate real-time validation, and multi-agent reinforcement learning, which facilitates coordinated operation. Through the current research synthesis, this study provides researchers, engineers, and system planners with actionable information to create a next-generation AI-FACTS framework that can support resilient and high-quality power delivery. Full article
Show Figures

Figure 1

26 pages, 12698 KB  
Article
Innovative Multi-Type Identification System for Cropland Abandonment on the Loess Plateau: Spatiotemporal Dynamics, Driver Shifts (2000–2023) and Implications for Food Security
by Wei Song
Land 2025, 14(10), 2062; https://doi.org/10.3390/land14102062 - 15 Oct 2025
Viewed by 546
Abstract
As a critical ecological barrier and key dryland agricultural zone in China, the Loess Plateau is faced with acute tensions between food security risks arising from cropland abandonment (CA) and the imperatives of ecological conservation. Yet, existing research has failed to adequately capture [...] Read more.
As a critical ecological barrier and key dryland agricultural zone in China, the Loess Plateau is faced with acute tensions between food security risks arising from cropland abandonment (CA) and the imperatives of ecological conservation. Yet, existing research has failed to adequately capture the long-term, high-spatiotemporal-resolution dynamics of abandonment in this region or to quantitatively couple its driving mechanisms with implications for food security. To address these gaps, this study establishes a high-precision identification system for CA tailored to the Plateau’s complex topographic conditions, distinguishing among interannual abandonment, multiyear abandonment, conversion to forest/grassland, and reclamation. Leveraging long-term data from 2000 to 2023 and integrating the Mann–Kendall test with the random forest algorithm, we examine the spatiotemporal trajectories, driving forces, and food security consequences of CA. Guided by a “type differentiation–grade classification–temporal tracking” framework, the analysis reveals a marked transition in dominant drivers from “socioeconomic factors” to “topographic–climatic factors.” It further identifies an “increasing loss–slowing growth” effect of abandonment on grain production, alongside a “pressure alleviation” trend in per capita carrying capacity. The results showed that: (1) Between 2000 and 2023, the area of CA on the Loess Plateau expanded from 2.72 million ha to 6.96 million ha, with high-grade abandonment (≥8 years) accounting for 58.9% of the total and being spatially concentrated in the hilly–gully regions of northern Shaanxi and eastern Gansu; (2) The Grain for Green Project (GFGP) peaked at approximately 340,000 hectares in 2018, followed by a slight decline, but has generally remained at around 300,000 hectares since then; (3) The reclamation rate of CA remained between 5% and 12% during 2003–2015, with minimal overall fluctuations, but after 2016, it gradually increased and peaked at 23.4% in 2022; (4) In terms of driving forces, population density (14.99%) was the primary determinant in 2005, whereas by 2020, slope (15.43%) and mean annual precipitation (15.63%) emerged as core factors; and (5) Grain yield losses attributable to abandonment increased from less than 100 t to nearly 450 t, though the growth rate slowed after 2016, accompanied by gradual alleviation of pressure on per capita carrying capacity. Overall, the study offers robust empirical evidence to inform cropland protection, food security strategies, and sustainable agricultural development policies on the Loess Plateau. Full article
Show Figures

Figure 1

28 pages, 2961 KB  
Article
An Improved Capsule Network for Image Classification Using Multi-Scale Feature Extraction
by Wenjie Huang, Ruiqing Kang, Lingyan Li and Wenhui Feng
J. Imaging 2025, 11(10), 355; https://doi.org/10.3390/jimaging11100355 - 10 Oct 2025
Viewed by 908
Abstract
In the realm of image classification, the capsule network is a network topology that packs the extracted features into many capsules, performs sophisticated capsule screening using a dynamic routing mechanism, and finally recognizes that each capsule corresponds to a category feature. Compared with [...] Read more.
In the realm of image classification, the capsule network is a network topology that packs the extracted features into many capsules, performs sophisticated capsule screening using a dynamic routing mechanism, and finally recognizes that each capsule corresponds to a category feature. Compared with previous network topologies, the capsule network has more sophisticated operations, uses a large number of parameter matrices and vectors to express picture attributes, and has more powerful image classification capabilities. However, in the practical application field, the capsule network has always been constrained by the quantity of calculation produced by the complicated structure. In the face of basic datasets, it is prone to over-fitting and poor generalization and often cannot satisfy the high computational overhead when facing complex datasets. Based on the aforesaid problems, this research proposes a novel enhanced capsule network topology. The upgraded network boosts the feature extraction ability of the network by incorporating a multi-scale feature extraction module based on proprietary star structure convolution into the standard capsule network. At the same time, additional structural portions of the capsule network are changed, and a variety of optimization approaches such as dense connection, attention mechanism, and low-rank matrix operation are combined. Image classification studies are carried out on different datasets, and the novel structure suggested in this paper has good classification performance on CIFAR-10, CIFAR-100, and CUB datasets. At the same time, we also achieved 98.21% and 95.38% classification accuracy on two complicated datasets of skin cancer ISIC derived and Forged Face EXP. Full article
(This article belongs to the Section Image and Video Processing)
Show Figures

Figure 1

27 pages, 6007 KB  
Article
Research on Rice Field Identification Methods in Mountainous Regions
by Yuyao Wang, Jiehai Cheng, Zhanliang Yuan and Wenqian Zang
Remote Sens. 2025, 17(19), 3356; https://doi.org/10.3390/rs17193356 - 2 Oct 2025
Viewed by 863
Abstract
Rice is one of the most important staple crops in China, and the rapid and accurate extraction of rice planting areas plays a crucial role in the agricultural management and food security assessment. However, the existing rice field identification methods faced the significant [...] Read more.
Rice is one of the most important staple crops in China, and the rapid and accurate extraction of rice planting areas plays a crucial role in the agricultural management and food security assessment. However, the existing rice field identification methods faced the significant challenges in mountainous regions due to the severe cloud contamination, insufficient utilization of multi-dimensional features, and limited classification accuracy. This study presented a novel rice field identification method based on the Graph Convolutional Networks (GCN) that effectively integrated multi-source remote sensing data tailored for the complex mountainous terrain. A coarse-to-fine cloud removal strategy was developed by fusing the synthetic aperture radar (SAR) imagery with temporally adjacent optical remote sensing imagery, achieving high cloud removal accuracy, thereby providing reliable and clear optical data for the subsequent rice mapping. A comprehensive multi-feature library comprising spectral, texture, polarization, and terrain attributes was constructed and optimized via a stepwise selection process. Furthermore, the 19 key features were established to enhance the classification performance. The proposed method achieved an overall accuracy of 98.3% for the rice field identification in Huoshan County of the Dabie Mountains, and a 96.8% consistency compared to statistical yearbook data. The ablation experiments demonstrated that incorporating terrain features substantially improved the rice field identification accuracy under the complex topographic conditions. The comparative evaluations against support vector machine (SVM), random forest (RF), and U-Net models confirmed the superiority of the proposed method in terms of accuracy, local performance, terrain adaptability, training sample requirement, and computational cost, and demonstrated its effectiveness and applicability for the high-precision rice field distribution mapping in mountainous environments. Full article
Show Figures

Figure 1

27 pages, 1382 KB  
Article
Integrating AI and Geospatial Technologies for Sustainable Smart City Development: A Case Study of Yerevan
by Khoren Mkhitaryan, Anna Sanamyan, Mariam Mnatsakanyan, Erika Kirakosyan and Svetlana Ratner
Urban Sci. 2025, 9(10), 389; https://doi.org/10.3390/urbansci9100389 - 26 Sep 2025
Cited by 3 | Viewed by 3591
Abstract
Urban growth and environmental pressures in rapidly transforming cities require innovative governance tools that integrate advanced technologies with institutional assessment. This study develops and applies a strategic integration framework that combines spatial analysis, Convolutional Neural Networks (CNNs)-based land-use classification, SHAP-based feature attribution, and [...] Read more.
Urban growth and environmental pressures in rapidly transforming cities require innovative governance tools that integrate advanced technologies with institutional assessment. This study develops and applies a strategic integration framework that combines spatial analysis, Convolutional Neural Networks (CNNs)-based land-use classification, SHAP-based feature attribution, and stakeholder interviews to evaluate Yerevan, Armenia, as a case of a mid-income city facing accelerated urbanization. The case selection is justified by Yerevan’s rapid built-up expansion, fragmented green areas, and institutional challenges in aligning urban development with sustainability goals. The CNN model achieved 92.4% accuracy in land-use classification, and projections under a business-as-usual scenario indicate a 12.8% increase in built-up areas and a 6.5% decline in green zones by 2030. SHAP analysis identified land surface temperature and NDVI as the most influential predictors, while governance interviews highlighted gaps in regulatory support and technical capacity. The proposed framework advances the literature by integrating AI-driven geospatial analysis with qualitative governance assessment, providing actionable insights for urban policymakers. Findings underscore the potential of combining machine learning, geospatial technologies, and institutional diagnostics to guide smart city planning in transition economies. Full article
(This article belongs to the Special Issue Sustainable Urbanization, Regional Planning and Development)
Show Figures

Figure 1

28 pages, 6613 KB  
Article
Age Estimation and Gender Classification from Facial Images
by Maja Kocoń and Szymon Pawlukiewicz
Appl. Sci. 2025, 15(18), 10212; https://doi.org/10.3390/app151810212 - 19 Sep 2025
Viewed by 4902
Abstract
This study addresses the task of simultaneous age estimation and gender classification from facial images using convolutional neural networks (CNNs). The objective was to develop a unified model capable of handling both regression and classification tasks effectively. Four models with varying architectures, loss [...] Read more.
This study addresses the task of simultaneous age estimation and gender classification from facial images using convolutional neural networks (CNNs). The objective was to develop a unified model capable of handling both regression and classification tasks effectively. Four models with varying architectures, loss functions, and preprocessing strategies were implemented and evaluated. The best-performing model achieved over 90% accuracy in gender classification and a mean absolute error (MAE) below four years for age estimation. Performance analysis showed variation across age groups, with reduced accuracy for elderly individuals due to dataset imbalance and improved predictions for younger and middle-aged adults. To assess generalization, the model was also tested on external images, maintaining strong performance, particularly in gender classification. Challenges such as overfitting and face misdetection were addressed through preprocessing adjustments and model tuning. Beyond empirical results, this work consolidates a unified, reproducible protocol for joint age estimation and gender classification on a widely used face database. We standardize preprocessing, implement a consistent image-level split with a published seed, and define task-appropriate metrics. All training details are documented to provide a baseline, enhanced by a qualitative error analysis, enabling consistent reporting and facilitating future extensions. This approach demonstrates the effectiveness of CNNs for age and gender prediction and highlights their potential integration into recommendation systems that personalize content based on demographic attributes. Full article
Show Figures

Figure 1

15 pages, 3677 KB  
Article
Contextual Feature Expansion with Superordinate Concept for Compositional Zero-Shot Learning
by Soohyeong Kim and Yong Suk Choi
Appl. Sci. 2025, 15(17), 9837; https://doi.org/10.3390/app15179837 - 8 Sep 2025
Viewed by 847
Abstract
Compositional Zero-Shot Learning (CZSL) seeks to enable machines to recognize objects and attributes (i.e., primitives),learn their associations, and generalize to novel compositions, enabling systems to exhibit a human-like ability to infer and generalize. The existing approaches, multi-label and multi-class classification, face inherent trade-offs: [...] Read more.
Compositional Zero-Shot Learning (CZSL) seeks to enable machines to recognize objects and attributes (i.e., primitives),learn their associations, and generalize to novel compositions, enabling systems to exhibit a human-like ability to infer and generalize. The existing approaches, multi-label and multi-class classification, face inherent trade-offs: the former suffers from biases against unrelated compositions, while the latter struggles with exponentially growing search spaces as the number of objects and attributes increases. To overcome these limitations and address the exponential complexity in CZSL, we introduce Concept-oriented Feature ADjustment (CoFAD), a novel method that extracts superordinate conceptual features based on primitive relationships and expands label feature boundaries. By incorporating spectral clustering and membership function in fuzzy logic, CoFAD achieves state-of-the-art performance while using 2×–4× less GPU memory and reducing training time by up to 50× on large-scale dataset. Full article
Show Figures

Figure 1

20 pages, 17025 KB  
Article
SODE-Net: A Slender Rotating Object Detection Network Based on Spatial Orthogonality and Decoupled Encoding
by Xiaozhi Yu, Wei Xiang, Lu Yu, Kang Han and Yuan Yang
Remote Sens. 2025, 17(17), 3042; https://doi.org/10.3390/rs17173042 - 1 Sep 2025
Viewed by 1423
Abstract
Remote sensing objects often exhibit significant scale variations, high aspect ratios, and diverse orientations. The anisotropic spatial distribution of such objects’ features leads to the conflict between feature representation and boundary regression caused by the coupling of different attribute parameters: previous detection methods [...] Read more.
Remote sensing objects often exhibit significant scale variations, high aspect ratios, and diverse orientations. The anisotropic spatial distribution of such objects’ features leads to the conflict between feature representation and boundary regression caused by the coupling of different attribute parameters: previous detection methods based on square-kernel convolution lack the overall perception of large-scale or slender objects due to the limited receptive field; if the receptive field is simply expanded, although more context information can be captured to help object perception, a large amount of background noise will be introduced, resulting in inaccurate feature extraction of remote sensing objects. Additionally, the extracted features face issues of feature conflict and discontinuous loss during parameter regression. Existing methods often neglect the holistic optimization of these aspects. To address these challenges, this paper proposes SODE-Net as a systematic solution. Specifically, we first design a multi-scale fusion and spatially orthogonal convolution (MSSO) module in the backbone network. Its multiple shapes of receptive fields can naturally capture the long-range dependence of the object without introducing too much background noise, thereby extracting more accurate target features. Secondly, we design a multi-level decoupled detection head, which decouples target classification, bounding-box position regression and bounding-box angle regression into three subtasks, effectively avoiding the coupling problem in parameter regression. At the same time, the phase-continuous encoding module is used in the angle regression branch, which converts the periodic angle value into a continuous cosine value, thus ensuring the stability of the loss value. Extensive experiments demonstrate that, compared to existing detection networks, our method achieves superior performance on four widely used remote sensing object datasets: DOTAv1.0, HRSC2016, UCAS-AOD, and DIOR-R. Full article
Show Figures

Figure 1

29 pages, 12228 KB  
Article
Conditional Domain Adaptation with α-Rényi Entropy Regularization and Noise-Aware Label Weighting
by Diego Armando Pérez-Rosero, Andrés Marino Álvarez-Meza and German Castellanos-Dominguez
Mathematics 2025, 13(16), 2602; https://doi.org/10.3390/math13162602 - 14 Aug 2025
Viewed by 1771
Abstract
Domain adaptation is a key approach to ensure that artificial intelligence models maintain reliable performance when facing distributional shifts between training (source) and testing (target) domains. However, existing methods often struggle to simultaneously preserve domain-invariant representations and discriminative class structures, particularly in the [...] Read more.
Domain adaptation is a key approach to ensure that artificial intelligence models maintain reliable performance when facing distributional shifts between training (source) and testing (target) domains. However, existing methods often struggle to simultaneously preserve domain-invariant representations and discriminative class structures, particularly in the presence of complex covariate shifts and noisy pseudo-labels in the target domain. In this work, we introduce Conditional Rényi α-Entropy Domain Adaptation, named CREDA, a novel deep learning framework for domain adaptation that integrates kernel-based conditional alignment with a differentiable, matrix-based formulation of Rényi’s quadratic entropy. The proposed method comprises three main components: (i) a deep feature extractor that learns domain-invariant representations from labeled source and unlabeled target data; (ii) an entropy-weighted approach that down-weights low-confidence pseudo-labels, enhancing stability in uncertain regions; and (iii) a class-conditional alignment loss, formulated as a Rényi-based entropy kernel estimator, that enforces semantic consistency in the latent space. We validate CREDA on standard benchmark datasets for image classification, including Digits, ImageCLEF-DA, and Office-31, showing competitive performance against both classical and deep learning-based approaches. Furthermore, we employ nonlinear dimensionality reduction and class activation maps visualizations to provide interpretability, revealing meaningful alignment in feature space and offering insights into the relevance of individual samples and attributes. Experimental results confirm that CREDA improves cross-domain generalization while promoting accuracy, robustness, and interpretability. Full article
Show Figures

Figure 1

Back to TopTop