Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (3,619)

Search Parameters:
Keywords = joint information

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
25 pages, 2327 KB  
Article
Joint Beamforming for Integrated Satellite–Terrestrial ISAC Systems
by Tengyu Wang and Qian Wang
Sensors 2026, 26(7), 2273; https://doi.org/10.3390/s26072273 - 7 Apr 2026
Abstract
Satellite–terrestrial integrated networks provide seamless global coverage, especially in remote areas where terrestrial deployment is costly. Integrated sensing and communications (ISAC) enhances spectral efficiency by merging both functions on a single platform. This paper proposes a novel integrated satellite–terrestrial ISAC architecture, where a [...] Read more.
Satellite–terrestrial integrated networks provide seamless global coverage, especially in remote areas where terrestrial deployment is costly. Integrated sensing and communications (ISAC) enhances spectral efficiency by merging both functions on a single platform. This paper proposes a novel integrated satellite–terrestrial ISAC architecture, where a satellite performs simultaneous communication and sensing. The satellite transmits communication signals and sensing waveforms to an Earth Station, which then relays them to a terrestrial base station to serve multiple users. We formulate a joint beamforming design problem to maximize the sum rate of users under quality-of-service constraints, backhaul capacity limits, beampattern requirements for sensing, and power budgets. With perfect channel state information, the non-convex problem is transformed into a difference-of-convex form and solved via the convex–concave procedure. For imperfect channel state information, a robust method combining successive convex approximation and the S-procedure is developed. Simulations show the proposed design outperforms benchmarks and is suitable for low-Earth orbit satellite systems. Full article
(This article belongs to the Special Issue New Technologies in Wireless Communication System)
Show Figures

Figure 1

23 pages, 9833 KB  
Article
Early Warning Method for Thermal Runaway High-Risk Cells Based on Nonlinear Mapping and Multidimensional Features
by Zhengxin Liu, Hongda Liu, Fang Lu, Yuxi Liu and Yangting Xiao
J. Mar. Sci. Eng. 2026, 14(7), 684; https://doi.org/10.3390/jmse14070684 - 7 Apr 2026
Abstract
In harsh marine environments, vessel energy storage systems (VESS) face elevated thermal runaway (TR) risk, yet practical early warning remains difficult because early voltage differences between TR high-risk cells and normal cells are often weak, warning thresholds vary across operating segments, and decisions [...] Read more.
In harsh marine environments, vessel energy storage systems (VESS) face elevated thermal runaway (TR) risk, yet practical early warning remains difficult because early voltage differences between TR high-risk cells and normal cells are often weak, warning thresholds vary across operating segments, and decisions relying on a single feature are prone to false or missed warnings. To overcome these difficulties, this study develops a four-part early warning strategy for TR high-risk cells in VESS. First, the original cell voltages are denoised through multiscale jump plus mode decomposition and Spearman correlation guided mode reconstruction to suppress irrelevant interference. Second, an improved Sigmoid nonlinear mapping is introduced to enhance subtle inter-cell voltage deviations and improve early separability. Third, sparse representation is used to construct a cell deviation score, and an adaptive threshold is employed to perform primary abnormal-cell screening under varying segment conditions. Finally, multidimensional mutual information value derived from voltage, temperature, and their rates of change is incorporated into a joint assessment methodology to further verify the abnormal state of flagged cells. Validation on 18 independent real operation cases comprising 2483 discharge segments shows that, across the evaluated TR high-risk cases, the shortest confirmed warning lead time achieved by the proposed strategy was 14 days. The proposed strategy also reduced false and missed warnings, outperformed the compared benchmark methods overall, and retained computational feasibility for onboard application in VESS. Full article
(This article belongs to the Special Issue Safety of Ships and Marine Design Optimization)
Show Figures

Figure 1

17 pages, 8164 KB  
Article
Gli1+ Cells Exhibit Clonogenicity and Slow-Cycling Features at the Temporomandibular Joint (TMJ) Enthesis–Condyle Interface
by Rafael Correia Cavalcante, Honghao Zhang, Peter X. Ma and Yuji Mishina
Int. J. Mol. Sci. 2026, 27(7), 3324; https://doi.org/10.3390/ijms27073324 - 7 Apr 2026
Abstract
The temporomandibular joint (TMJ) relies on specialized progenitor cells for tissue maintenance and repair. We characterized TMJ-derived progenitor cells in mice and investigated the role of Evc2-mediated Hedgehog signaling. Progenitor cells from the anterior TMJ exhibited greater colony-forming capacity and an elongated [...] Read more.
The temporomandibular joint (TMJ) relies on specialized progenitor cells for tissue maintenance and repair. We characterized TMJ-derived progenitor cells in mice and investigated the role of Evc2-mediated Hedgehog signaling. Progenitor cells from the anterior TMJ exhibited greater colony-forming capacity and an elongated morphology, while posterior cells were cuboidal, highlighting regional heterogeneity. TMJ-derived progenitors demonstrated multipotency, differentiating into osteogenic and chondrogenic lineages. Gli1-expressing, slow-cycling cells localized to the ligament attachment regions, initially accumulating there and not overlapping with specialized cells (Col1+ cells). Conditional Evc2 disruption in Gli1-expressing cells paradoxically augmented expression of Gli1 and mechanosensors (Yap, Wwtr1, Piezo1), and produced more confluent, rapidly expanding colonies. We hypothesize that these colonies are primarily composed of transit amplifying cells (TACs), which may proliferate robustly but face challenges in terminal differentiation. These results reveal critical roles for EVC2 and regional progenitor cell diversity in TMJ regenerative biology and suggest that targeting cell signaling and mechanical factors may inform novel strategies for TMJ disorder therapies. Full article
(This article belongs to the Special Issue Recent Advances in Adult Stem Cell Research)
Show Figures

Figure 1

20 pages, 7392 KB  
Article
Composite Multiscale Fractional Fuzzy Diversity Entropy and Its Application in Bearing Fault Identification
by Xiong Gan and Guangyou Yang
Fractal Fract. 2026, 10(4), 243; https://doi.org/10.3390/fractalfract10040243 - 7 Apr 2026
Abstract
This paper presents an intelligent fault identification approach integrating composite multiscale fractional fuzzy diversity entropy (CMFFDE) for feature extraction, joint mutual information (JMI) for feature selection, and an extreme learning machine (ELM) for classification. First, the CMFFDE method is developed by incorporating composite [...] Read more.
This paper presents an intelligent fault identification approach integrating composite multiscale fractional fuzzy diversity entropy (CMFFDE) for feature extraction, joint mutual information (JMI) for feature selection, and an extreme learning machine (ELM) for classification. First, the CMFFDE method is developed by incorporating composite multiscale analysis into the proposed fractional fuzzy diversity entropy to extract multiscale fault characteristics. JMI feature selection is then applied to identify sensitive features, which are subsequently used as input to the ELM classifier for fault identification. The effectiveness and superiority of the proposed approach are verified using bearing experimental data. Analysis results demonstrate that the proposed approach achieves better identification performance in bearing vibration signal analysis than alternative methods. Full article
(This article belongs to the Special Issue Fractional Order Modeling and Fault Detection in Complex Systems)
Show Figures

Figure 1

24 pages, 4411 KB  
Article
GT-TD3: A Kinematics-Aware Graph-Transformer Framework for Stable Trajectory Tracking of High-Degree-of-Freedom (DOF) Manipulators
by Hanwen Miao, Haoran Hou, Zhaopeng Zhu, Zheng Chao and Rui Zhang
Machines 2026, 14(4), 397; https://doi.org/10.3390/machines14040397 - 5 Apr 2026
Viewed by 38
Abstract
Accurate trajectory tracking of redundant manipulators is difficult because the controller must simultaneously model local couplings between adjacent joints and global dependencies across the whole kinematic chain. Existing reinforcement learning methods typically employ multilayer perceptrons, which do not explicitly exploit manipulator structure and [...] Read more.
Accurate trajectory tracking of redundant manipulators is difficult because the controller must simultaneously model local couplings between adjacent joints and global dependencies across the whole kinematic chain. Existing reinforcement learning methods typically employ multilayer perceptrons, which do not explicitly exploit manipulator structure and therefore show limited stability and representation ability in high-dimensional continuous control tasks. This paper proposes GT-TD3, a Graph Transformer-enhanced-Twin Delayed Deep Deterministic Policy Gradient framework, for redundant manipulator trajectory tracking. The proposed actor first converts the raw system state into joint-level node features and uses a graph neural network to extract local kinematic coupling information. A Transformer is then employed to capture long-range dependencies among joints. To strengthen the use of structural priors, topology- and distance-related bias terms are incorporated into the attention mechanism, enabling the network to encode manipulator structure during global feature learning. Experiments on a 7-DoF KUKA iiwa manipulator in PyBullet demonstrate that GT-TD3 outperforms MLP, pure GNN, and pure Transformer baselines in tracking performance. The proposed method achieves more stable training, faster convergence, and smoother and more accurate end-effector motion. The results show that the integration of local graph modeling and structure-aware global attention provides an effective solution for high-precision trajectory tracking of redundant manipulators. Full article
(This article belongs to the Section Robotics, Mechatronics and Intelligent Machines)
Show Figures

Figure 1

23 pages, 1645 KB  
Article
Secure Cooperative Communications in 6G Networks: A Constrained Hierarchical Reinforcement Learning Framework with Hybrid Action Space
by Xiaosi Tian, Zulin Wang and Yuanhan Ni
Entropy 2026, 28(4), 412; https://doi.org/10.3390/e28040412 - 4 Apr 2026
Viewed by 96
Abstract
With the rapid evolution toward 6G networks, ensuring robust physical layer security (PLS) in highly dynamic and heterogeneous wireless environments has become a key challenge. Traditional security methods often struggle to adapt to time-varying channels, especially in the absence of perfect channel state [...] Read more.
With the rapid evolution toward 6G networks, ensuring robust physical layer security (PLS) in highly dynamic and heterogeneous wireless environments has become a key challenge. Traditional security methods often struggle to adapt to time-varying channels, especially in the absence of perfect channel state information. Furthermore, the dynamic nature of node selection and power allocation in heterogeneous networks creates a complex hybrid action space operating across multiple timescales, significantly complicating the design of efficient and adaptive security strategies. To address this, this paper proposes a novel constrained hierarchical reinforcement learning (CHRL) framework for secure cooperative communications in next-generation wireless systems. The framework is designed to optimize secrecy performance within a hybrid action space comprising both discrete node selection and continuous power allocation, operating at different timescales. By hierarchically decoupling the joint optimization problem, the upper layer performs risk-aware node selection to maximize long-term secrecy capacity (SC) while guaranteeing a stable and secure link. At the lower layer, we develop a constrained MiniMax Multi-objective Deep Deterministic Policy Gradient (M3DDPG) algorithm that optimizes power allocation considering worst-case conditions. Lagrange multipliers are integrated to enforce a strictly positive SC constraint throughout transmission, effectively preventing security outages. Simulation results under time-varying Rayleigh fading channels demonstrate that the proposed CHRL framework outperforms existing HRL methods, achieving up to 17% improvement in SC while strictly maintaining security constraints. These results validate the effectiveness of the proposed approach for enhancing PLS in next-generation cooperative wireless networks. Full article
Show Figures

Figure 1

22 pages, 10553 KB  
Article
Reconstruction of Multiplex Networks with Correlated Layers
by Valerio Gemmetto and Diego Garlaschelli
Entropy 2026, 28(4), 411; https://doi.org/10.3390/e28040411 - 4 Apr 2026
Viewed by 87
Abstract
In many situations, the complete microscopic structure of a network is empirically inaccessible and has to be inferred from aggregate information using some probabilistic model. While several network reconstruction methods have been developed in the case of single-layer networks where nodes can be [...] Read more.
In many situations, the complete microscopic structure of a network is empirically inaccessible and has to be inferred from aggregate information using some probabilistic model. While several network reconstruction methods have been developed in the case of single-layer networks where nodes can be connected only by one type of link, the problem is still largely unexplored in the case of multiplex networks where several interdependent layers, each representing a distinct mode of connection, coexist. Even the most advanced network reconstruction techniques, when applied to each layer separately, may fail in replicating the inter-layer dependencies embodying the essence of multiplex networks. Here we develop a methodology to reconstruct a class of correlated multiplexes which includes, as a specific example that we study in detail, the multiplex network of international trade in different products. Our method starts from virtually any reconstruction model that successfully reproduces a set of desired marginal properties of each layer separately, i.e., node strengths and/or node degrees. It then introduces the minimal dependency structure required to replicate an additional set of higher-order properties, namely the portion of each node’s degree and each node’s strength that is shared and/or reciprocated across pairs of layers. These properties are found to provide empirically robust measures of inter-layer coupling, allowing for an accurate reconstruction of the world trade multiplex network. Our method allows for joint multi-layer connection probabilities to be reliably reconstructed from marginal ones, effectively bridging the gap between single-layer information and global multiplex properties. Full article
(This article belongs to the Section Multidisciplinary Applications)
Show Figures

Figure 1

16 pages, 1050 KB  
Article
Psychometric Validation of a Spanish–Chilean Instrument for Assessing Public Attitudes Toward Childhood Stuttering: Construct Validity and Internal Consistency
by Yasna Sandoval, Carlos Rojas, Francisco Novoa-Muñoz, Gabriel Lagos, Carla Figueroa, Álvaro Silva, Jaime Crisosto-Alarcón and Mauricio Alfaro-Calfullán
Children 2026, 13(4), 506; https://doi.org/10.3390/children13040506 - 3 Apr 2026
Viewed by 182
Abstract
Background/Objectives: Stuttering is a neurodevelopmental disorder of speech fluency. It emerges most commonly between 2 and 5 years old, often causing social exclusion and stigma. In Latin America, cultural misconceptions regarding its causes exacerbate these psychosocial challenges. This study validated a culturally adapted [...] Read more.
Background/Objectives: Stuttering is a neurodevelopmental disorder of speech fluency. It emerges most commonly between 2 and 5 years old, often causing social exclusion and stigma. In Latin America, cultural misconceptions regarding its causes exacerbate these psychosocial challenges. This study validated a culturally adapted instrument for Chile to measure public attitudes toward stuttering. The instrument provides a psychometrically sound method to assess and address stigma within educational and community settings. Methods: A total of 756 Chilean adults (mean age = 36.7 years, SD = 15.8; 64% women, 36% men) were recruited using stratified probability sampling to reflect the national demographics. Ethical approval and informed consent were obtained. The subsection underwent rigorous cross-cultural adaptation (translation, expert review, cognitive debriefing n = 30, pre-testing n = 50). Analysis employed polychoric matrices, EFA, CFA with WLSMV, and multiple reliability/validity indices. Results: Joint analysis showed poor fit (CFI = 0.72, RMSEA = 0.12), confirming independence. Beliefs (14 items): three-factor CFA fit excellent (CFI = 0.993, RMSEA = 0.034); factors: competence/normality (α = 0.85), psychological causes (α = 0.78), and help/support (α = 0.72). Reactions (11 items): four-factor fit adequate (CFI = 0.97, RMSEA = 0.043); factors: distant concern (α = 0.82), personal concern (α = 0.79), media sources (α = 0.75), and formal sources (α = 0.77). Validity was supported; bifactor models favored multidimensionality. Conclusions: The adapted subsection is psychometrically robust and effectively captures Chilean-specific attitudes toward childhood stuttering. It provides a reliable tool for quantifying public stigma and misconceptions, particularly in educational and school contexts, thereby supporting the design of targeted school-based policies and interventions to reduce bullying, promote inclusion, and safeguard the psychosocial well-being of children and adolescents who stutter. Full article
(This article belongs to the Section Pediatric Mental Health)
Show Figures

Figure 1

23 pages, 1312 KB  
Article
From Text to Structure: Precise Cognitive Diagnosis via Semantic Enhancement and Dynamic Q-Matrix Calibration
by Jingxing Fan, Zhichang Zhang and Yuming Du
Appl. Sci. 2026, 16(7), 3477; https://doi.org/10.3390/app16073477 - 2 Apr 2026
Viewed by 237
Abstract
Traditional cognitive diagnosis models typically rely on expert-annotated Q-matrices to define the relationship between exercises and knowledge concepts. This process is not only highly subjective and costly, but also prone to introducing noise and bias, which directly affects diagnostic accuracy. Meanwhile, most existing [...] Read more.
Traditional cognitive diagnosis models typically rely on expert-annotated Q-matrices to define the relationship between exercises and knowledge concepts. This process is not only highly subjective and costly, but also prone to introducing noise and bias, which directly affects diagnostic accuracy. Meanwhile, most existing deep learning-based methods overlook the rich semantic information contained in concept descriptions, making it difficult to deeply model the intrinsic relationships among knowledge points, resulting in limited interpretability of the models. To address these issues, this paper proposes a cognitive diagnosis model that incorporates key textual information from concept descriptions to refine the Q-matrix (KECQCD). The core innovation of the model lies in leveraging the pre-trained language model RoBERTa to encode concept texts, fusing semantic features with identifier embeddings through a gating mechanism to construct semantically-enhanced concept representations. It designs a concept-exercise heterogeneous information network and employs a graph attention mechanism to adaptively aggregate node features, explicitly modeling high-order knowledge dependencies. Furthermore, a multi-task joint learning framework is established to predict student performance while dynamically correcting association errors in the initial Q-matrix. Experimental results on the public Junyi dataset show that the KECQCD model significantly outperforms mainstream baseline models across multiple metrics, including accuracy (ACC), area under the curve (AUC), and root mean square error (RMSE). Ablation studies confirm the effectiveness of each core module, and diagnostic consistency (DOA) evaluation further demonstrates the enhanced interpretability of the model’s outcomes. This research offers a new solution for building accurate, reliable, and interpretable cognitive diagnosis systems, contributing positively to the advancement of personalized intelligent education. Full article
Show Figures

Figure 1

24 pages, 4191 KB  
Article
TR-BiGRU-CRF: A Lightweight Key Information Extraction Approach for Civil Aviation Flight Crew Operational Instructions
by Weijun Pan, Yao Zheng, Yidi Wang, Sheng Chen, Qinghai Zuo, Tian Luan and Chen Zeng
Appl. Sci. 2026, 16(7), 3461; https://doi.org/10.3390/app16073461 - 2 Apr 2026
Viewed by 190
Abstract
To enhance flight safety and operational efficiency, extracting key actions, flight parameters, and status information from civil aviation flight crew instructions generated during pre-flight and in-flight procedures is crucial. However, such texts are highly condensed and involve complex multi-role interactions, easily leading to [...] Read more.
To enhance flight safety and operational efficiency, extracting key actions, flight parameters, and status information from civil aviation flight crew instructions generated during pre-flight and in-flight procedures is crucial. However, such texts are highly condensed and involve complex multi-role interactions, easily leading to entity boundary drift and category misclassification. To address this, this paper proposes a joint key information extraction framework based on a lightweight pre-trained language model (TinyBERT) and a Role-Aware Fusion mechanism, abbreviated as TR-BiGRU-CRF. This framework introduces the Role-Aware Fusion mechanism to resolve semantic ambiguity caused by multi-party interactions, utilizes TinyBERT for semantic representation that balances accuracy and computational efficiency, and employs BiGRU-CRF for robust sequence feature modeling and decoding. Experiments on a flight crew instruction dataset show that the proposed method achieves 92.2% precision, 91.8% recall, a 92.0% F1 score, and an overall prediction accuracy of 92.6%. Compared to the BiGRU-CRF baseline, it significantly improves accuracy, precision, and F1 score by 11.4, 13.3, and 13.5 percentage points, respectively. These results prove that the proposed method effectively mitigates boundary drift and category confusion, providing strong support for flight crew instruction understanding and safety decision-making. Full article
(This article belongs to the Topic AI-Enhanced Techniques for Air Traffic Management)
Show Figures

Figure 1

23 pages, 8076 KB  
Article
Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff
by Guoling Liang, Zhaoyu Su, Chunhai Li, Mingfeng Chen and Feng Zhao
Information 2026, 17(4), 339; https://doi.org/10.3390/info17040339 - 1 Apr 2026
Viewed by 195
Abstract
The integration of blockchain into parked vehicle edge computing (PVEC) has emerged as a promising approach to mitigate the inherent trust challenges in distributed and untrusted computing environments. However, during task offloading and consensus, vehicles are vulnerable to location information disclosure, leading to [...] Read more.
The integration of blockchain into parked vehicle edge computing (PVEC) has emerged as a promising approach to mitigate the inherent trust challenges in distributed and untrusted computing environments. However, during task offloading and consensus, vehicles are vulnerable to location information disclosure, leading to privacy leakage. To address this problem, we propose a location differential privacy-enabled blockchain PVEC (DBPVEC) framework to protect location information during offloading and consensus. Specifically, we design a location differential privacy mechanism based on the Laplace mechanism and theoretically prove that it satisfies ε-differential privacy. This mechanism perturbs vehicles’ locations, and a privacy-preserving offloading strategy is designed to enhance the Hotstuff consensus and protect location privacy in edge computing. Subsequently, we formulate a joint optimization problem, considering system energy consumption, latency, and privacy strength. To solve it, we design a two-layer deep reinforcement learning (DRL) algorithm, with a Deep Q-Network (DQN) as the upper layer and a Deep Deterministic Policy Gradient (DDPG) as the lower layer, to determine the optimal offloading strategy. The experimental results demonstrate that our scheme achieves significant reductions compared to the two baseline methods: the total cost decreases by 68.31% and 63.25%, energy consumption by 9.96% and 16.27%, and delay by 31.46% and 18.07%, respectively. Moreover, it effectively preserves vehicle location privacy during task offloading and consensus while maintaining favorable performance in energy consumption and latency. Full article
(This article belongs to the Section Information and Communications Technology)
Show Figures

Figure 1

20 pages, 41296 KB  
Article
Frequency-Domain Feature Learning Network for Joint Image Demosaicing and Denoising
by Donghui Zhang, Feiyu Li, Jun Yang and Le Yang
Mathematics 2026, 14(7), 1175; https://doi.org/10.3390/math14071175 - 1 Apr 2026
Viewed by 232
Abstract
The methods employed for image demosaicing and denoising play a pivotal role in image acquisition and restoration, and have been extensively studied over the past few decades. Traditionally, these tasks are performed sequentially, with demosaicing followed by denoising, or vice versa, treating each [...] Read more.
The methods employed for image demosaicing and denoising play a pivotal role in image acquisition and restoration, and have been extensively studied over the past few decades. Traditionally, these tasks are performed sequentially, with demosaicing followed by denoising, or vice versa, treating each process independently. While this approach can enhance image quality, it often leads to issues such as color inaccuracies and information loss, as the outcome of the first task influences the second. Consequently, the integration of joint demosaicing and denoising (JDD) has become a focal point in recent research. Deep convolutional neural networks have shown promising results in addressing JDD challenges. This study introduces an end-to-end network, termed the Frequency-domain Features learning Network (FFNet), designed to tackle the JDD problem. Unlike conventional methods that focus on spatial domain features, FFNet utilizes frequency-domain (FD) characteristics to capture both global and local image details. Based on the vision Transformer architecture, FFNet consists of two key components: a global Fourier block (GFB), which uses global attention to determine the weights of FD parameters, and an MLP-based local Fourier block (LFB), which improves local feature extraction. These blocks are integrated with a channel attention mechanism to form the frequency-domain attention block (FAB), the core element of FFNet. Extensive experimental results on benchmark datasets demonstrate that FFNet achieves superior performance in terms of both quantitative metrics (PSNR/SSIM) and visual quality compared to existing state-of-the-art JDD methods. Furthermore, we provide a comprehensive analysis of its computational efficiency, including parameter count, FLOPs, and inference time, showing a competitive trade-off between performance and complexity. Full article
Show Figures

Figure 1

21 pages, 13964 KB  
Article
Towards Generalizable Deepfake Detection via Facial Landmark-Guided Convolution and Local Structure Awareness
by Hao Chen, Zhengxu Zhang, Qin Li and Chunhui Feng
Algorithms 2026, 19(4), 270; https://doi.org/10.3390/a19040270 - 1 Apr 2026
Viewed by 216
Abstract
As deepfakes become increasingly realistic, there is a growing need for robust and highly accurate facial forgery detection algorithms. Existing studies show that global feature modeling approaches (Transformer, VMamba) are effective in capturing long-range dependencies, yet they often lack sufficient sensitivity to localized [...] Read more.
As deepfakes become increasingly realistic, there is a growing need for robust and highly accurate facial forgery detection algorithms. Existing studies show that global feature modeling approaches (Transformer, VMamba) are effective in capturing long-range dependencies, yet they often lack sufficient sensitivity to localized facial tampering artifacts. Meanwhile, traditional convolutional methods excel at extracting local image features but struggle to incorporate prior knowledge about facial anatomy, resulting in limited representational capability. To address these limitations, this paper proposes LGMamba, a novel detection framework that integrates facial guidance focusing on key facial components and fine-grained detail regions commonly manipulated in deepfakes with global modeling. First, we introduce an innovative Landmark-Guided Convolution (LGConv), which adaptively adjusts convolutional sampling positions using facial landmark information. This allows the model to attend to forgery-prone facial regions, such as the eyes and mouth. Second, we design a parallel Facial Structure Awareness Block (FSAB) to operate alongside the VMamba-based visual State-Space Model. Equipped with a multi-stage residual design and a CBAM attention mechanism, FSAB enhances the model’s sensitivity to subtle facial artifacts, enabling joint exploitation of global semantic consistency and fine-grained forgery cues within a unified architecture. The proposed LGMamba achieves superior performance compared to existing mainstream approaches. In cross-dataset evaluations, it attains AUC scores of 92.34% on CD1 and 96.01% on CD2, outperforming all compared methods. Full article
Show Figures

Figure 1

32 pages, 31939 KB  
Article
Hierarchical Prototype Alignment for Video Temporal Grounding
by Yun Tian, Xiaobo Guo, Jinsong Wang, Yuming Zhao and Bin Li
Entropy 2026, 28(4), 389; https://doi.org/10.3390/e28040389 - 1 Apr 2026
Viewed by 250
Abstract
Recent advances in vision-language cross-modal learning have substantially improved the performance of video temporal grounding. However, most existing methods directly associate global video features with sentence-level features, overlooking the fact that textual semantics usually correspond to only limited spatio-temporal regions within a video. [...] Read more.
Recent advances in vision-language cross-modal learning have substantially improved the performance of video temporal grounding. However, most existing methods directly associate global video features with sentence-level features, overlooking the fact that textual semantics usually correspond to only limited spatio-temporal regions within a video. This limitation often leads to unstable alignment in complex scenarios involving intertwined events and diverse actions. In essence, accurate video temporal grounding requires the joint modeling of fine-grained spatial semantics and heterogeneous temporal event structures. Motivated by this observation, we propose a hierarchical prototype alignment approach that models cross-modal correspondence between video and text through structured intermediate prototype representations. Specifically, the alignment process is decomposed into two complementary stages: object-phrase alignment and event-sentence alignment. In the object-phrase alignment stage, discriminative local visual regions and informative textual words are aggregated to construct object and phrase prototypes, thereby enhancing fine-grained spatial correspondence at the level of entities and localized actions. In the event-sentence alignment stage, object prototypes are further integrated along the temporal dimension to form event prototypes that represent continuous action units, enabling effective alignment with sentence-level semantics and facilitating the modeling of diverse temporal event structures. On this basis, we further directly inject cross-modal alignment information into candidate moment aggregation. This design allows candidate moment representations to emphasize query-relevant temporal regions. Extensive experiments on Charades-STA, ActivityNet Captions, and TACoS demonstrate that the proposed method outperforms existing approaches, validating the effectiveness of hierarchical prototype alignment for improving both cross-modal alignment quality and temporal grounding accuracy. Full article
Show Figures

Figure 1

19 pages, 1148 KB  
Article
Co-Occurring Model of Trauma and Substance Use: An Application of a Joint Latent Profile Framework
by Jasmín D. Llamas
Sci 2026, 8(4), 78; https://doi.org/10.3390/sci8040078 - 1 Apr 2026
Viewed by 234
Abstract
Trauma and substance use disorders commonly co-occur, are clinically complex, and are associated with poorer outcomes. This study applies mixture modeling methods in a co-occurring model to examine group membership patterns across trauma and substance use to identify differences in treatment outcomes. Using [...] Read more.
Trauma and substance use disorders commonly co-occur, are clinically complex, and are associated with poorer outcomes. This study applies mixture modeling methods in a co-occurring model to examine group membership patterns across trauma and substance use to identify differences in treatment outcomes. Using the constructs of trauma and substance use, a co-occurring model was conducted to examine group membership patterns at intake and identify differences in outcomes among court-mandated participants in a trauma-informed substance abuse treatment program. This approach uses a joint/cross-classification of two independent Latent Profile Analyses (LPAs) to examine patterns. Findings from the LPA identified three trauma and four substance use profiles. Classes from each LPA were regressed in the co-occurring model, resulting in 12 unique pattern combinations, which were then compared to examine the differences in graduate rates. The results demonstrated that those in the Minimal Trauma/Alcohol Use group were more likely to complete treatment than other higher drug-using populations. Given the complexity of the clinical treatment and the prevalence of co-occurring disorders, the application of this approach can provide a means to examine different grouping patterns across two diagnostic criteria that can guide and tailor treatment efforts. Full article
(This article belongs to the Section Integrative Medicine)
Show Figures

Figure 1

Back to TopTop