Search Results (2,333)

Search Parameters:
Keywords = code similarity

17 pages, 2577 KB  
Article
Characterization and Phylogenetic Analysis of the Complete Mitochondrial Genome of Celaenorrhinus victor (Lepidoptera: Hesperiidae)
by Yaping Hu, Site Luo, Zhentian Yan, Xiaomin Ge, Le Wang, Xu Zhou, Bin Chen, Hui Ding and Xiao Zheng
Genes 2026, 17(2), 244; https://doi.org/10.3390/genes17020244 - 19 Feb 2026
Viewed by 81
Abstract
Background: Skipper butterflies (Hesperiidae) are a morphologically distinctive lineage within Papilionoidea, yet relationships among many groups remain difficult to resolve, and mitochondrial genomic resources remain limited for some tribes, including Celaenorrhinini. Methods: We sequenced and characterized the complete mitochondrial genome of Celaenorrhinus victor using Illumina short-read sequencing. Gene content and organization were annotated, codon-usage patterns were assessed across Celaenorrhinus using relative synonymous codon usage and multiple compositional/selection tests (ENC–GC3s, neutrality, and PR2 analyses), selective constraints were evaluated using Ka/Ks for 13 protein-coding genes, and phylogenetic relationships were inferred with a partitioned maximum-likelihood analysis of 66 complete hesperiid mitogenomes. Results: The circular mitogenome of C. victor is 15,180 bp and contains the typical 37 genes (13 protein-coding genes, 22 tRNAs, and two rRNAs) plus an A + T-rich control region, with an overall A + T content of 79.64%. Gene order and orientation match those of other Celaenorrhinus and hesperiid mitogenomes. All protein-coding genes use standard invertebrate mitochondrial start codons (with cox1 initiating with TTG) and terminate with complete TAA stop codons. Codon usage is strongly biased toward A/U-ending codons and is broadly similar among five sampled Celaenorrhinus mitogenomes; ENC–GC3s, neutrality, and PR2 analyses indicate a predominant influence of A + T-directed mutational pressure with additional effects beyond base composition. Ka/Ks values for all 13 protein-coding genes were <1, consistent with pervasive purifying selection; cox genes were the most conserved, whereas several NADH dehydrogenase subunit genes evolved comparatively faster. The phylogeny recovered monophyletic Celaenorrhinini and a well-supported Celaenorrhinus clade, placing C. victor as sister to Celaenorrhinus consanguineus, while deeper nodes among major hesperiid lineages showed only moderate support in parts of the tree. Conclusions: This study provides a new mitogenomic resource for Celaenorrhinini and a comparative reference for codon usage and selective constraints within Celaenorrhinus, supporting the placement of C. victor within Hesperiidae while highlighting remaining uncertainty at deeper hesperiid divergences.
(This article belongs to the Section Animal Genetics and Genomics)

11 pages, 1064 KB  
Communication
TOTEMS: Histogram of Evolutionarily Conserved Amino Acids
by Michael J. Fajardo, Adam G. Marsh and John R. Jungck
Computation 2026, 14(2), 52; https://doi.org/10.3390/computation14020052 - 18 Feb 2026
Viewed by 79
Abstract
We have developed a tool that allows us to easily visualize evolutionary variation via complementary multiple sequence alignments and frequency-based stacked Sequence Logos. This tool, TOTEMS (hisTogram of evOluTionarily consErved aMino acidS), visualizes conserved regions in a multiple sequence alignment within regions of a three-dimensional structure that share similar degrees of evolutionary conservation as revealed in ConSurf output data. Unlike Sequence Logos that illustrate the relative frequency of individual amino acid residues (as in MSAViewer), or moving window averages that focus on properties such as hydrophobicity or electrical charge (as in CATH), TOTEMS can help users discriminate degrees of evolutionary conservation in adjacent positions within a three-dimensional structure. Thus, we offer a tool that serves to complement pre-existing visualization applications such as ConSurf, MSAViewer, and CATH. TOTEMS and its source code are freely available.
(This article belongs to the Section Computational Biology)

19 pages, 1159 KB  
Article
Inguinal Hernia Recurrence in Adults in Romania: A Five-Year Nationwide Analysis of Surgical Practice and Health System Disparities
by Anca Tigora, Dragos Garofil, Mihai Zurzu, Vlad Paic, Mircea Bratucu, Florian Popa, Valeriu Surlin, Sandu Ramboiu, Daniela Marinescu, Victor Strambu and Petru Radu
Medicina 2026, 62(2), 391; https://doi.org/10.3390/medicina62020391 - 17 Feb 2026
Viewed by 95
Abstract
Introduction: Recurrent inguinal hernia remains a clinically relevant outcome that is difficult to quantify in the absence of national prospective registries. In Romania, structural differences between public and private hospitals may further influence recurrence-related care, access to minimally invasive surgery, and resource utilization. This study aimed to assess recurrence patterns after inguinal hernia repair at a national level, with emphasis on reinterventions, patient-related risk factors, and health system disparities. Methods: A nationwide retrospective cohort study was conducted using administrative DRG data from the Romanian National Health Insurance House. All adult patients undergoing inguinal hernia repair in 2019 were identified and followed for five years (2019–2023). Reintervention was used as a proxy for recurrence. Surgical approach, hospital sector, length of stay, reimbursement, patient migration, geographic distribution, and comorbidities were analyzed using descriptive statistics and multivariable logistic regression to explore factors associated with laparoscopic approach and reintervention. Results: Among the 18,185 patients who underwent inguinal hernia repair in 2019, reintervention rates during follow-up ranged from 0.58% to 4.88%, a variability that reflects inherent limitations of administrative coding. Most reinterventions occurred in the year of the index surgery, suggesting early technical failure. Public hospitals managed the majority of cases and disproportionately absorbed recurrent and clinically complex patients. Access to laparoscopic repair was uneven and concentrated in large academic centers. Length of hospital stay declined gradually in public hospitals but remained consistently shorter in private institutions, reflecting differences in patient selection and care pathways. Reimbursement by The National Health Insurance House was similar for open and laparoscopic procedures. 
Conclusions: Recurrent inguinal hernia care in Romania is shaped by system-level disparities extending beyond surgical technique. Further progress requires reimbursement reform, establishment of a national hernia registry, and expansion of laparoscopic training to ensure equitable access to high-quality hernia care.
(This article belongs to the Special Issue Abdominal Surgery: Clinical Updates and Future Perspectives)

13 pages, 465 KB  
Article
The Increase in Kidney Biopsies in Germany—Potential Risks and Reasons
by Ludwig Matrisch and Yannick Rau
Kidney Dial. 2026, 6(1), 12; https://doi.org/10.3390/kidneydial6010012 - 17 Feb 2026
Viewed by 85
Abstract
Background: Kidney biopsy is the diagnostic gold standard for characterizing glomerular disease and other intrarenal pathologies. Despite its clinical importance, epidemiological trends in kidney biopsy incidence remain poorly understood in many developed healthcare systems. This study characterizes temporal and demographic trends in kidney biopsy utilization in Germany between 2006 and 2023, providing crucial data for resource allocation in renal pathology services. Methods: Data on all kidney biopsies (OPS code 1-465.0) performed in German hospitals were extracted from the Federal Statistical Office database and stratified by age and sex. Population denominators were obtained from national census data. Incidence rates per 100,000 inhabitants per year were calculated, and temporal trends were analyzed using Poisson regression with year as a continuous predictor variable. Separate models were fitted for overall population incidence, age-stratified incidence, and sex-stratified incidence. Results: The incidence of kidney biopsies increased 96.6% over 18 years, from 8.59 per 100,000 inhabitants in 2006 to 16.89 per 100,000 in 2023 (IRR: 1.0296 per year, 95% CI: 1.0287–1.0305; p < 0.0001). Age-stratified analysis revealed pronounced heterogeneity, with the oldest patients (>80 years) experiencing the steepest increase of 7.74% annually, while the youngest age group (<20 years) showed no significant temporal change. Sex-stratified analysis demonstrated similar increases in both males and females (3.36% and 3.04% annually, respectively). Conclusion: The substantial increase in kidney biopsy utilization in Germany over nearly two decades mirrors international patterns and suggests a global shift toward more liberal biopsy utilization in aging populations. Multiple factors likely contributed to this increase, including demographic aging, improved procedural safety and accessibility, evolving diagnostic guidelines, and expanding therapeutic options for glomerular disease. 
These findings underscore the need for national registry systems to optimize resource allocation for renal pathology and ensure equitable diagnostic access across healthcare systems.
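The headline figures in this abstract can be checked with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it does not reproduce the authors' Poisson regression. It converts the two quoted incidence rates into a percent increase and the reported IRR into a percent change per year.

```python
def percent_increase(start: float, end: float) -> float:
    """Relative change from start to end, in percent."""
    return (end - start) / start * 100.0


def irr_to_annual_percent(irr: float) -> float:
    """Convert an incidence rate ratio per year into a percent change per year."""
    return (irr - 1.0) * 100.0


# Figures quoted in the abstract (biopsies per 100,000 inhabitants per year).
rate_2006 = 8.59
rate_2023 = 16.89

increase = percent_increase(rate_2006, rate_2023)
annual = irr_to_annual_percent(1.0296)  # IRR reported in the abstract

print(f"Overall increase: {increase:.1f}%")
print(f"Modelled annual increase: {annual:.2f}%")
```

With the quoted rates, the first line prints 96.6%, matching the abstract, and the second prints 2.96%.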

18 pages, 12200 KB  
Article
An Efficient Design-to-Verification Framework for CubeSat ADCS: Application to INHA RoSAT
by Hye-Eun Yoo, Chang-Oh Kim, Sung-Hoon Mok, Jisoo Yu and Keeyoung Choi
Aerospace 2026, 13(2), 189; https://doi.org/10.3390/aerospace13020189 - 16 Feb 2026
Viewed by 183
Abstract
CubeSats are increasingly adopted for space missions due to their low cost and short development cycles. However, their attitude determination and control systems (ADCS) often suffer from limited verification environments and constrained hardware configurations. This study addresses the development and verification of a flight-ready ADCS for the INHA RoSAT 3U CubeSat under realistic constraints in hardware, software, and test infrastructure. A model-based design (MBD) approach is adopted to construct an integrated development pipeline covering algorithm design, simulation, automatic C code generation, and integration with flight software (FSW). The generated code is embedded into a closed commercial onboard computer framework while preserving consistency across model-in-the-loop (MIL) and processor-in-the-loop (PIL) verification stages. To compensate for the lack of full hardware-in-the-loop (HIL) facilities, a FlatSat-based Sensor-to-Actuator test strategy is introduced to validate critical hardware–software interfaces including signal polarity, unit consistency, mounting orientation, and data flow using actual flight hardware. Furthermore, a fault-aware hierarchical attitude control scheme is defined in which the controller transitions to an alternative controller upon actuator fault indications. The presented approach demonstrates a practical ADCS development and verification strategy suitable for resource-constrained CubeSat missions, providing guidance for teams facing similar limitations in cost, resources, and test infrastructure.
(This article belongs to the Section Astronautics & Space Science)

26 pages, 3735 KB  
Article
On Demand Secure Scalable Video Streaming for Both Human and Machine Applications
by Alaa Zain, Yibo Fan and Jinjia Zhou
Sensors 2026, 26(4), 1285; https://doi.org/10.3390/s26041285 - 16 Feb 2026
Viewed by 180
Abstract
Scalable video coding plays an essential role in supporting heterogeneous devices, network conditions, and application requirements in modern video streaming systems. However, most existing scalable coding approaches primarily optimize human perceptual quality and provide limited support for data privacy, as well as for machine analyses and the integration of heterogeneous sensor data. This limitation motivated the development of adaptive scalable video coding frameworks. The proposed approach is designed to serve both human viewers and automated analysis systems while ensuring high security and compression efficiency. The method adaptively encrypts selected layers during transmission to protect sensitive content without degrading decoding or analysis performance. Experimental evaluations on benchmark datasets demonstrate that the proposed framework achieves superior rate-distortion efficiency and reconstruction quality, while also improving machine analysis accuracy compared to existing traditional and learning-based codecs. In video surveillance scenarios, where the base layer is preserved for analysis, the proposed scalable human-machine coding (SHMC) method outperforms the scalable extension of H.265/High Efficiency Video Coding (HEVC), Scalable High Efficiency Video Coding (SHVC), reducing the average bits per pixel (bpp) by 26.38%, 30.76%, and 60.29% at equivalent mean Average Precision (mAP), Peak Signal-to-Noise Ratio (PSNR), and Multi-Scale Structural Similarity (MS-SSIM) levels, respectively. These results confirm the effectiveness of integrating scalable video coding with intelligent encryption for secure and efficient video transmission.
(This article belongs to the Special Issue Machine Learning in Image/Video Processing and Sensing)

11 pages, 1017 KB  
Article
Variation in Follicle-Stimulating Hormone Receptor Expression Is Associated with the Twinning Rate QTL Located on Bovine Chromosome 11 in Holstein Cattle
by Maryam Bakherad, João Paulo Nascimento Andrade, Sadrollah Molaei Moghbeli, Jackson F. Gille, Livia Martino-Duarte, Milo C. Wiltbank and Brian W. Kirkpatrick
Animals 2026, 16(4), 616; https://doi.org/10.3390/ani16040616 - 15 Feb 2026
Viewed by 223
Abstract
Twin births in dairy cattle present challenges for producers, resulting in increased prevalence of health issues for both cows and calves, thereby impacting profitability. Genome-wide association study (GWAS) analyses of the twinning rate in Holstein cattle have reported the most significant genomic association with twinning rate in a region containing two strong candidate genes: follicle-stimulating hormone receptor (FSHR) and luteinizing hormone/chorionic gonadotropin receptor (LHCGR). Coding-sequence variants of these genes were not associated with the twinning rate, suggesting that one of the two genes is differentially expressed in association with the twinning rate. Granulosa cells were collected from 38 Holstein cows that were selected to provide similar representation of genotypes for the twinning rate quantitative trait locus (QTL). RNA was extracted from granulosa cells and gene expression was assessed by quantitative PCR with data analyzed by the ΔΔCT method. Association of gene expression with QTL genotype was tested by the Kruskal–Wallis test with the QTL genotype based on the SNP most significantly associated with twinning rate. QTL genotype was significantly associated (p = 1.88 × 10⁻⁸) with the expression of FSHR but was not associated with LHCGR expression (p = 0.18). The increased FSHR expression was associated with an increasing copy number of the G allele and thus an increased twinning rate.
(This article belongs to the Special Issue Investigating Twin Pregnancies in Mono-Ovulatory Species)
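For readers unfamiliar with the ΔΔCT method named in this abstract, the following is a minimal sketch of its standard textbook form, assuming perfect doubling per PCR cycle. The function name and the Ct values are invented for illustration and are not taken from the study.

```python
def fold_change_ddct(
    ct_target_sample: float,
    ct_ref_sample: float,
    ct_target_control: float,
    ct_ref_control: float,
) -> float:
    """Relative expression by the standard 2^(-ddCt) formula.

    Assumes 100% amplification efficiency (product doubles each cycle).
    """
    # Normalise the target gene to the reference gene within each group.
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    # Compare the normalised sample against the normalised control.
    ddct = delta_sample - delta_control
    return 2.0 ** (-ddct)


# Hypothetical Ct values: the target crosses threshold two cycles earlier
# in the sample than in the control, with an unchanged reference gene.
fold = fold_change_ddct(24.0, 18.0, 26.0, 18.0)
print(fold)
```

A ΔΔCT of −2 corresponds to a four-fold higher expression in the sample relative to the control.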

22 pages, 7987 KB  
Article
RioCC: Efficient and Accurate Class-Level Code Recommendation Based on Deep Code Clone Detection
by Hongcan Gao, Chenkai Guo and Hui Yang
Entropy 2026, 28(2), 223; https://doi.org/10.3390/e28020223 - 14 Feb 2026
Viewed by 154
Abstract
Context: Code recommendation plays an important role in improving programming efficiency and software quality. Existing approaches mainly focus on method- or API-level recommendations, which limits their effectiveness to local code contexts. From a multi-stage recommendation perspective, class-level code recommendation aims to efficiently narrow a large candidate code space while preserving essential structural information. Objective: This paper proposes RioCC, a class-level code recommendation framework that leverages deep forest-based code clone detection to progressively reduce the candidate space and improve recommendation efficiency in large-scale code spaces. Method: RioCC models the recommendation process as a coarse-to-fine candidate reduction procedure. In the coarse-grained stage, a quick search-based filtering module performs rapid candidate screening and initial similarity estimation, effectively pruning irrelevant candidates and narrowing the search space. In the fine-grained stage, a deep forest-based analysis with cascade learning and multi-grained scanning captures context- and structure-aware representations of class-level code fragments, enabling accurate similarity assessment and recommendation. This two-stage design explicitly separates coarse candidate filtering from detailed semantic matching to balance efficiency and accuracy. Results: Experiments on a large-scale dataset containing 192,000 clone pairs from BigCloneBench and a collected code pool show that RioCC consistently outperforms state-of-the-art methods, including CCLearner, Oreo, and RSharer, across four types of code clones, while significantly accelerating the recommendation process with comparable detection accuracy. 
Conclusions: By explicitly formulating class-level code recommendation as a staged retrieval and refinement problem, RioCC provides an efficient and scalable solution for large-scale code recommendation and demonstrates the practical value of integrating lightweight filtering with deep forest-based learning.
(This article belongs to the Section Multidisciplinary Applications)
23 pages, 10369 KB  
Article
AI-Driven Methods in Façade Design
by Sanghyun Son and Hyoensu Kim
Buildings 2026, 16(4), 782; https://doi.org/10.3390/buildings16040782 - 13 Feb 2026
Viewed by 255
Abstract
This study proposes an integrated façade design framework that harmonizes the creative divergence of Generative AI with the economic efficiency of Design for Manufacturing and Assembly (DfMA). To address low productivity in the construction industry, a stepwise pipeline is developed, synthesizing image generation via Midjourney, automated coding using ChatGPT, and quantitative optimization. Central to this process is the Hamming Distance algorithm, which evaluates image similarity to implement core DfMA principles: standardization and simplification. The study introduces a multidimensional decision-making model utilizing Grid Size (GS), Replacement Rate (RR), and Hamming Threshold (HT) indices to visualize the trade-off between component minimization and design fidelity. This process transforms abstract 2D patterns into manufacturable geometric panels, bridging the gap between conceptual design and constructability. The results demonstrate that algorithmic optimization significantly reduces component count, contributing to potential cost savings and schedule reduction. Ultimately, this research establishes a collaborative model where architects’ qualitative insights complement AI’s quantitative analysis, enabling designers to regain agency over digital tools and realize creative visions within technical constraints.
(This article belongs to the Section Building Structures)
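The role of the Hamming Threshold mentioned in this abstract can be illustrated with a small sketch. The abstract does not say how image tiles are converted to bit strings, so the average-hash step below is an assumption, and all names (`average_hash`, `can_share_panel`, the tile data) are hypothetical.

```python
from typing import List, Sequence


def average_hash(pixels: Sequence[Sequence[float]]) -> List[int]:
    """Binary fingerprint: 1 where a pixel is brighter than the tile mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return [1 if value > mean else 0 for value in flat]


def hamming_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of positions at which two equal-length bit sequences differ."""
    if len(a) != len(b):
        raise ValueError("hashes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def can_share_panel(a: Sequence[int], b: Sequence[int], threshold: int) -> bool:
    """Treat two tiles as one standardized panel if their hashes are close."""
    return hamming_distance(a, b) <= threshold


# Toy 2x2 grayscale tiles: tile_b is a near copy of tile_a, tile_c is inverted.
tile_a = [[10, 200], [220, 15]]
tile_b = [[12, 190], [210, 30]]
tile_c = [[200, 10], [15, 220]]

hash_a, hash_b, hash_c = (average_hash(t) for t in (tile_a, tile_b, tile_c))
print(hamming_distance(hash_a, hash_b), can_share_panel(hash_a, hash_b, threshold=1))
print(hamming_distance(hash_a, hash_c), can_share_panel(hash_a, hash_c, threshold=1))
```

Raising the threshold merges more tiles into a shared panel type, which reduces component count at the expense of fidelity to the generated image. That is the trade-off the abstract describes.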
28 pages, 1177 KB  
Article
Context-Aware Code Review Automation: A Retrieval-Augmented Approach
by Büşra İçöz and Göksel Biricik
Appl. Sci. 2026, 16(4), 1875; https://doi.org/10.3390/app16041875 - 13 Feb 2026
Viewed by 226
Abstract
Manual code review is essential for software quality, but often slows down development cycles due to the high time demands on developers. In this study, we propose an automated solution for Python (version 3.13) projects that generates code review comments by combining Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG). To achieve this, we first curated a dataset from GitHub pull requests (PRs) using the GitHub REST Application Programming Interface (API) (version 2022-11-28) and classified comments into semantic categories using a semi-supervised Support Vector Machine (SVM) model. During the review process, our system uses a vector database to retrieve the top-k most relevant historical comments, providing context for a diverse spectrum of open-weights LLMs, including DeepSeek-Coder-33B, Qwen2.5-Coder-32B, Codestral-22B, CodeLlama-13B, Mistral-Instruct-7B, and Phi-3-Mini. We evaluated the system using a multi-step validation that combined standard metrics (BLEU-4, ROUGE-L, cosine similarity) with an LLM-as-a-Judge approach, and verified the results through targeted human review to ensure consistency with expert standards. The findings show that retrieval augmentation improves feedback relevance for larger models, with DeepSeek-Coder’s alignment score increasing by 17.9% at a retrieval depth of k = 3. In contrast, smaller models such as Phi-3-Mini suffered from context collapse, where too much context reduced accuracy. To manage this trade-off, we built a hybrid expert system that routes each task to the most suitable model. Our results indicate that the proposed approach improved performance by 13.2% compared to the zero-shot baseline (k = 0). In addition, our proposed system reduces hallucinations and generates comments that closely align with the standards expected from the experts.
(This article belongs to the Special Issue Artificial Intelligence in Software Engineering)
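The top-k retrieval step described in this abstract can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' system: an in-memory list stands in for the vector database, and the three-dimensional embeddings and review comments are invented.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def top_k(
    query: Sequence[float],
    corpus: Sequence[Tuple[str, Sequence[float]]],
    k: int,
) -> List[Tuple[float, str]]:
    """Return the k corpus entries most similar to the query, best first."""
    scored = [(cosine_similarity(query, vec), text) for text, vec in corpus]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical historical review comments with toy embeddings.
history = [
    ("Add a docstring", [1.0, 0.0, 0.0]),
    ("Handle the None case", [0.0, 1.0, 0.0]),
    ("Rename this variable", [0.9, 0.1, 0.0]),
    ("Avoid mutable default args", [0.0, 0.0, 1.0]),
]

# k = 3 is the retrieval depth highlighted in the abstract.
results = top_k([1.0, 0.2, 0.0], history, k=3)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

The retrieved comments would then be placed in the model's prompt as context. With k = 0 nothing is retrieved, which corresponds to the zero-shot baseline in the abstract.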

26 pages, 5458 KB  
Article
Knowledge-Driven Human-in-the-Loop Decision Support for Student Services Using Active Learning and Large Language Models
by Anil Eyupoglu, Kian Jazayeri and Erbuğ Çelebi
Appl. Sci. 2026, 16(4), 1802; https://doi.org/10.3390/app16041802 - 11 Feb 2026
Viewed by 191
Abstract
This study presents an AI-based, human-in-the-loop decision support system designed for large-scale institutional query routing and response generation. The proposed system combines semantic text classification with large language model-based response generation to assist administrative staff in handling high-volume natural language requests from various system users, while preserving human oversight. Using a dataset of 135,359 real student and staff interactions collected over 15 years, the system was designed, deployed, and evaluated in a live university information portal. The classification component achieved 95.88% accuracy in evaluation and 82.21% staff acceptance in practice, while 94.81% of AI-generated draft responses were adopted with minor edits. Operational evaluation showed a 30.8% reduction in resolution time, a 32.6% decrease in misrouting, and an increase in user satisfaction from 3.6 to 4.9 out of 5. The system is implemented as a modular RESTful API to ensure interoperability with existing Student Information Systems, with analysis code available upon request to support replication in similar resource-constrained environments. The results illustrate how human-in-the-loop AI systems can support improvements in service quality, efficiency, and institutional capacity in resource-constrained environments, providing a transferable applied AI framework for scalable decision support in complex administrative domains.

29 pages, 4856 KB  
Article
Evaluating LLMs for Source Code Generation and Summarization Using Machine Learning Classification and Ranking
by Hussain Mahfoodh, Mustafa Hammad, Bassam A. Y. Alqaralleh and Aymen I. Zreikat
Computers 2026, 15(2), 119; https://doi.org/10.3390/computers15020119 - 10 Feb 2026
Viewed by 354
Abstract
Large language models (LLMs) have been widely adopted by the software engineering community for code generation and code summarization tasks. New LLMs are emerging regularly with improved functionalities, efficiency, and expanding data that allow models to learn more effectively. The lack of guidelines for selecting the right LLMs for coding tasks makes the selection a subjective choice by developers rather than a choice built on code complexity, code correctness, and linguistic similarity analysis. This research investigates the use of machine learning classification and ranking methods to select the best-suited open-source LLMs for code generation and code summarization tasks. This work conducts a comparison experiment on four open-source LLMs (Mistral, CodeLlama, Gemma 2, and Phi-3) and uses the MBPP coding question dataset to analyze code-generated outputs in terms of code complexity, maintainability, cyclomatic complexity, code structure, and LLM perplexity by collecting these as a set of features. An SVM classifier is trained on the most highly correlated feature pairs, and the models are evaluated through performance metrics, including accuracy, area under the ROC curve (AUC), precision, recall, and F1 scores. The RankNet ranking methodology is used to evaluate code summarization model capabilities by measuring ROUGE and BERTScore accuracies between LLM code-generated summaries and the coding questions used from the dataset. The study results show a maximum accuracy of 49% for the code generation experiment, with the highest AUC score reaching 86% among the top four correlated feature pairs. The highest precision score reached is 90%, and the recall score reached up to 92%. Code summarization experiment results show that Gemma 2 achieved the highest ranking among the models, with a RankNet win probability score of 1.93. Phi-3 ranked second with a score of 1.66. The research highlights the potential of machine learning to select LLMs based on coding metrics and points to future work on improving accuracy, broadening dataset diversity, and exploring other machine learning algorithms.
(This article belongs to the Special Issue AI in Action: Innovations and Breakthroughs)

14 pages, 265 KB  
Article
Sports Nutrition Misinformation on Spanish-Language YouTube and Digital Health Literacy: Mapping a Young–Adult Relevant Information Environment
by Ainoa Sofía Pastor-González, Juan Pablo Hervás-Pérez, Eva María Rodríguez-González, María Del Carmen Lozano-Estevan, Carlos Ruíz-Núñez, Cibeles Serna-Menor and Ivan Herrera-Peco
Youth 2026, 6(1), 18; https://doi.org/10.3390/youth6010018 - 7 Feb 2026
Viewed by 263
Abstract
YouTube is a de facto learning environment for athletes seeking fast, actionable nutritional guidance, yet platform dynamics may favor simplified or testimonial narratives over evidence-aligned messages. This study maps Spanish-language sports-nutrition videos to clarify who is most visible, how advice is framed, and [...] Read more.
YouTube is a de facto learning environment for athletes seeking fast, actionable nutritional guidance, yet platform dynamics may favor simplified or testimonial narratives over evidence-aligned messages. This study maps Spanish-language sports-nutrition videos to clarify who is most visible, how advice is framed, and what users encounter first. We conducted a cross-sectional, mixed-methods study of 558 YouTube videos on pre/post-exercise nutrition and supplementation. Data were coded for video type (divulgation/testimonial), claim presence, evidence links, and creator status (professional/non-professional). Exposure-adjusted metrics (View Ratio, Viewer Interaction) and nonparametric tests summarized distributions. An undirected network generated centrality rankings to select qualitative samples. Thematic analysis of titles and descriptions identified recurring rhetorical patterns and discourse modes. Divulgation videos predominated (97.3%). Evidence links were rare (0.2%). Exposure and interaction were right-skewed, indicating concentrated visibility. Non-professionals produced most videos, with older uploads and higher daily view accrual; however, interaction per view was similar across groups. Qualitative synthesis revealed two dominant discourse modes, scientific–cautious and experience–testimonial. Oversimplification and motivational cues clustered in testimonial/non-professional items; instructional language and scarce evidence links concentrated in professional/divulgation items. In Spanish sports-nutrition content, visibility is concentrated, and creator identity shapes advice framing. Evidence-aligned messages can compete when expressed with clear athletic framing, explicit caveats, and links to trustworthy sources.
28 pages, 6547 KB  
Article
Contamination of Amorphous Carbon Thin Films: Modelling the Transport of Atoms in Gases During Deposition
by Pedro M. A. Guerreiro, Ana Rita G. E. Pires, Susana M. C. S. Fidalgo, Orlando M. N. D. Teodoro, Pedro Costa Pinto and Nenad Bundaleski
C 2026, 12(1), 13; https://doi.org/10.3390/c12010013 - 4 Feb 2026
Viewed by 241
Abstract
Monte Carlo simulations of the transport of atoms in gases related to the deposition process and the contamination of amorphous carbon thin films during deposition in magnetron discharges have been performed. These films are of interest in accelerator technology due to their low secondary electron yield when their structures are dominated by sp² carbon. Two codes, which share practically the same algorithm, are introduced: TAGs 1 simulates the transport of sputtered atoms from the target to the substrate, and TAGs 2 simulates the transport of atoms from the plasma towards the target and the substrate. The similarity between the results of TAGs 1 and those of the well-established SIMTRA for the same input parameters supports the algorithm’s accuracy. The codes were used to model the transport of different atoms (C, H, O, D) in a magnetron Ar discharge. The simulations reveal that the operating pressure should be higher than 1 Pa for a sample-target distance of 90 mm to secure sp² carbon formation. The contamination mechanisms of amorphous carbon coatings were then studied by merging the results obtained with both programs. Preliminary comparisons suggest that the combined results of TAGs 1 and 2 agree very well with the experiments.
26 pages, 403 KB  
Article
How the Representation of Retrieved Context Affects In-Context Prompting for Commit Message Generation
by Dokyeong An and Geunseok Yang
Electronics 2026, 15(3), 652; https://doi.org/10.3390/electronics15030652 - 2 Feb 2026
Viewed by 131
Abstract
High-quality commit messages are essential software artifacts because they succinctly communicate the intent and scope of code changes, yet large language models (LLMs) often fail to reflect project-specific writing conventions when used in a zero-shot setting without contextual signals. This study investigates not whether retrieval helps, but how the same retrieved example, when represented differently in the prompt, quantitatively changes generation outcomes. We implement a retrieve-then-generate framework where the target commit’s diff is used as a query for BM25 (Best Matching 25)-based sparse retrieval over a commit-level database, and the top-1 similar commit is optionally injected as an example context. We compare a no-context condition (K = 0) against a minimal-context condition (K = 1) under three context representations: Diff-only, Message-only, and Diff + Message pair. Using Qwen-7B on 8000 evaluation samples with a fixed prompt skeleton, deterministic decoding, and identical post-processing across conditions, we observe negligible differences at K = 0 (BLEU-4 1.14, ROUGE-L 7.47–7.48, METEOR 4.88–4.91), establishing a stable baseline. At K = 1, the same top-1 retrieved case yields systematically different metric responses depending on how it is represented (Diff-only, Message-only, or Diff + Message), even though the prompt skeleton, decoding, and post-processing are held fixed. This indicates that “context representation” is not a cosmetic formatting choice but a first-class prompt-design variable in retrieval-augmented in-context learning for commit message generation. Accordingly, practitioners should select the representation based on the intended objective (e.g., lexical/style alignment vs. change-intent grounding), rather than assuming a universally optimal format.
(This article belongs to the Special Issue AI-Powered Natural Language Processing Applications)
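The retrieve-then-generate setup described in this abstract can be sketched roughly as follows: a small Okapi BM25 scorer picks the top-1 commit for a target diff, and a prompt builder injects that commit as Diff-only, Message-only, or Diff + Message. This is an illustration under stated assumptions, not the authors' implementation. Whitespace tokenization, indexing the stored diffs, the parameters `k1=1.5` and `b=0.75`, the prompt wording, and all names (`BM25`, `build_prompt`, the toy database) are assumptions not given in the abstract.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: plain lowercase whitespace tokenization.
    return text.lower().split()


class BM25:
    """Minimal Okapi BM25 scorer over a list of documents."""

    def __init__(self, docs, k1=1.5, b=0.75):
        self.k1, self.b = k1, b
        self.docs = [tokenize(d) for d in docs]
        self.avgdl = sum(len(d) for d in self.docs) / len(self.docs)
        df = Counter()
        for d in self.docs:
            df.update(set(d))  # document frequency per term
        n = len(self.docs)
        self.idf = {t: math.log(1 + (n - f + 0.5) / (f + 0.5))
                    for t, f in df.items()}

    def score(self, query, idx):
        doc = self.docs[idx]
        tf = Counter(doc)
        total = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            freq = tf[term]
            norm = freq + self.k1 * (1 - self.b + self.b * len(doc) / self.avgdl)
            total += self.idf[term] * freq * (self.k1 + 1) / norm
        return total

    def top1(self, query):
        # Index of the highest-scoring document (first one wins ties).
        return max(range(len(self.docs)), key=lambda i: self.score(query, i))


def build_prompt(target_diff, example=None, mode="pair"):
    """example=None is the K = 0 condition; mode is 'diff', 'message' or 'pair'."""
    parts = []
    if example is not None:
        if mode in ("diff", "pair"):
            parts.append("Example diff:\n" + example["diff"])
        if mode in ("message", "pair"):
            parts.append("Example message:\n" + example["message"])
    parts.append("Diff:\n" + target_diff)
    parts.append("Commit message:")
    return "\n\n".join(parts)


if __name__ == "__main__":
    # Hypothetical two-commit database.
    db = [
        {"diff": "fix null pointer in parser", "message": "Fix NPE in parser"},
        {"diff": "add readme docs section", "message": "Update README"},
    ]
    target = "parser null pointer crash"
    best = db[BM25([c["diff"] for c in db]).top1(target)]
    for mode in ("diff", "message", "pair"):
        print(build_prompt(target, best, mode), end="\n---\n")
```

Only the example block changes between the three modes while the rest of the prompt stays fixed, which mirrors the controlled comparison the abstract describes.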