ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs

Gong, Peixun; Li, Jinyou; Tian, Weixue; Ding, Xiang; Su, Runlang; Yue, Dan

doi:10.3390/genes17010017

Open AccessArticle

ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs

by

Peixun Gong

^1,†,

Jinyou Li

^2,†,

Weixue Tian

¹,

Xiang Ding

^3,4,

Runlang Su

^3,*

and

Dan Yue

^1,*

¹

College of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China

²

Faculty of Computing & Data Sciences, Boston University, Boston, MA 02215, USA

³

School of Mechanical and Electrical Information, Yiwu Industrial and Commercial College, Jinhua 322000, China

⁴

University of Chinese Academy of Sciences, Beijing 100049, China

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Genes 2026, 17(1), 17; https://doi.org/10.3390/genes17010017

Submission received: 10 December 2025 / Revised: 22 December 2025 / Accepted: 24 December 2025 / Published: 25 December 2025

(This article belongs to the Section Bioinformatics)

Download

Browse Figures

Versions Notes

Abstract

Background/Objectives: Early larval development is critical for caste and sex differentiation in honeybees. This study investigates molecular divergence in 4-day-old Apis mellifera larvae and introduces a customized deep learning model for hub-gene discovery. Methods: Genome-guided RNA-seq, DEGs, WGCNA, and splicing analyses were integrated. A hybrid convolution–attention model, ACmix-Swin, combined with WGAN-GP augmentation, was developed to classify larvae and prioritize caste-biased genes. Selected genes were validated by qPCR. Results: Significant caste- and sex-specific divergence was detected in cuticle formation, hormone metabolism, and reproductive signaling. ACmix-Swin achieved the highest accuracy among baseline models and consistently identified key regulators, including Vg, LOC725841, LOC412768, and LOC100576841. qPCR confirmed RNA-seq trends. Conclusions: Caste- and sex-specific transcriptional programs are established early in larval development. The ACmix-Swin framework provides an effective strategy for high-dimensional transcriptome interpretation and robust hub-gene identification.

Keywords:

Apis mellifera; RNA-seq; caste differentiation; deep learning; ACmix-Swin

Graphical Abstract

1. Introduction

Bees are essential pollinators, and their colonies consist of queens, drones, and workers, each performing distinct biological roles [1,2,3,4,5,6]. These castes differ in morphology as well as in epigenetic and molecular regulatory features [1,3]. The queen is the only reproductive female, drones function mainly in mating, and workers—sterile females—carry out brood care, foraging, nest construction, and colony defense [4,5,6]. Honeybee development proceeds through embryonic, larval, pupal, and adult stages. The larval period is a highly dynamic phase involving rapid growth and extensive transcriptomic and epigenetic remodeling that underlies caste differentiation. Before day three, larvae are fed royal jelly, allowing female larvae to retain the potential to develop into either workers or queens [7]. The fourth larval day is a key transition point at which metabolic reprogramming and major shifts in gene regulation occur. Many caste-related, immune, and metabolic pathways show their strongest changes between days 3 and 5, making day four an informative stage for studying developmental regulation [8,9,10,11]. Nevertheless, despite these technical constraints, these studies consistently demonstrated that 4-day-old larvae represent a critical—and effectively a “golden”—developmental window for investigating honeybee caste differentiation, during which caste-specific transcriptional programs are robustly established [12]. Honeybee caste differentiation has been extensively investigated at the transcriptomic level, revealing the complex molecular mechanisms underlying phenotypic plasticity. Previous RNA-seq studies have demonstrated that thousands of genes are differentially expressed between queen- and worker-destined larvae during critical developmental windows [13,14]. These studies indicate that caste-specific developmental trajectories are established through coordinated changes in gene expression involving hormonal signaling, metabolic reprogramming, and epigenetic regulation [12,15]. More recently, advances in sequencing technologies have uncovered additional layers of regulation, including alternative splicing, poly(A) tail dynamics, and RNA modifications such as m6A methylation, all of which contribute to the formation of caste-specific transcriptomic landscapes [15]. Honeybee caste differentiation is a classical model for understanding developmental plasticity and has been extensively investigated at the transcriptomic level. Early microarray-based studies first revealed differential gene expression between developing queens and workers, identifying metabolic enzymes and regulatory factors as early molecular markers of caste differentiation [16]. Subsequent RNA-seq analyses systematically compared queen, worker, and drone larvae, demonstrating that reproductive, metabolic, and developmental pathways begin to diverge already at the larval stage [12,17]. Subsequent work further characterized the transcriptomic signatures of social interactions underlying caste development, emphasizing the dynamic interplay between intrinsic gene regulatory programs and extrinsic nutritional signals [17]. In the context of honeybee caste differentiation, extensive studies have already identified numerous caste-associated genes and pathways. For example, Vitellogenin (Vg) has been repeatedly recognized as a central regulator linking reproduction, metabolism, and social behavior, and is consistently enriched in queens relative to workers and drones. However, despite this rich body of work, there is still no consensus on which genes constitute the core regulators that most directly distinguish queen, worker, and drone phenotypes [10]. Most existing studies emphasize differential expression or functional enrichment, whereas the relative contribution and discriminative importance of individual genes across castes remain insufficiently resolved. This unresolved question motivates the use of model-based feature prioritization approaches, which aim to rank genes according to their contribution to caste discrimination rather than expression change alone [12].

Advances in sequencing technologies have greatly increased transcriptomic data volume, yet computational methods have not kept pace [18]. Modern transcriptomes are multidimensional and heterogeneous, presenting challenges for data interpretation [19,20]. Although public databases contain abundant transcriptome resources, only a small fraction is deeply analyzed [20]. Conventional statistical approaches identify differentially expressed genes but often fail to pinpoint key regulators driving phenotypic divergence [21,22,23]. Machine learning (ML) offers improved performance in handling large, complex datasets and integrating heterogeneous information [24,25], and automated ML further reduces technical barriers [26].

Deep learning (DL) has shown superior predictive accuracy compared with traditional ML in genomic applications [27,28,29], but small sample sizes often lead to overfitting. Generative adversarial networks (GANs) address this limitation by producing synthetic synthetic data improving model stability and performance [30,31]. GANs generate latent-space data through adversarial optimization and have been widely adopted in bioinformatics [31].

In this study, the WGAN-GP algorithm was applied to generate synthetic transcriptomic profiles for 4-day-old larvae of the three honeybee castes [32]. A hybrid architecture combining Swin-Transformer and CNN models was then constructed by integrating both original and GAN-generated datasets [33,34]. This fusion-based framework enabled efficient feature extraction and accurate classification of caste-specific transcriptomic patterns.

2. Materials and Methods

2.1. Sample Collection

Transcriptome datasets of 4-day-old Italian honeybee (Apis mellifera ligustica) larvae were retrieved from the NCBI Sequence Read Archive (SRA). These datasets were originally generated and published in previous transcriptomic studies investigating caste differentiation in honeybee larvae. In the present study, we reanalyzed these publicly available data using an updated reference genome and advanced analytical frameworks. For each caste at each developmental stage, six biological replicates were available, including 4-day-old workers (SRS1263211), queens (SRS1263243), and drones (SRS1263256) [12]. For the qPCR experiment, larvae were collected from A. mellifera colonies maintained near Yiwu Industrial & Commercial College. Colonies were kept at approximately 34 °C and 70% relative humidity without additional environmental manipulation. After collection, larval tissues were homogenized, filtered, and stored at −20 °C.

Total RNA was extracted using the Flash Pure Total RNA Micro Kit (catalog no. R516-PRL, Beijing, China) from embryonic samples preserved in PRL lysis buffer, following the manufacturer’s instructions. Subsequent qPCR assays were performed by Anhui Gaohes Biological Co., Ltd., AnQing, China.

2.2. Alignment and Quality Control of RNA-Seq Data

Quality control of the cleaned sequencing data was performed using FastQC. Raw reads were processed with Fastp (v0.23.2) for adapter removal and quality trimming, using the following parameters: automatic detection of paired-end adapters, Q20 trimming, filtering of reads with >40% low-quality bases or >5% ambiguous bases, and retention of reads ≥50 bp in length. Clean reads were aligned to the Apis mellifera reference genome (Amel_HAv3.1) using Hisat2 (v2.2.1) [35]. Hisat2 was run with default settings, including a 20 bp seed length, allowance for soft clipping, mismatch penalties of 6 and 2, and reporting up to five valid alignments per read.

SAM files were converted to sorted BAM files using Samtools (v1.6) [36]. Gene-level read counts were then generated with FeatureCounts [37]. Differential expression analysis was conducted using the DESeq2 algorithm integrated within Trinity (v2.15.1) [38,39]. p-values were adjusted using the Benjamini–Hochberg false discovery rate (FDR) method, and genes with FDR < 0.05 and |log₂FC| > 1 were considered significantly differentially expressed.

2.3. GO Enrichment Analysis of Differentially Expressed Genes and Alternatively Spliced Genes

GO annotation was performed using the longest CDS-derived protein sequences from the reference genome. Functional annotation was conducted with emapper.py (v2.1.10) [40] based on the EggNOG database [41]. The resulting annotation file was then converted into an org.db format, and GO enrichment analysis was carried out using the clusterProfiler package (v4.4) in R.

2.4. Alternative Splicing Analysis

Alternative splicing analysis was performed using SUPPA (v2.4) together with the reference genome annotation file (GTF) [42,43]. SUPPA was used to quantify five types of alternative splicing events (SE, MXE, A5, A3, RI), and transcript abundance was estimated with Salmon (v1.10.1). For each phenotype group (workers, drones, and queens), six biological replicates were used to calculate PSI values for all splicing events, and ΔPSI was computed to assess differences between groups. Significant differential splicing events were defined as those with ∆PSI ≥ 0.1 and p < 0.05 after FDR correction.

2.5. WGCNA Analysis

After removing outlier samples based on pairwise correlation analysis and hierarchical clustering, a weighted gene co-expression network was constructed using the WGCNA R package (v1.73) (https://cran.r-project.org/web/packages/WGCNA/index.html, accessed on 4 December 2025). The network was built with the following parameters: networkType = “signed”, mergeCutHeight = 0.25, minModuleSize = 30, and deepSplit = 2. Automatic block-wise module detection was applied, and a data-driven soft-thresholding power was selected to approximate scale-free topology. Module eigengenes (MEs) were calculated and correlated with experimental conditions to identify phenotype-associated modules. Gene–gene regulatory networks were then extracted from the selected modules.

2.6. Model Construction

To address the limitations imposed by high-dimensional transcriptomic data with small sample sizes, an integrated data augmentation strategy based on the Wasserstein GAN with Gradient Penalty (WGAN-GP) framework was employed. The WGAN-GP generator consisted of two fully connected layers (hidden dimension = 128; output dimension matching the number of transcriptomic features), whereas the discriminator was implemented using a three-layer architecture with spectral normalization to ensure training stability. Training parameters included a noise vector dimension of 64, a gradient penalty coefficient (λ_gp) of 10, a critic–generator update ratio of n_critic = 5, and the Adam optimizer (learning rate = 1 × 10⁻⁴, β₁ = 0.0, β₂ = 0.9).

Following adversarial training, multiple augmentation strategies were integrated to increase sample diversity and maintain biological plausibility. The augmented dataset consisted of WGAN-GP synthetic samples (30% of total augmentation), SMOTE-based interpolation (15%), Gaussian-noise perturbation (35%, with noise scaled to 0.05–0.15 of the feature-wise standard deviation), and Mixup combinations (20%, using a Beta(0.4, 0.4) mixing coefficient). To prevent synthetic samples from deviating excessively from the true data distribution, only WGAN-GP outputs with distances less than twice the median within-class distance were retained.

After filtering and integration, each phenotype category was expanded to 50–100 samples to meet the requirements of downstream deep learning model training. A schematic overview of the workflow is presented in Figure 1.

In model construction, a hybrid deep learning framework named ACmix-Swin was designed, integrating the strengths of CNN and Swin-Transformer architectures to capture both local gene co-expression patterns and global regulatory signals [44,45,46]. Input features were first processed through two fully connected embedding layers with LayerNorm, GELU activation, and a dropout rate of 0.15 to generate unified feature representations.

The core ACmix-Swin module consists of parallel convolutional and self-attention branches. The self-attention branch adopts a Swin-style windowed multi-head mechanism, which incorporates relative positional bias to model fine-grained regional dependencies. The attention computation is defined as:

A t t e n t i o n (Q, K, V) = s o f t m a x (\frac{Q K^{T}}{\sqrt{d_{k}}} + B) V

(1)

The convolutional branch employed depthwise separable convolutions (kernel size = 3, groups = embed_dim) followed by pointwise convolutions to extract local expression features, together with BatchNorm and GELU activation. Outputs from the convolutional and self-attention branches were dynamically fused through a set of learnable parameters. The fused representation was subsequently processed by LayerNorm and adaptive average pooling before entering the classification head (dropout = 0.3). The final prediction of the three phenotypic classes (workers, drones, and queens) was obtained by modulating the contributions of the two branches using learnable weights α and β.

O u t p u t = α \cdot {A t t e n t i o n}_{b r a n c h} + β \cdot {C o n v}_{b r a n c h}

(2)

The hybrid architecture of the ACmix-Swin model is illustrated in Figure 2.

Model training was performed using the AdamW optimizer (learning rate = 1 × 10⁻⁴; weight decay = 1 × 10⁻³), together with cosine annealing scheduling and a warmup phase during the first 10 epochs. Cross-entropy loss with label smoothing (smoothing = 0.05) was applied, and Mixup augmentation (α = 0.2) and gradient clipping (max_norm = 1.0) were incorporated to reduce overfitting. Training was run for a maximum of 300 epochs, and an early-stopping criterion based on validation loss (patience = 50) was used to ensure stable convergence. Core model hyperparameters included an embedding dimension of 64, eight attention heads, a window size of 7, and a dropout rate of 0.6.

To identify key regulatory genes, feature attribution was computed using the Gradient × Input method, in which gradients of the model output with respect to the input gene expression values are obtained by backpropagation and multiplied element-wise with the input to estimate gene importance, defined as:

{I m p o r t a n c e}_{i} = |\frac{\partial L o s s}{\partial x_{i}} \times x_{i}|

(3)

To identify class-specific marker genes, phenotype-specific importance scores were further computed. Gene importance derived from the deep learning model (DL score) was integrated with WGCNA topological measures—gene significance (GS) and module membership (MM)—to construct a combined scoring framework, defined as:

{C o m b i n e d \}_{s c o r e} = w_{D L} \times {D L \}_{s c o r e} + (1 - w_{D L}) \times {W G C N A \}_{s c o r e}

(4)

In this framework, wDLwDL controls the relative contribution of deep learning–derived importance and network topological metrics (e.g., wDL = 1.0 wDL = 1.0 indicates reliance on DL scores only, whereas wDL = 0.5 wDL = 0.5 represents equal weighting of DL and WGCNA). In this study, wDL = 1.0 wDL = 1.0. Based on the combined scores, the top 20 overall hub genes and the top 10 phenotype-specific hub genes for each caste were selected.

Model performance was systematically evaluated using accuracy, precision, recall, F1-score, and confusion matrices. All model training and score calculations were implemented in the PyTorch(v2.0) framework and accelerated using NVIDIA GPUs. The model implementation used in this study has been made publicly available on GitHub at: https://github.com/surunlang-creator/4dlarve-DLmodel, (accessed on 6 December 2025).

3. Results

3.1. RNA-Seq Overview and Quality Assessment

RNA sequencing was performed on all samples using the Illumina platform, generating an average of 3.9 Gbp of raw data per sample. After stringent quality filtering, approximately 3.85 Gbp of high-quality clean reads were retained for downstream analyses. Quality assessment indicated that both Q20 and Q30 values reached 100%, demonstrating excellent sequencing accuracy, and the mean GC content was 38.00%, consistent with expectations for the honeybee genome (Supplemental Table S1). Clean reads were aligned to the Apis mellifera reference genome using HISAT2, with more than 88% of reads uniquely mapped to a single genomic location. Sliding-window density analysis further confirmed the consistency and reliability of the mapping results. Across all 18 samples, a total of 645 million clean reads were uniquely mapped, yielding an overall average mapping rate of 94.47% (Supplemental Table S2).

3.2. Principal Component Analysis (PCA) of Samples

Dimensionality reduction and clustering assessment of the expression matrix were performed using the scikit-learn library in Python(v3.10). Standardized gene expression values were first obtained through z-score normalization (sklearn.preprocessing.StandardScaler), followed by principal component analysis (PCA; sklearn.decomposition.PCA). The first two principal components (PC1 and PC2) accounted for the majority of variance among samples. The PCA projection revealed clear separation among queen, worker, and drone samples, while biological replicates within each group were tightly clustered. Density distributions along the axes illustrated overall variation patterns on PC1 and PC2, and accompanying boxplots further confirmed distinct expression characteristics across the three castes. Ellipses denote the 95% confidence intervals for each caste group (Figure 3).

3.3. Traditional Analysis of Differentially Expressed Genes (DEGs)

In the comparison between drones and queens, 4107 differentially expressed genes (DEGs) were identified, including 1087 upregulated and 936 downregulated genes. A total of 3589 DEGs were detected between drones and workers (1163 upregulated and 919 downregulated), while 3044 DEGs were found between workers and queens (697 upregulated and 937 downregulated) (Figure 4a). Overall, drone expression profiles showed greater divergence from the two female castes, reflecting strong sex-related regulatory differences. In contrast, workers and queens exhibited fewer transcriptional differences, consistent with their shared female developmental trajectory.

Volcano plots illustrated the genome-wide expression patterns across castes (Figure 4b). In drone–queen comparisons, genes involved in cuticle formation (LOC102655429, LOC726185) and hormone regulation (Jhe) displayed pronounced differential expression. In drone–worker comparisons, fibroin-related genes (Fibroin1–4) were strongly downregulated in drones, indicating more active secretory functions in worker larvae. In worker–queen comparisons, reproductive and metabolic genes such as CPR3 and Vg were markedly upregulated in queens.

KNN-based unsupervised clustering further supported these trends (Figure 4c). Female individuals (workers and queens) formed a tight cluster (R = 0.883), whereas drones were clearly separated from both female groups (R = 0.776–0.857). Key caste-associated genes included Fibroin1 (higher in drones/queens) and Vg (lower in workers), suggesting that sex- and caste-related regulatory programs are already established at early larval stages.

3.4. Differential Gene Heatmap (DEGs)

The expression patterns of the top 30 most significant DEGs across drones, workers, and queens were visualized using a heatmap (Figure 5). Hierarchical clustering revealed clearly separated transcriptomic clusters for the three castes. Workers showed consistently high expression for most DEGs, whereas drones and queens displayed lower or caste-specific patterns, indicating strong transcriptional differentiation.

Fibroin-related genes (Fibroin1–4) were strongly upregulated in workers, consistent with their roles in nest construction. In contrast, queens showed marked upregulation of Vg (Vitellogenin), which was nearly absent in workers and drones, highlighting its importance in queen reproduction and metabolic regulation.

Drone-specific high-expression genes included LOC724477, LOC410452, LOC552799, LOC100578744, LOC727170, LOC409827, LOC409826, LOC107965608, and LOC552558, all showing low expression in workers and queens. Worker-enriched genes included LOC102653748, Fibroin2, LOC107964872, Fibroin3, Fibroin4, Fibroin1, LOC102654184, LOC726040, LOC102653922, LOC551869, LOC102655889, LOC100576458, LOC412843, LOC100577537, LOC551179, LOC725026, LOC551465, and LOC551544. Queen-specific expression was observed mainly for Vg, LOC406147, and LOC100577163 (Figure 5). These gene sets provide meaningful candidates for understanding the molecular mechanisms underlying honeybee caste differentiation.

3.5. GO and KEGG Analysis of Differentially Expressed Genes

GO and KEGG enrichment analyses revealed clear functional divergence among the three castes (Figure 6a). Drones showed reduced expression in cuticle and reproductive pathways compared with queens but higher expression of mating-related genes, indicating a bias toward sex-specific behavioral regulation. Workers were enriched in pupal development, fatty acid metabolism, hormone metabolism, and lipid processing, suggesting early activation of developmental and metabolic programs linked to their labor roles. GO CC terms highlighted differences in exoskeleton and extracellular matrix organization, while MF terms were enriched for chitin-binding, oxidoreductase activity, and amino acid transport. KEGG analysis indicated that queens emphasized reproductive metabolism and neuroregulation, whereas drones were enriched in fundamental metabolic pathways. The KO–gene–pathway relationships shown in the Sankey diagram (Figure 6b) revealed major enrichment in amino acid metabolism, fatty acid elongation, and hormone biosynthesis, particularly in drone–queen and worker–queen comparisons. Some genes associated with reproductive signaling and neural pathways further reflected the transcriptional specialization of queens. Overall, queens were enriched in reproductive and hormonal pathways, workers in structural and metabolic processes, and drones in amino acid and lipid metabolism, forming the functional basis of caste differentiation.

3.6. Deep Learning Analysis

3.6.1. WGAN-GP Analysis

To address the limitation of small sample size in transcriptomic analyses, the ACmix-Swin framework first applies Wasserstein GAN with Gradient Penalty (WGAN-GP) to augment all differentially expressed gene profiles, expanding each original sample into 100 synthetic samples to improve classification stability and generalization. The GAN was trained for 600 epochs, and four key indicators were used to assess training stability (Figure 7).

The generator and discriminator losses reached equilibrium after ~200 epochs, with generator loss stabilized at 1.5–2.0 and discriminator loss at −1.5 to −2.5 (Figure 7a), indicating stable adversarial dynamics without oscillation or mode collapse. The Wasserstein distance gradually decreased and stabilized at 1.5–2.0 (Figure 7b), and its 20-epoch moving average showed a consistent convergence trend, demonstrating that the generator effectively approximated the true data distribution. Discriminator scores also showed clear separation, with real samples centered around 0.4 and generated samples around –1.7 (Figure 7c), reflecting strong discriminative ability and suggesting room for further refinement of synthetic samples. Finally, the gradient penalty term decreased rapidly from 0.8 to ~0.02–0.05 within the first 50 epochs and remained stable thereafter (Figure 7d), confirming satisfaction of the 1-Lipschitz constraint and validating the theoretical assumptions of WGAN-GP.

Together, these metrics indicate that the WGAN-GP training process was stable, well-controlled, and free of mode collapse, providing a reliable foundation for downstream ACmix-Swin classification.

3.6.2. Performance Evaluation of the ACmix-Swin Classifier

The ACmix-Swin classifier demonstrated stable convergence and strong generalization on the augmented transcriptomic dataset (Figure 8). Training and testing losses declined rapidly within the first 50 epochs, with the testing loss reaching its minimum (0.2191) at epoch 251 (Figure 8a). Model accuracy remained consistently high: training accuracy reached 95% within 50 epochs and stabilized at 97–99%, while testing accuracy ranged from 95 to 97% with a peak of 95.8% (Figure 8b). The small train–test accuracy gap (3–6%; Figure 8c) confirms the effectiveness of regularization strategies. The learning rate schedule, combining warmup and cosine decay, further supported smooth optimization (Figure 8d). The final model assigned higher weight to the self-attention branch (0.462) than to the CNN branch (0.365) (Supplemental Figure S1). Overall, ACmix-Swin achieved robust caste classification performance and provides a reliable DL framework for small-sample transcriptomic analysis.

3.6.3. Hub Gene Analysis

To identify key regulators underlying caste differentiation, gene importance scores from the ACmix-Swin model were integrated with WGCNA network topology to construct a combined Hub-gene scoring framework (Combined score = DL_importance × GS × MM) (Supplemental Figure S2). The top 30 candidates per caste were selected based on Gradient × Input importance. Network visualization revealed four Hub-gene groups: overall hubs (blue circles), including the known queen marker Vg, and caste-specific hubs for drones (red squares), queens (green triangles), and workers (purple diamonds), each forming distinct subnetwork structures (Figure 9). Module–trait correlations further supported these patterns, with the pink module associated with drones, the salmon and yellow modules with queens, and the red module with workers, indicating cohesive module-level regulatory organization.

Overall, integrating deep learning with WGCNA improved Hub-gene prioritization and uncovered modular regulatory patterns central to caste differentiation. Four selected Hub genes were validated by qPCR, and their expression patterns were consistent with RNA-seq results, supporting the reliability of the screening strategy (Supplemental Figure S3a,b; Supplemental Table S3).

3.6.4. Model Performance Benchmarking

To evaluate model performance, ACmix-Swin was compared with nine commonly used machine learning and deep learning methods, including traditional models (KNN, SVM with RBF kernel, logistic regression, random forest), neural networks (MLP, 1D-CNN, LSTM), and a Transformer architecture. All methods were trained and tested on the same larval expression dataset using five-fold cross-validation.

ACmix-Swin achieved the highest accuracy (95.8%), outperforming all baselines. The best-performing baseline was KNN (82.2%), followed by Transformer (81.1%), and MLP, 1D-CNN, and RBF-SVM (each 80.0%). LSTM (78.9%), logistic regression (77.8%), and random forest (76.7%) showed lower performance. Compared with the strongest baseline (KNN), ACmix-Swin improved accuracy by 13.6 percentage points, corresponding to a 76.4% relative reduction in error rate. It also exceeded Transformer, CNN, and LSTM by 14.7–16.9 points, highlighting the advantage of the hybrid convolution–attention architecture.

Error-bar analysis showed higher variance (2–5%) across folds in baseline models, whereas ACmix-Swin displayed consistently stable performance, indicating superior generalization. Notably, standard deep learning models performed similarly to traditional methods in this task, whereas ACmix-Swin effectively captured both local and global expression patterns (Figure 10; Supplemental Figure S4 and Table S4).

3.6.5. Ablation Analysis of Model Components

The ablation study systematically assessed the contribution of each component in ACmix-Swin (Figure 11). The full model achieved the highest accuracy (95.8%), outperforming all ablated variants. Removing the convolution branch (86.7%) or the attention branch (81.1%) led to substantial performance drops, confirming the complementary roles of both modules. Modifying window size, embedding dimension, number of attention heads, or regularization strength also reduced accuracy to 80–84%, indicating that the current hyperparameters lie near an optimal configuration. Overall, the ablation results strongly support the necessity and effectiveness of the hybrid convolution–attention architecture (Figure 11; Supplemental Table S5).

4. Discussion

To characterize transcriptional differences in 4-day-old Apis mellifera ligustica larvae, we reanalyzed RNA-seq datasets from NCBI using a high-quality genome, and mapping statistics confirmed data reliability [47,48,49]. Machine-learning-based dimensionality reduction improved sample separation compared to previous studies (Figure 3). Differential expression revealed early divergence across pathways related to cuticle remodeling, hormone metabolism, sex determination, silk protein synthesis, and reproduction. For example, our differential expression analysis identified a total of 10,740 differentially expressed genes (DEGs), a number substantially higher than that reported in previous transcriptomic studies of honeybee caste differentiation, in which 475 and 687 DEGs were identified in 2-day-old and 4-day-old larvae, respectively [12]. This difference can be partly attributed to the use of an updated reference genome (Amel_HAv3.1 compared with OGSv3.2), as well as more stringent quality control and mapping strategies. More importantly, however, under conventional transcriptomic analysis frameworks, even when large numbers of DEGs are identified, interpretation is typically restricted to a small subset of genes with the largest log₂foldchanges or the most significant p-values. Although well-established caste-associated genes such as Vitellogenin (Vg) and Fibroin, which have been repeatedly reported in previous microarray- and RNA-seq–based studies, are consistently recovered, this screening strategy remains insufficient to resolve their relative importance within the global regulatory network. In particular, in the context of high-dimensional transcriptomic data, it remains challenging to determine which genes play dominant regulatory roles, which exhibit functional redundancy, and which minimal gene set constitutes the core discriminative signature distinguishing queen, worker, and drone phenotypes [23]. At the level of developmental timing, our results are highly consistent with those reported previously [12]. Multiple key signaling pathways and regulatory genes previously implicated in caste differentiation, including those involved in hormone biosynthesis, oocyte maturation, and Wnt/Notch signaling, were also identified as significantly differentially expressed in our analysis, further confirming the robustness and reproducibility of earlier findings. Building upon this established developmental framework, the present study extends traditional transcriptomic analyses by introducing a deep learning–based methodology. Unlike approaches that rely solely on statistical significance, the ACmix-Swin model directly couples gene expression patterns with caste classification tasks and prioritizes genes according to their contribution to classification performance. This enables genes to be systematically evaluated based on their discriminative importance for caste identity rather than expression magnitude alone, providing an interpretable ranking of key regulatory candidates and effectively complementing conventional differential expression and functional enrichment analyses [50,51]. Moreover, incorporating WGAN-GP improves robustness under small-sample conditions by learning and simulating high-dimensional expression distributions and expanding the effective training space. This strategy stabilizes feature attribution and facilitates the systematic identification of genes whose absence would substantially impair caste classification, highlighting them as candidate core regulators [52]. Therefore, our model-driven prioritization framework does not negate previous findings; rather, it provides quantitative evidence of central role for established candidates (Vg) while simultaneously nominating additional poorly annotated LOC genes as focused targets for downstream functional and network-level validation.

More importantly, many of the high-importance genes prioritized by the ACmix-Swin model map coherently onto the enriched biological pathways, improving accessibility for biologically oriented readers. For example, Vitellogenin (Vg), identified as a top-ranked hub gene, is centrally involved in reproductive metabolism, lipid transport, and endocrine regulation, consistent with the strong enrichment of steroid hormone biosynthesis and nutrient metabolism pathways in queens. Likewise, Or63, an annotated honeybee odorant receptor, was among the discriminative features and provides a direct molecular anchor to the enrichment of sensory/behavior-related pathways (chemosensory perception and neuroactive ligand–receptor interaction) [53]. Together, these concordant gene–pathway relationships enhance the biological interpretability of model-derived hub genes and support their functional relevance in caste differentiation [54].

Homology annotation indicated that several LOC genes (LOC102655429, LOC726185, LOC726094, LOC102655476, LOC726206) belong to cuticle protein families. Their strong upregulation in drones suggests early activation of exoskeleton construction and energy allocation programs (Figure 4a,b and Figure 5). GO and KEGG enrichment analyses further supported caste-specific developmental trajectories: drones were enriched for molting, cuticle formation and hormone pathways, while worker–queen differences involved extracellular matrix, neuroactive ligand–receptor interactions, and steroid/retinol metabolism (Figure 6a,b).

We applied the ACmix-Swin deep learning framework to prioritize caste-related regulators among 1874 DEGs. To address the high-dimensional, small-sample characteristics of RNA-seq data, we incorporated WGAN-GP augmentation, which has demonstrated superior stability and fidelity in transcriptomic tasks [30,52,55,56,57,58]. In our dataset, WGAN-GP enabled stable synthetic expression profiles (Figure 8), supporting improved classifier training. Benchmarking confirmed that ACmix-Swin achieved the highest accuracy and F1-score without overfitting (Figure 10 and Figure 11), consistent with best practices for RNA-seq classification [59,60,61].

Four high-importance genes were validated by qPCR, showing expression patterns consistent with RNA-seq (Supplemental Figures S2 and S3a,b), reinforcing the biological reliability of model predictions. This research strategy—combining model-driven feature prioritization with representative experimental validation—has been widely adopted in biomedical studies that integrate WGCNA with machine learning approaches and has been shown to be sufficient for assessing the biological reliability of model-derived feature selection. In such studies, investigators typically validate only a limited number of representative hub genes using qPCR or external datasets to evaluate whether the features identified by the model exhibit stable and reproducible biological signals, rather than performing exhaustive experimental validation of all candidate genes [62,63]. Similarly, methodological studies have emphasized that the primary purpose of qPCR validation is to support the statistical and biological robustness and reproducibility of feature selection strategies, rather than to achieve complete coverage of all candidates [64]. Therefore, this research paradigm itself provides adequate and reasonable experimental support for model-driven gene prioritization results.

Vitellogenin (Vg) emerged consistently as the most representative caste-biased gene. Its expression followed a queen > worker > drone pattern (Figure 4c), and queens exhibited the richest alternative splicing events (Supplementary Figure S5a,b; Table S3). Given Vg’s pleiotropic roles in reproduction, nutrient allocation, division of labor, lifespan, and immunity [65,66,67,68], it likely serves as a central regulatory hub in early caste differentiation.

Model-prioritized genes (LOC725841, LOC412768, LOC100576841) showed distinct caste-biased expression and may underlie queen energy metabolism, worker task specialization, and drone-specific reproductive or behavioral functions.

In summary, integrating differential expression, co-expression networks, and deep learning allowed us to define core molecular signatures of early caste divergence. The robust candidate genes identified here provide strong targets for future functional studies using RNAi or CRISPR.

5. Conclusions

In conclusion, this study integrates genome-guided transcriptome profiling with deep learning to dissect early caste differentiation in 4-day-old Apis mellifera larvae. By combining WGAN-GP augmentation, ACmix-Swin classification, differential expression, WGCNA, and splicing analyses, we consistently identified Vg and several caste-biased LOC genes as key regulatory candidates. The strong agreement between qPCR and RNA-seq results validates the reliability of our predictions. Overall, our findings show that sex- and caste-related transcriptional programs are established at an early larval stage and highlight the power of deep learning-based feature prioritization for interpreting high-dimensional insect transcriptomes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes17010017/s1, Supplemental Table S1: Quality of data; Supplemental Table S2: Mapping ratio of data Differentially Expressed Genes Among Drone, Worker, and Queen Larvae Differentially Expressed Genes Among Drone, Worker, and Queen Larvae; Supplemental Table S3: qPCR result; Supplemental Table S4: Compared of Machine Learning; Supplemental Table S5: Ablation comparison; Supplemental Figure S1: ACmix Fusion Weights; Supplemental Figure S2: Top Hub Genes by Phenotype; Supplemental Figure S3: qPCR of hub gene. Supplemental Figure S4: Comparison with Baseline Methods; Supplemental Figure S5a,b: Sasimiplot of Vg and qPCR of Vg splicing.

Author Contributions

Conceptualization, P.G., J.L. and R.S.; Methodology, P.G. and R.S.; Software, P.G. and R.S.; Validation, R.S., J.L. and W.T.; Formal analysis, X.D.; Investigation (experiments), P.G., J.L., W.T. and X.D.; Resources, D.Y. and X.D.; Data curation, X.D., P.G. and J.L.; Writing—original draft preparation, P.G., J.L., R.S. and D.Y.; Writing—review and editing, P.G., J.L., W.T., X.D. and R.S.; Visualization, J.L. and X.D.; Supervision, X.D.; Project administration, X.D.; Funding acquisition, X.D., R.S. and D.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the Yiwu City Science and Technology Bureau and Yiwu. Industrial and Commercial College Program under Grant Nos. XM2025DCZ0227 and XJKJ2502YB. This work was supported by Jinhua Key Laboratory of Robot Intelligent Welding Technology and by Jinhua Public Welfare Technology Application Research Project in 2024.

Institutional Review Board Statement

According to national and institutional regulations in China: Ethical approval is not required for studies involving non-protected invertebrate species, including honeybees.

Informed Consent Statement

Not applicable.

Data Availability Statement

Transcriptome data of 4-day-old Apis mellifera larvae were retrieved from the NCBI database, with six biological replicates for each caste (workers: SRS1263211; queens: SRS1263243; drones: SRS1263256).

Acknowledgments

We thank Qingdao Tongyuan Gene Technology Co., Ltd. for providing computational server support, and Anhui Gaohe Biotechnology for their experimental assistance. We also acknowledge the provision of the RNA extraction reagent (catalog no. R516-PRL, Beijing, China), which was essential for the completion of this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Luna-Lucena, D.; Rabico, F.; Simoes, Z.L.P. Reproductive capacity and castes in eusocial stingless bees (Hymenoptera: Apidae). Curr. Opin. Insect Sci. 2019, 31, 20–28. [Google Scholar] [CrossRef] [PubMed]
McKinnon, A.C.; Collins, L.; Wood, J.L.; Murphy, N.; Franks, A.E.; Steinbauer, M.J. Precision Monitoring of Honey Bee (Hymenoptera: Apidae) Activity and Pollen Diversity during Pollination to Evaluate Colony Health. Insects 2023, 14, 95. [Google Scholar] [CrossRef]
Zhang, Y.; Li, Z.; He, X.; Wang, Z.; Zeng, Z. H3K4me1 Modification Functions in Caste Differentiation in Honey Bees. Int. J. Mol. Sci. 2023, 24, 6217. [Google Scholar] [CrossRef]
Rangel, J.; Fisher, A. Factors affecting the reproductive health of honey bee (Apis mellifera) drones—A review. Apidologie 2019, 50, 759–778. [Google Scholar] [CrossRef]
Remolina, S.C.; Hughes, K.A. Evolution and mechanisms of long life and high fertility in queen honey bees. AGE 2008, 30, 177–185. [Google Scholar] [CrossRef]
Ding, X.; Li, J.; Yue, D.; Su, R. Integrating WGCNA, TCN, and Alternative Splicing to Map Early Caste Programs in Day-2 Honeybee Larvae. Genes 2025, 16, 1409. [Google Scholar] [CrossRef]
He, X.J.; Zhou, L.B.; Pan, Q.Z.; Barron, A.B.; Yan, W.Y.; Zeng, Z.J. Making a queen: An epigenetic analysis of the robustness of the honeybee (Apis mellifera) queen developmental pathway. Mol. Ecol. 2017, 26, 1598–1607. [Google Scholar] [CrossRef]
Chan, Q.W.T.; Howes, C.G.; Foster, L.J. Quantitative Comparison of Caste Differences in Honeybee Hemolymph * S. Mol. Cell. Proteom. 2006, 5, 2252–2262. [Google Scholar] [CrossRef]
Wolschin, F.; Amdam, G.V. Comparative proteomics reveal characteristics of life-history transitions in a social insect. Proteome Sci. 2007, 5, 10. [Google Scholar] [CrossRef] [PubMed]
Scharlaken, B.; de Graaf, D.C.; Goossens, K.; Peelman, L.J.; Jacobs, F.J. Differential gene expression in the honeybee head after a bacterial challenge. Dev. Comp. Immunol. 2008, 32, 883–889. [Google Scholar] [CrossRef] [PubMed]
Chan, Q.W.T.; Foster, L.J. Changes in protein expression during honey bee larval development. Genome Biol. 2008, 9, R156. [Google Scholar] [CrossRef]
He, X.-J.; Jiang, W.-J.; Zhou, M.; Barron, A.B.; Zeng, Z.-J. A comparison of honeybee (Apis mellifera) queen, worker and drone larvae by RNA-Seq. Insect Sci. 2019, 26, 499–509. [Google Scholar] [CrossRef]
Chen, X.; Hu, Y.; Zheng, H.; Cao, L.; Niu, D.; Yu, D.; Sun, Y.; Hu, S.; Hu, F. Transcriptome comparison between honey bee queen- and worker-destined larvae. Insect Biochem. Mol. Biol. 2012, 42, 665–673. [Google Scholar] [CrossRef]
Cameron, R.C.; Duncan, E.J.; Dearden, P.K. Biased gene expression in early honeybee larval development. BMC Genom. 2013, 14, 903. [Google Scholar] [CrossRef] [PubMed]
Wang, M.; Xiao, Y.; Li, Y.; Wang, X.; Qi, S.; Wang, Y.; Zhao, L.; Wang, K.; Peng, W.; Luo, G.-Z.; et al. RNA m6A Modification Functions in Larval Development and Caste Differentiation in Honeybee (Apis mellifera). Cell Rep. 2021, 34, 108580. [Google Scholar] [CrossRef]
Evans, J.D.; Wheeler, D.E. Differential gene expression between developing queens and workers in the honey bee, Apis mellifera. Proc. Natl. Acad. Sci. USA 1999, 96, 5575–5580. [Google Scholar] [CrossRef] [PubMed]
Vojvodic, S.; Johnson, B.R.; Harpur, B.A.; Kent, C.F.; Zayed, A.; Anderson, K.E.; Linksvayer, T.A. The transcriptomic and evolutionary signature of social interactions regulating honey bee caste development. Ecol. Evol. 2015, 5, 4795–4807. [Google Scholar] [CrossRef]
Berger, B.; Yu, Y.W. Navigating bottlenecks and trade-offs in genomic data analysis. Nat. Rev. Genet. 2023, 24, 235–250. [Google Scholar] [CrossRef] [PubMed]
Eraslan, G.; Avsec, Ž.; Gagneur, J.; Theis, F.J. Deep learning: New computational modelling techniques for genomics. Nat. Rev. Genet. 2019, 20, 389–403. [Google Scholar] [CrossRef]
Xu, C.; Jackson, S.A. Machine learning and complex biological data. Genome Biol. 2019, 20, 76. [Google Scholar] [CrossRef]
Porcu, E.; Sadler, M.C.; Lepik, K.; Auwerx, C.; Wood, A.R.; Weihs, A.; Sleiman, M.S.B.; Ribeiro, D.M.; Bandinelli, S.; Tanaka, T.; et al. Differentially expressed genes reflect disease-induced rather than disease-causing changes in the transcriptome. Nat. Commun. 2021, 12, 5647. [Google Scholar] [CrossRef]
Zhu, M.; Zhao, S. Candidate Gene Identification Approach: Progress and Challenges. Int. J. Biol. Sci. 2007, 3, 420–427. [Google Scholar] [CrossRef]
Abbas, M.; El-Manzalawy, Y. Machine learning based refined differential gene expression analysis of pediatric sepsis. BMC Med. Genom. 2020, 13, 122. [Google Scholar] [CrossRef]
Libbrecht, M.W.; Noble, W.S. Machine learning applications in genetics and genomics. Nat. Rev. Genet. 2015, 16, 321–332. [Google Scholar] [CrossRef]
Auslander, N.; Gussow, A.B.; Koonin, E.V. Incorporating Machine Learning into Established Bioinformatics Frameworks. Int. J. Mol. Sci. 2021, 22, 2903. [Google Scholar] [CrossRef] [PubMed]
Le, T.T.; Fu, W.; Moore, J.H. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics 2020, 36, 250–256. [Google Scholar] [CrossRef] [PubMed]
Chen, Y.; Li, Y.; Narayan, R.; Subramanian, A.; Xie, X. Gene expression inference with deep learning. Bioinformatics 2016, 32, 1832–1839. [Google Scholar] [CrossRef] [PubMed]
Koumakis, L. Deep learning models in genomics; are we there yet? Comput. Struct. Biotechnol. J. 2020, 18, 1466–1473. [Google Scholar] [CrossRef]
Lin, E.; Lin, C.-H.; Lane, H.-Y. Deep Learning with Neuroimaging and Genomics in Alzheimer’s Disease. Int. J. Mol. Sci. 2021, 22, 7911. [Google Scholar] [CrossRef]
Li, R.; Wu, J.; Li, G.; Liu, J.; Xuan, J.; Zhu, Q. Mdwgan-gp: Data augmentation for gene expression data based on multiple discriminator WGAN-GP. BMC Bioinform. 2023, 24, 427. [Google Scholar] [CrossRef]
Chaudhari, P.; Agrawal, H.; Kotecha, K. Data augmentation using MG-GAN for improved cancer classification on gene expression data. Soft Comput. 2020, 24, 11381–11391. [Google Scholar] [CrossRef]
Arjovsky, M.C.S.; Bottou, L. Wasserstein Generative Adversarial Networks. PMLR 2017, 70, 214–223. [Google Scholar]
Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 9992–10002. [Google Scholar]
Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
Kim, D.; Langmead, B.; Salzberg, S. HISAT:A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef]
Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
Liao, Y.; Smyth, G.K.; Shi, W. featureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 2014, 30, 923–930. [Google Scholar] [CrossRef] [PubMed]
Haas, B.J.; Papanicolaou, A.; Yassour, M.; Grabherr, M.; Blood, P.D.; Bowden, J.; Couger, M.B.; Eccles, D.; Li, B.; Lieber, M.; et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013, 8, 1494–1512. [Google Scholar] [CrossRef] [PubMed]
Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q.D.; et al. Me: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644–652. [Google Scholar] [CrossRef]
Cantalapiedra, C.P.; Hernández-Plaza, A.; Letunic, I.; Bork, P.; Huerta-Cepas, J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol. Biol. Evol. 2021, 38, 5825–5829. [Google Scholar] [CrossRef] [PubMed]
Huerta-Cepas, J.; Szklarczyk, D.; Heller, D.; Hernández-Plaza, A.; Forslund, S.K.; Cook, H.; Mende, D.R.; Letunic, I.; Rattei, T.; Jensen Lars, J.; et al. eggNOG 5.0: A hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019, 47, D309–D314. [Google Scholar] [CrossRef]
Trincado, J.L.; Entizne, J.C.; Hysenaj, G.; Singh, B.; Skalic, M.; Elliott, D.J.; Eyras, E. SUPPA2: Fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Genome Biol. 2018, 19, 40. [Google Scholar] [CrossRef]
Alamancos, G.P.; Pagès, A.; Trincado, J.L.; Bellora, N.; Eyras, E. Leveraging transcript quantification for fast computation of alternative splicing profiles. RNA 2015, 21, 1521–1531. [Google Scholar] [CrossRef]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
Wang, Y.; Yue, X.; Lou, S.; Feng, P.; Cui, B.; Liu, Y. Gene Swin transformer: New deep learning method for colorectal cancer prognosis using transcriptomic data. Brief. Bioinform. 2025, 26, bbaf275. [Google Scholar] [CrossRef]
Pan, X.; Ge, C.; Lu, R.; Song, S.; Chen, G.; Huang, Z.; Huang, G. On the Integration of Self-Attention and Convolution. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022. [Google Scholar]
Brown, J.W.S.; Calixto, C.P.G.; Zhang, R. High-quality reference transcript datasets hold the key to transcript-specific RNA-sequencing analysis in plants. New Phytol. 2017, 213, 525–530. [Google Scholar] [CrossRef]
Park, S.; Lee, J.; Kim, J.; Kim, D.; Lee, J.H.; Pack, S.P.; Seo, M. Benchmark study for evaluating the quality of reference genomes and gene annotations in 114 species. Front. Vet. Sci. 2023, 10, 1128570. [Google Scholar] [CrossRef]
Ma, R.; Rangel, J.; Grozinger, C.M. Honey bee (Apis mellifera) larval pheromones may regulate gene expression related to foraging task specialization. BMC Genom. 2019, 20, 592. [Google Scholar] [CrossRef]
Jha, A.K.; Aicher, J.R.; Gazzara, M.; Singh, D.; Barash, Y. Enhanced Integrated Gradients: Improving interpretability of deep learning models using splicing codes as a case study. Genome Biol. 2020, 21, 149. [Google Scholar] [CrossRef] [PubMed]
van Hilten, A.; Katz, S.; Saccenti, E.; Niessen, W.J.; Roshchupkin, G.V. Designing interpretable deep learning applications for functional genomics: A quantitative analysis. Brief. Bioinform. 2024, 25, bbae449. [Google Scholar] [CrossRef] [PubMed]
Jiang, T.; Shen, C.; Ding, P.; Luo, L. Data augmentation based on the WGAN-GP with data block to enhance the prediction of genes associated with RNA methylation pathways. Sci. Rep. 2024, 14, 26321. [Google Scholar] [CrossRef]
Robertson, H.M.; Wanner, K.W. The chemoreceptor superfamily in the honey bee, Apis mellifera: Expansion of the odorant, but not gustatory, receptor family. Genome Res. 2006, 16, 1395–1403. [Google Scholar] [CrossRef]
Zhou, X.; Rokas, A.; Berger, S.L.; Liebig, J.; Ray, A.; Zwiebel, L.J. Chemoreceptor Evolution in Hymenoptera and Its Implications for the Evolution of Eusociality. Genome Biol. Evol. 2015, 7, 2407–2416. [Google Scholar] [CrossRef]
Ai, X.; Smith, M.C.; Feltus, F.A. Generative adversarial networks applied to gene expression analysis: An interdisciplinary perspective. Comput. Syst. Oncol. 2023, 3, e1050. [Google Scholar] [CrossRef]
Ning, Q.; Qi, Z. WGAN-GP_Glu: A semi-supervised model based on double generator-Wasserstein GAN with gradient penalty algorithm for glutarylation site identification. Comput. Biol. Med. 2025, 184, 109328. [Google Scholar] [CrossRef]
Huang, K.; Tian, J.; Sun, L.; Hu, H.; Huang, X.; Zhou, S.; Deng, A.; Zhou, Z.; Jiang, M.; Li, G.; et al. TransGeneSelector: Using a transformer approach to mine key genes from small transcriptomic datasets in plant responses to various environments. BMC Genom. 2025, 26, 259. [Google Scholar] [CrossRef]
Marouf, M.; Machart, P.; Bansal, V.; Kilian, C.; Magruder, D.S.; Krebs, C.F.; Bonn, S. Realistic in silico generation and augmentation of single-cell RNA-seq data using generative adversarial networks. Nat. Commun. 2020, 11, 166. [Google Scholar] [CrossRef]
Divate, M.; Tyagi, A.; Richard, D.J.; Prasad, P.A.; Gowda, H.; Nagaraj, S.H. Deep Learning-Based Pan-Cancer Classification Model Reveals Tissue-of-Origin Specific Gene Expression Signatures. Cancers 2022, 14, 1185. [Google Scholar] [CrossRef]
Moreno-Barea, F.; Jerez, J.; Franco, L. GAN-Based Data Augmentation for Prediction Improvement Using Gene Expression Data in Cancer. Int. Conf. Comput. Sci. 2022, 13352, 28–42. [Google Scholar]
Shen, Y.; Chu, Q.; Timko, M.P.; Fan, L. scDetect: A rank-based ensemble learning algorithm for cell type identification of single-cell RNA sequencing in cancer. Bioinformatics 2021, 37, 4115–4122. [Google Scholar] [CrossRef] [PubMed]
Nevone, A.; Lattarulo, F.; Russo, M.; Panno, G.; Milani, P.; Basset, M.; Avanzini, M.A.; Merlini, G.; Palladini, G.; Nuvolone, M. A Strategy for the Selection of RT-qPCR Reference Genes Based on Publicly Available Transcriptomic Datasets. Biomedicines 2023, 11, 1079. [Google Scholar] [CrossRef]
Zhang, Y.; Huang, L.-X.; Yue, Z.-H. Integrating WGCNA and SVM-RFE identifies hub molecular biomarkers driving ischemic stroke progression. Neurol. Res. 2025, 47, 626–636. [Google Scholar] [CrossRef]
He, X.; Yin, J.; Yu, M.; Wang, H.; Qiu, J.; Wang, A.; He, X.; Wu, X. Identification and Validation of Hub Genes for Predicting Treatment Targets and Immune Landscape in Rheumatoid Arthritis. BioMed Res. Int. 2022, 2022, 8023779. [Google Scholar] [CrossRef]
Amdam, G.V.; Simões, Z.L.P.; Hagen, A.; Norberg, K.; Schrøder, K.; Mikkelsen, Ø.; Kirkwood, T.B.L.; Omholt, S.W. Hormonal control of the yolk precursor vitellogenin regulates immune function and longevity in honeybees. Exp. Gerontol. 2004, 39, 767–773. [Google Scholar] [CrossRef] [PubMed]
Corona, M.; Velarde, R.A.; Remolina, S.; Moran-Lauter, A.; Wang, Y.; Hughes, K.A.; Robinson, G.E. Vitellogenin, juvenile hormone, insulin signaling, and queen honey bee longevity. Proc. Natl. Acad. Sci. USA 2007, 104, 7128–7133. [Google Scholar] [CrossRef] [PubMed]
Harwood, G.; Amdam, G. Vitellogenin in the honey bee midgut. Apidologie 2021, 52, 837–847. [Google Scholar] [CrossRef]
Nicewicz, Ł.; Nicewicz, A.W.; Nakonieczny, M. Vitellogenins Level as a Biomarker of the Honeybee Colony Strength in Urban and Rural Conditions. Insects 2025, 16, 25. [Google Scholar] [CrossRef]

Figure 1. Illustrates the complete WGAN-GP architecture, including the generator network (green), critic network (red), training process (blue), gradient penalty computation (yellow), and training hyperparameters (cyan).

Figure 2. Illustrates the detailed implementation of the ACmix-Swin1D hybrid architecture. The architecture comprises three main stages: Stage I performs feature projection, transforming the input (B, 7, 64) into Q, K, V projections; Stage II consists of two parallel branches—the attention path employs Swin window attention for global dependency modeling, while the convolution path uses depthwise separable convolution for local feature extraction; finally, features are fused through learnable weights α and β, outputting dimensions of (B, 7, 64). (Q+K for attention computation and V for the convolution path.).

Figure 3. PCA and hierarchical clustering reveal clear caste-specific transcriptomic separation among drone, queen, and worker larvae.

Figure 4. (a) Differentially expressed genes (DEGs) among drones, queens, and workers. (b) Volcano of differentially expressed genes among drone, worker, and queen larvae. (c) Gene Expression Correlation Analysis Among Bee Larval Castes via K-means Clustering (A–C) Pairwise comparison of cluster-level gene expression patterns between Drone, Queen, and Worker larvae at 4 days post-hatching. Each point represents a gene cluster (n = 200 clusters from 5000 high-variance genes). Correlation coefficients (R) and significance (p-value) are shown. Outlier genes (labeled) indicate caste-specific expression signatures. Diagonal dashed line (y = x) indicates equal expression.

Figure 5. Expression patterns of the top 30 differentially expressed genes (DEGs) across drones, workers, and queens.

Figure 6. KEGG and GO Analysis. (a) GO and KEGG enrichment analysis of differentially expressed genes among three castes of honeybee. (b) KEGG pathway enrichment and gene-pathway relationships revealed by Sankey diagram and bubble plot of three castes of honeybee.

Figure 7. Training dynamics and convergence evaluation of WGAN-GP for transcriptomic data augmentation. (a) Training Loss Dynamics. (b) Wasserstein Distance Over Training. (c) Real vs. Generated Sample Scores. (d) Gradient Penalty Over Epochs.

Figure 8. Training dynamics of the ACmix-Swin classifier. (a) Training and test loss curves. Both losses decreased rapidly during early epochs, with the lowest test loss (0.2191) reached at epoch 251. Test loss remained consistently below training loss, indicating effective regularization. (b) Training and test accuracy. Training accuracy stabilized at 97–99%, while test accuracy peaked at 95.8% and remained stable throughout training. (c) Overfitting assessment. The train–test accuracy gap stayed within 3–6%, far below conventional overfitting thresholds, demonstrating strong generalization. (d) Learning-rate schedule. A warm-up phase (epochs 0–10) was followed by cosine annealing decay, enabling stable convergence and refined optimization.

Figure 9. Integrated hub gene selection combining deep learning importance and WGCNA module-trait correlation analysis. (Edges represent gene co-expression strength (thicker lines indicate stronger connectivity), *: p < 0.05, **; p < 0.01, ***: p < 0.001).

Figure 10. Integrated Analysis of Gene Expression Patterns and Machine Learning-based Caste Classification (The proposed ACmix-Swin model achieves superior classification performance (95.8% accuracy) compared to traditional machine learning approaches.).

Figure 11. Ablation study demonstrating the contribution of each architectural component in the ACmix-Swin model.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gong, P.; Li, J.; Tian, W.; Ding, X.; Su, R.; Yue, D. ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs. Genes 2026, 17, 17. https://doi.org/10.3390/genes17010017

AMA Style

Gong P, Li J, Tian W, Ding X, Su R, Yue D. ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs. Genes. 2026; 17(1):17. https://doi.org/10.3390/genes17010017

Chicago/Turabian Style

Gong, Peixun, Jinyou Li, Weixue Tian, Xiang Ding, Runlang Su, and Dan Yue. 2026. "ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs" Genes 17, no. 1: 17. https://doi.org/10.3390/genes17010017

APA Style

Gong, P., Li, J., Tian, W., Ding, X., Su, R., & Yue, D. (2026). ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs. Genes, 17(1), 17. https://doi.org/10.3390/genes17010017

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

ACmix-Swin Deep Learning of 4-Day-Old Apis mellifera Larval Transcriptomes Reveals Early Caste-Biased Regulatory Hubs

Abstract

1. Introduction

2. Materials and Methods

2.1. Sample Collection

2.2. Alignment and Quality Control of RNA-Seq Data

2.3. GO Enrichment Analysis of Differentially Expressed Genes and Alternatively Spliced Genes

2.4. Alternative Splicing Analysis

2.5. WGCNA Analysis

2.6. Model Construction

3. Results

3.1. RNA-Seq Overview and Quality Assessment

3.2. Principal Component Analysis (PCA) of Samples

3.3. Traditional Analysis of Differentially Expressed Genes (DEGs)

3.4. Differential Gene Heatmap (DEGs)

3.5. GO and KEGG Analysis of Differentially Expressed Genes

3.6. Deep Learning Analysis

3.6.1. WGAN-GP Analysis

3.6.2. Performance Evaluation of the ACmix-Swin Classifier

3.6.3. Hub Gene Analysis

3.6.4. Model Performance Benchmarking

3.6.5. Ablation Analysis of Model Components

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI