Genome-Wide Association Study for Weight-Related Traits in Scylla paramamosain Using Whole-Genome Resequencing

Lin Chen; Yaodong Zhang; Peitan Jia; Siyi Zhou; Qionghui Qin; Weiren Zhang; Kewei Huang; Xiaopeng Wang; Haihui Ye

doi:10.3390/ani15131829

,

and

State Key Laboratory of Mariculture Breeding, Fisheries College of Jimei University, Xiamen 361021, China

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Animals2025, 15(13), 1829;https://doi.org/10.3390/ani15131829

This article belongs to the Section Animal Genetics and Genomics

Version Notes

Order Reprints

Simple Summary

In this study, we investigated the genetic basis of weight-related traits in the mud crab Scylla paramamosain through whole-genome resequencing of 323 individuals and subsequent genome-wide association studies (GWAS). We analyzed five weight traits: body weight, trunk weight, weight excluding chelae, cheliped weight, and appendage weight. Our results revealed that significant SNPs were primarily concentrated on chromosomes 15, 22, 25, and 36. We identified shared candidate genes for both body-related and appendage-related traits, as well as across all five traits. These candidate genes are clustered in functional categories related to growth, development, metabolism, and immunity. Key genes included CCHa1R (related to feeding), DCX-EMA (linked to movement), MSTO1, NVD, CYP307A1, FGF1, NF2, ANKRD52 (relevant to growth and development), and RGS10 (associated with immune responses). These findings improve our comprehension of mud crab growth and provide insights for sustainable breeding programs.

Abstract

Weight traits serve as key economic indicators for assessing growth performance and commercial quality in the mud crab Scylla paramamosain, yet the genetic basis of these traits remains poorly characterized. Here, we performed whole-genome resequencing on 323 individuals and conducted genome-wide association studies (GWAS) on five weight-related traits: (1) body-related traits, including body weight (BW), trunk weight (TruW), and weight excluding chelae (WEC); (2) appendage-related traits, containing appendage weight (AppW) and cheliped weight (CheW). Significantly associated SNPs were primarily enriched on chromosomes 15, 22, 25, and 36. For body-related traits, we identified 45 shared candidate SNPs and 175 common candidate genes; appendage-related traits revealed 71 shared candidate SNPs, and 229 common genes were identified; and across all five traits, there were 9 shared candidate SNPs and 49 common genes. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses indicated that shared functional terms/pathways among the five traits were mainly related to metabolism, development, and immunity. Body-related traits exhibited more unique GO terms and KEGG pathways associated with metabolism and immunity, whereas appendage-related traits showed some unique GO terms and KEGG pathways involved in development and morphogenesis. Among the candidate genes, we identified multiple genes associated with growth and development, metabolism, and immune responses. For example, the CCHa1R gene, common to carapace-related traits, is linked to feeding; the DCX-EMA gene, which is common to appendage-related traits, is connected to movement, and the MSTO1 gene is pertinent to muscle development. Among the candidate genes shared by all five traits, there are a series of genes concerning growth and development (such as NVD, CYP307A1, FGF1, NF2, ANKRD52) and immune responses (RGS10). These findings advance our understanding of the genetic architecture underlying decapod crustacean growth and provide valuable insights for optimizing sustainable breeding strategies in S. paramamosain.

Keywords:

weight-related traits; Scylla paramamosain; whole-genome resequencing; GWAS

1. Introduction

Scylla paramamosain, commonly known as the mud crab, belongs to the genus Scylla, family Portunidae, order Decapoda, and class Crustacea. This species is characterized by a robust carapace (exoskeleton) with serrated edges along the anterior margin. Scylla paramamosain typically inhabits intertidal zones with muddy or sandy substrates, where it feeds primarily on slow-moving or benthic organisms such as mollusks, small crabs, and worms []. It is widely distributed across the Indo-West Pacific region [] and is renowned for its large size, flavorful meat, and rich nutritional value []. This species plays a significant role in the marine economy of China and Southeast Asia, and holds an important position in aquaculture []. In 2023, China’s mud crab aquaculture production reached approximately 160,000 tons [], constituting over half of the global total production reported by the Food and Agriculture Organization of the United Nations (FAO) []. Despite this, the majority of mud crab seeds for aquaculture are still sourced from wild capture, which has led to overfishing concerns in many areas []. Furthermore, the lack of high-quality mud crab breeds complicates efforts to meet the increasing demand from the aquaculture sector. Consequently, it is imperative to strengthen genetic breeding efforts, investigate the genetic mechanisms underlying economically relevant traits, and apply this knowledge to breeding practices in S. paramamosain. Such initiatives are crucial for advancing the mud crab industry and ensuring the sustainable management of marine biological resources.

Growth traits are critical economic traits for aquatic species, as they directly affect aquaculture efficiency and market value [,]. Among these traits, body weight serves as a key representative of growth performance, making it a target for selection in breeding programs aimed at fast-growing and large body sizes. The genetic basis of body weight, being a complex quantitative trait influenced by multiple genes, presents challenges for genetic analysis. However, the rapid advancements in sequencing technologies now enable the detailed investigation of genetic variants and genes associated with weight traits at the whole-genome level in aquatic animals. Among the various phenotypic genetic analysis methods, conducting a genome-wide association study (GWAS) has proven to be a powerful, high-resolution tool for identifying genetic variants linked to complex weight-related traits in fish species, including Atlantic salmon (Salmo salar) [], rainbow trout (Oncorhynchus mykiss) [], olive flounder (Paralichthys olivaceus) [], European sea bass (Dicentrarchus labrax) [], spotted sea bass (Lateolabrax maculatus) [], and brown-spotted grouper (Epinephelus tauvina) []. In contrast, GWAS research in crustaceans began more recently [], but has progressed rapidly. For instance, GWAS in the Pacific white shrimp (Litopenaeus vannamei) have detected SNPs related to growth [] and sex determination [], while research in the oriental river prawn (Macrobrachium nipponense) has uncovered candidate genes linked to growth traits []. Additionally, GWAS in the Chinese mitten crab (Eriocheir sinensis) has revealed SNPs and genes associated with hypoxia tolerance [], and a study in freshwater crustaceans has explored the Alpha, alpha-trehalose-phosphate synthase (TPS) gene related to salinity tolerance []. Despite these advancements, genetic analysis of phenotypic traits in mud crabs has lagged, primarily due to the long-standing absence of a high-quality reference genome. However, the recent publication of high-quality reference genomes for S. paramamosain has provided a solid foundation for analyzing the genetic variations in traits from the whole-genome variation level [,]. For example, Zhang et al. released the highest-quality version of the S. paramamosain reference genome to date, and identifying a sex-linked region on chromosome 6 through resequencing data []. Similarly, Ye et al. performed a genome-wide association study on 100 individuals using 40K SNP microarray data from S. paramamosain and identified SNPs and genes related to traits such as body weight, body length, and body height []. Despite these efforts, to date, no GWASs have been conducted on weight-related traits in S. paramamosain using whole-genome resequencing data.

In this study, we performed whole-genome resequencing on 323 S. paramamosain individuals and subdivided weight traits into five specific traits: body weight (BW), trunk weight (TruW), weight excluding chelae (WEC), cheliped weight (CW), and appendage weight (AppW). We utilized GWAS technology to deeply mine genetic loci and candidate genes related to these traits. This study not only provides a new perspective for analyzing the genetic mechanisms of growth traits in crustaceans, but also lays a solid theoretical foundation for the precise breeding of high-yielding new breeds of S. paramamosain.

2. Materials and Methods

2.1. Sample and Phenotype Collection

In October 2023, 323 S. paramamosain individuals, including 178 males (55.11%) and 145 females (44.89%), were collected from Fuzhou, Fujian Province, China (Table S1). To minimize environmental variance, all juvenile crabs were cultivated in a single pond system under standardized protocols, maintaining identical initial size distributions and rearing conditions (180 ± 7 days growth period). Harvested crabs were transported to the laboratory at Jimei University within 12 h for dissection to obtain muscle tissue samples. Tissue samples were immediately frozen in liquid nitrogen and stored at −80 °C.

Weight-related traits were classified into five categories based on the body configuration of S. paramamosain (Figure 1A): (1) body weight (BW), the weight of the entire body; (2) weight excluding chelae (WEC), the body weight after removing the two large chelae; (3) trunk weight (TruW), the body weight minus the appendage attached to the cephalothorax; (4) cheliped weight (CheW), the combined weight of the two large chelae; and (5) appendage weight (AppW), the weight of all ten legs. The Wilcoxon rank-sum test was employed to assess the statistical significance of distributional differences in five traits across sexes, with results visualized using the R programming language.

Figure 1. Statistics of phenotypic and sequencing data. (A) Five weight-related traits of S. paramamosain. The abbreviations BW, WEC, TruW, AppW, and CheW correspond to body weight, weight excluding chelae, trunk weight, appendage weight, and cheliped weight, respectively. (B) Correlation among the five traits. *** indicates a p-value less than 0.001. (C) Density plot of SNP distribution across chromosomes based on sequencing data, generated using the R package CMplot (v4.5.1) []. (D) Distribution plot of the minor allele frequency (MAF) counts for SNPs. (E) Classification information of SNPs after annotation.

2.2. DNA Extraction, Sequencing, and Variant Calling

Genomic DNA was isolated using TaKaRa DNA extraction kits (Takara Biotechnology Co., Kusatsu, Japan) following the manufacturer’s protocol. The concentration and quality of the extracted DNA were assessed using a NanoDrop 2000 Spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). DNA libraries with an average insert size of 350 bp were prepared following Illumina standard protocols and sequenced on an Illumina HiSeq X plus platform by a commercial service provider (Novogene, Beijing, China) to generate 150 bp paired-end reads.

Raw sequencing reads in FASTQ format were processed using fastp (v0.23.2) []. In this step, adapter sequences, reads containing ploy-N regions, and low-quality reads were removed to obtain high-quality clean reads. Additionally, Q20, Q30, GC content, and sequence duplication levels were calculated to assess data quality. The filtered reads were mapped to the reference genome [] using BWA (v0.7.12) []. The alignment results were sorted, and duplicate reads were marked using Samtools (v1.9) []. Variant calling, including single-nucleotide polymorphisms (SNPs) and insertions/deletions (InDels), was performed using the HaplotypeCaller module in GATK (v3.8) [] with the following filtering criteria: QD < 2.0 || MQ < 40.0 || FS > 60.0 || QUAL < −12.5 || ReadPosRankSum < −8.0-clusterSize 2-clusterWindowSize 5. SNPs annotation was performed on the basis of the reference genome using snpEff (v3.6c) []. Variants were categorized into intergenic regions, upstream or downstream regions, exons or introns. SNPs located in coding regions were further classified as synonymous or nonsynonymous.

For quality control, PLINK (v1.9) [] was employed to trim the data with minor allele frequency (MAF) < 0.05, call rates < 90%, and Hardy–Weinberg equilibrium (HWE) p-values < 0.000001. After filtering, a final set of 4,042,299 high-confidence SNPs from 323 individuals was retained for subsequent analysis.

2.3. Phenotypic Heritability and Correlations

Heritability (

h^{2}

) of traits was defined as the ratio of the additive genetic variance to phenotypic variance. The SNP-based heritability (h²) was calculated using HIBLUP software (v1.5.3) [] as follows:

h^{2} = \frac{σ_{a}^{2}}{σ_{a}^{2} + σ_{e}^{2}}

where

σ_{a}^{2}

represents the additive genetic variance and

σ_{e}^{2}

represents the residual variance. Furthermore, Pearson correlation coefficients were computed for pairs of traits to further investigate the correlations among the phenotypic characteristics themselves. Trait correlations were visualized using the “chart.Correlation” function in the PerformanceAnalytics (v2.0.4) [] package of the R programming language.

2.4. Population Structure Analysis

Principal component analysis (PCA) was conducted using PLINK (v1.9) [], and the results were visualized through ggplot2 (v3.5.1) [] package in R. A genomic relationship matrix (GRM) was computed utilizing GCTA (v1.94.3) [], and a heatmap was generated with the pheatmap (v1.0.12) [] package in R.

2.5. Genome-Wide Association Study

Genome-wide association studies (GWAS) were conducted to examine associations between genome-wide SNPs and individual weight-related traits using GEMMA (v0.98.5) []. The univariate linear mixed model (LMM) was applied as follows:

y = W α + x β + u + ε; u ~ {MVN}_{n} (0, λ τ^{- 1} K), ε ~ {MVN}_{n} (0, τ^{- 1} l_{n})

where y is the phenotype vector; W is the fixed effect matrix, including the top six eigenvectors of PCA, sex, and date of sampling;

α

is a c-vector of the corresponding coefficients including the intercept; x is the vector of SNP genotype;

β

indicates the effect size of the marker; u is a random effect;

ε

represents residuals; λ is the ratio between the two variance components; τ⁻¹ is the residual variance; K is the standardized correlation matrix estimated by GEMMA (v0.98.5) []; l_n is the unit matrix; MVN_n denotes the n-dimensional multivariate normal distribution; and n is the number of animals.

Considering the stringency of the Bonferroni correction threshold (p = 1.24 × 10⁻⁸; 0.05/4,042,299), this study adopted a more lenient threshold of p = 1.0 × 10⁻⁵, which commonly recommended for GWAS discoveries [,,]. The phenotypic variance explained (PVE) by each significant locus was estimated using the methodology of previously study []. To systematically evaluate the statistical power of our GWAS, we conducted power analysis using G*Power software (version 3.1.9.7) []. A two-tailed t-test was employed with the “Linear multiple regression: Fixed model, single regression coefficient” module, and PVE was used as the effect size metric [].

2.6. Gene Annotation and Functional Enrichment Analysis

According to previous research [,,], candidate genes were identified within 100 kb upstream and downstream of significant SNPs using the R package GALLO (v1.5) []. Protein sequences from the reference genome were annotated using EggNOG-mapper (v2.1.12) [], and a custom database was built with the R package AnnotationForge (v1.44.0) []. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed using the R package clusterProfiler (v4.10.1) []. The enriched terms with the criteria of p < 0.01 were selected to further explore the genes involved in pathways and biological processes.

3. Results

3.1. Measurement of Phenotypic and Genomic Data

Phenotypic data for 323 individuals showed a normal distribution (Figure 1B and Figure S1). Specifically, the mean ± standard deviation (SD) for BW was 218.74 ± 74.37 g, TruW was 132.07 ± 45.35 g, WEC was 157.66 ± 53.28 g, AppW was 86.67 ± 38.59 g, and CheW was 61.08 ± 30.90 g (Table 1), respectively. Pearson’s correlation analysis revealed correlation coefficients greater than 0.86 among BW, TruW and WEC traits, and 0.87 between CheW and AppW (Figure 1B). Sex-stratified correlation analysis confirmed consistency with the combined-sex results: BW, TruW, and WEC showed strong correlations, while CheW and AppW also exhibited highly significant correlations (Figure S1). Based on their body parts and correlations, the traits were classified into two groups: (1) the BTW group (body-related traits, comprising BW, TruW, and WEC), and (2) the AC group (appendage-related traits, containing CheW and AppW). The results of the Wilcoxon rank-sum test revealed significant sexual dimorphism in four out of five traits (all p < 0.01), except for BW (p = 0.4). Specifically, females showed higher TruW (p = 5.5 × 10⁻⁵) and WEC (p = 0.0017), but lower AppW (p = 1.9 × 10⁻¹⁴) and CheW (p < 2 × 10⁻¹⁶) than males (Table 1 and Figure S2). These findings necessitate sex adjustment in subsequent GWAS analyses.

Table 1. Summary statistic of weight-related traits in Scylla paramamosain.

The dataset utilized for this analysis comprises 3.5 TB of clean data, achieving a Q30 score of 91.11%. The effective rate was 96.86%, average GC content was 42.39%, and average coverage depth of 9× (Table S1). GATK (v3.8) identified a total of 55,230,846 SNPs, of which 5,572,631 were shared across all individuals with complete data (no missing calls), and 49,718,299 SNPs remained after hard filtering. The analysis identified 41,766,627 SNPs with MAF < 0.05, representing 84% of the SNPs subjected to hard-filtering criteria. After quality control by PLINK (v1.9) [] and phase and imputation by Beagle (v5.5) [], a total of 4,042,299 SNPs were obtained, which were evenly distributed across all chromosomes without large gaps except chromosome 48 (Figure 1C), and the average distance between SNPs was 300 bp. These SNPs were used in subsequent analyses.

We constructed the site frequency spectrum of the high-quality SNPs (Figure 1D). This spectrum demonstrates an L-shaped distribution (Figure 1D), indicating that as the MAF increases, the corresponding number of sites decreases. After conducting annotation analysis on these high-quality SNPs across all 323 individuals, we identified that the four most prevalent types of SNPs are intron (42.48%), followed by intergenic (32.63%), upstream (14.64%), and downstream (4.46%).

3.2. Population Structure

PCA results indicated no significant genetic disparities among individuals, with PC1 and PC2 explaining 0.59% and 0.54% of the variance (Figure 2A), respectively. Consistent with the PCA results, the heatmap of the genetic relationship matrix revealed small genetic differences among individuals (Figure 2B), suggesting that there is no significant population stratification in the population.

Figure 2. Population genetic structure analysis. (A) Principal component analysis; red and blue dots represent female and male individuals, respectively; (B) heatmap of the genetic relationship matrix for the tested population.

3.3. GWAS Results

In this study, 4,042,299 SNPs from 323 S. paramamosain samples were retained for GWAS analysis. Given the conservative nature of the genome-wide significance threshold (1.24 × 10⁻⁸, calculated as 0.05 divided by 4,042,299) derived from strict application of Bonferroni correction in GWAS results, this study employed a relatively lenient significance threshold of p < 10⁻⁵, which has been widely accepted as reliable in numerous studies [,,].

The quantile–quantile (Q-Q) plot was employed to compare the distribution of observed log₁₀(p) values for whole-genome SNPs to the theoretical distribution of expected values. The genomic inflation factor (λ) of these traits ranged from 0.993 to 1.011 (Figure S1), indicating that the effects of population stratification in the GWAS analytical model were reasonably corrected. For the traits of BW, TruW, WEC, AppW, and CheW, 101, 104, 99, 109, and 167 significant SNPs were identified across 30, 34, 34, 38, and 44 chromosomes, respectively (Table S2, Figure 3). Chromosomes 25 and 36 emerged as hotspots for SNPs related to BW, TruW, and WEC, while chromosomes 15 and 22 were notable for AppW and CheW. The PVE for significant association loci (p < 10⁻⁵) for five weight-related traits ranged from 6.24% to 10.97%, with a mean value of 7.05% (Table S2). Statistical power analysis revealed that detecting variants with an effect size equivalent to the mean PVE (7.05%) would require approximately 400 samples to achieve 80% power (power = 0.8). Under the actual sample size (n = 323), the detection power for variants with PVE = 7.05% is estimated at 60%. Notably, variants with larger effect sizes (e.g., PVE > 8.8%) still achieve over 80% power under the current sample size, demonstrating reliable detection capability for genetic variants with moderate-to-large effects.

Figure 3. Manhattan plots from genome-wide association studies. (A) Body weight (BW); (B) trunk weight (TruW); (C) weight excluding chelae (WEC); (D) appendage weight (AppW); (E) cheliped weight (CheW). Red dashed lines indicate the significance threshold (p = 10⁻⁵). Chromosome 6 (highlighted in red) corresponds to the sex chromosome as previously reported [].

Following gene annotation, 334, 346, 300, 381, and 678 genes were annotated for BW, TruW, WEC, AppW, and CheW traits (Table S3), respectively. In the BTW group, 93 significant SNPs were detected by at least two of three traits (Figure 4A), with 45 significant SNPs and 175 candidate genes shared by all three traits (Figure 4B). Similarly, in the AC group, 71 significant SNPs were common to both traits (Figure 4A), annotating 229 shared candidate genes (Figure 4B). Remarkably, nine significant SNPs were shared by all five traits (Figure 4A), primarily located on chromosomes 2, 3, 15, 25, 26, 30, and 36, and these SNPs were annotated to 47 shared genes (Figure 4B, Table 2). In addition, two candidate genes were identified as being shared among the five phenotypic candidate regions, albeit without sharing any SNPs (Table 2). Among the nine candidate SNPs, seven are intergenic variants, and two are located within introns of the uncharacterized genes LOC135113241 and ANKRD52, respectively. LD analysis revealed that the three SNPs on chromosome 3 reside within a 143 bp LD block (r² > 0.8, Figure S5), whereas the other six inter-chromosomal SNPs exhibit low LD (r² < 0.1), indicating that these signals may reflect independent genetic effects (Table S4).

Figure 4. Venn diagram analysis of GWAS results. (A) Shared significant SNP loci across five traits. Numerical values within intersection areas denote the quantity of shared SNPs, with the central overlap indicating nine SNPs common to all five traits. (B) Shared candidate genes across five traits. Numerical values within intersection areas denote the quantity of shared genes, with the central overlap indicating 49 candidate genes common to all five traits. The full names corresponding to the five phenotypic abbreviations can be found in Table 1.

Table 2. Shared candidate SNPs and genes across all five traits.

3.4. Functional Annotation

GO function analysis and KEGG pathway analysis are pivotal in investigating gene function and elucidating biological processes. In this study, we conducted GO and KEGG analyses on all candidate genes associated with the five traits to explore their functional roles. The candidate genes were mainly enriched in terms and pathways related to metabolism, growth, and immunity.

For the traits of the BTW group, a total of 757 GO terms were detected, including 616 biological process (BP) terms, 53 cellular component (CC) terms, and 88 molecular function (MF) terms (Figure 5A, Table S5). For the AC group, 429 GO terms were detected, encompassing 340 BP terms, 45 CC terms, and 40 MF terms (Figure 5C, Table S5). There were 155 GO terms shared between the two groups (Figure 5E, Table S5), which primarily involved biological processes such as actomyosin development (e.g., GO:0031566 actomyosin contractile ring maturation; GO:0000916 actomyosin contractile ring contraction), germ cell division (e.g., GO:0007112 male meiosis cytokinesis; GO:0007111 meiosis II cytokinesis), lipid transport (e.g., GO:0015914 phospholipid transport; GO:0110112 regulation of lipid transporter activity), skeletal development (e.g., GO:0048706 embryonic skeletal system development; GO:0048705 skeletal system morphogenesis), organogenesis (e.g., GO:0048703 embryonic viscerocranium morphogenesis; GO:0048645 animal organ formation), immunity (e.g., GO:0035722 interleukin-12-mediated signaling pathway; GO:0032740 positive regulation of interleukin-17 production), optesthesia (GO:0046669 regulation of compound eye retinal cell-programmed cell death; GO:0061074 regulation of neural retina development), and olfaction (GO:0021553 olfactory nerve development). In terms of cellular components, the enriched terms mainly related to cellular energy metabolism and differentiation (e.g., GO:0016006 Nebenkern; GO:0005766 primary lysosome), immune response (e.g., GO:0042582 azurophil granule; GO:0035577 azurophil granule membrane; and GO:0042581 specific granule), etc. For molecular functions, they primarily concerned lipid and fatty acid binding (e.g., GO:0070540 stearic acid binding; GO:0005504 fatty acid binding) and phospholipid transport (e.g., GO:0005548 phospholipid transporter activity; GO:0120013 lipid transfer activity), etc.

Figure 5. Functional enrichment analysis of candidate genes. The top 10 significant GO terms for biological process (BP), cellular component (CC), and molecular and function (MF) categories are presented for common candidate genes in the (A) BTW group, (B) AC group, and (C) the genes common to both BTW and AC groups. Additionally, the top 20 significant KEGG pathways are displayed for shared candidate genes in the (D) BTW group, (E) AC group, and (F) the genes common to both BTW and AC groups.

A comparison of the GO analysis results between the BTW and AC groups revealed that the GO terms of the BTW group contained more terms related to metabolism and immunity in biological processes. In contrast, the AC group was enriched with numerous terms relevant to development and morphogenesis, particularly terms related to limb development, such as GO:0035292, specification of segmental identity, and trunk. In terms of cellular components, the BTW group was enriched with more terms related to immune response (e.g., GO:0001772 immunological synapse; GO:0031091 platelet alpha granule) and membrane-associated components (e.g., GO:0033116 endoplasmic reticulum–Golgi intermediate compartment membrane; GO:0005798 Golgi-associated vesicle). For molecular functions, the BTW group was enriched with more GO terms related to enzymatic activity (e.g., GO:0003997 acyl-CoA oxidase activity; GO:0016505 peptidase activator activity involved in apoptotic process), as well as receptor and ligand binding (e.g., GO:0043560 insulin receptor substrate binding; GO:0051379 epinephrine binding).

For the candidate genes shared by the BTW and AC groups, 38 and 30 KEGG pathways were detected (Figure 5B,D, Table S6), respectively, which were mainly related to immunity, metabolism, and development. The shared KEGG pathways between the two groups primarily involved metabolism and growth (e.g., ko00040: pentose and glucuronate interconversions; ko04390: Hippo signaling pathway) (Figure 5F). The BTW group-specific KEGG pathways mainly included immune response (ko04658: Th1 and Th2 cell differentiation; ko04660: T cell receptor signaling pathway), sex hormone signaling (ko04912: GnRH signaling pathway; ko04915: estrogen signaling pathway), fatty acid degradation and metabolism (ko00071: fatty acid degradation; ko01212: fatty acid metabolism), etc. In contrast, the AC group-specific KEGG pathways primarily comprised vitamin metabolism (ko00053: ascorbate and aldarate metabolism; ko00830: retinol metabolism; and ko00750: vitamin B6 metabolism).

3.5. Key Candidate Area

In this section, we focused on the common regions among three traits in the BTW group, the candidate regions shared by the two traits in the AC group, and the candidate regions that are ubiquitous across all five traits (Figure 6). Within the common regions of the BTW group, 11 candidate SNPs were concentrated in the chr25: 11,640,576–11,928,077 region, which harbors the appetite-related gene CCHa1R (neuropeptide CCHamide-1 receptor) (Figure 6A). Additionally, one candidate SNP was mapped to the 5,581,593–5,781,593 region of chromosome 36, which encompasses the gene YIPF1 (Yip1 domain-containing) (Figure 6B).

Figure 6. Genome-wide association analyses reveal pleiotropic loci for growth-related traits in mud crab. Manhattan plots highlighting significant genomic regions associated with phenotypic traits: (A) chromosome 25 (11.40–12.40 Mb), shared association with BW, TruW, and WEC; (B) chromosome 36 (5.60–5.80 Mb), shared association with BW, TruW, and WEC; (C) chromosome 18 (7.48–7.90 Mb), shared association with AppW and CheW; (D) chromosome 22 (8.70–9.44 Mb), shared association with AppW and CheW; (E) chromosome 3 (40.26–40.47 Mb), pleiotropic region influencing all five traits; (F) chromosome 36 (2.60–3.00 Mb), pleiotropic region influencing all five traits. Red dashed line indicates the significance threshold (p = 1 × 10⁻⁵). The full names corresponding to the five phenotypic abbreviations can be found in Table 1. Blue dashed lines demarcate the physical position interval in the Manhattan plot (upper panel), which directly corresponds to the physical position range displayed in the lower panel.

In the shared regions of the AC group, we identified four significant SNPs within the 5,490,605–6,763,411 region of chromosome 15, which contains the gene DCX-EMAP (Doublecortin domain-containing echinoderm–microtubule-associated protein). Furthermore, two significant SNPs were found in the 7,651,246–7,905,855 region of chromosome 18, which includes the MST01 (Misato mitochondrial distribution and morphology regulator) gene (Figure 6C). Additionally, eight significant SNPs were concentrated in the 8,575,029–9,499,114 region of chromosome 22, which harbors the immune-related gene BGBP2 (beta-glucan-binding protein 2) (Figure 6D).

In the common regions of BTW and AC groups, three SNPs were concentrated in the 40,271,752–40,471,757 region of chromosome 3, which includes the genes PITP (phosphatidylinositol transfer protein), NVD (cholesterol 7-desaturase nvd), CYP307A1 (cytochrome P450 307A1), and FGF1 (fibroblast growth factor 1) (Figure 6E). One SNP was located in the 2,632,612–2,832,612 region of chromosome 36, containing the gene ANKRD52 (serine/threonine protein phosphatase 6 regulatory ankyrin repeat subunit C) (Figure 6F). Additionally, there was one candidate SNP in each of the chr2: 37,715,876–37,815,876, chr15: 22,304,281–22,504,281, chr25: 17,254,201–17,454,201, chr26: 8,689,084–8,889,084, and chr30: 14,347,661–14,547,661 regions (Table 2). These QTL regions encompass the genes DLX (homeotic protein Distal-less), ERGIC1 (endoplasmic reticulum–Golgi intermediate compartment protein 1), NaCh (sodium channel protein), HPGD (5-hydroxyprostaglandin dehydrogenase [NAD(+)]), RGS10 (regulator of G protein signaling 10), and ZCRB1 (zinc finger CCHC-type and RNA-binding motif-containing protein 1).

4. Discussion

Weight traits are among the most economically significant attributes in aquaculture species, making them a consistent focus of genetic research. In crabs, these traits are not limited to total body weight but are also significantly influenced by the weights of their carapace, cheliped, and appendages—key contributors to the value of crab meat. Hence, identifying genetic loci and genes associated with these traits is essential for advancing selective breeding programs and improving economic traits in crabs. Although the draft genome and transcriptome resources for the mud crab S. paramamosain have been established [,], genome-wide association studies (GWAS) on its economic traits remain limited. Most previous studies have relied on candidate gene screening or linkage analysis [,,,], which offer a narrow perspective on the complex multigenic regulation of traits. Advances in whole-genome resequencing (WGR) technology now enable high-precision GWAS by generating dense genome-wide SNP markers, particularly valuable for species with complex genetic backgrounds. In this study, we used WGR data (approximately 9× coverage) from 323 S. paramamosain individuals and conducted GWAS to identify genetic variation loci and genes associated with five weight-related traits, providing insights into the genetic mechanisms underlying these traits.

Population genetic structure analyses revealed small genetic differences among individuals, which were supported by the genetic distance matrix and PCA. To correct for potential population stratification, PCA was included as a covariate in the GWAS model, which was validated by λ values indicating effective correction. Pearson’s correlation analysis revealed strong genetic correlations among BW, WEC, and TruW (correlation coefficients > 0.86), as well as between App and CheW (correlation coefficient = 0.87). Based on these correlations, we categorized the traits into two groups, BTW and AC, to explore their shared and unique genetic bases. Four traits (excluding BW) exhibited significant sex-specific variation (p < 0.01). However, given the current sample size constraints (178 males vs. 145 females), sex-stratified GWAS would lack sufficient statistical power to reliably detect genetic associations. We therefore systematically incorporated sex as a fixed-effect covariate within our LMM framework to mitigate confounding effects arising from sex-related biases. This methodological strategy has been established as a standard practice in studies of sexually dimorphic traits across diverse species, including humans [,,], livestock [,,,], and aquatic species [,,,]. Future investigations will expand the cohort to approximately 1000 individuals per sex to enable systematic dissection of sex-specific genetic regulatory mechanisms underlying phenotypic sexual dimorphism. A significance threshold of p < 10⁻⁵ was selected for GWAS, due to the excessive stringency of the Bonferroni correction. This threshold allowed us to identify SNPs and genes with both statistical significance and potential breeding value.

GO analysis revealed both mutual and special terms in the BTW and AC groups. Shared GO terms included those related to actin and skeletal development, highlighting their role in muscle growth and weight improvement. Immune-related GO terms were enriched in both groups, suggesting that immune processes play a critical role in regulating body weight and appendage weight. Metabolic pathways, such as fatty acid metabolism and energy balance, were also shared, reflecting the metabolic regulation of growth and appendage specialization. Interestingly, vision- and olfaction-related GO terms were identified, hinting at a connection between feeding behavior and weight traits. Distinct pathways were observed between the two groups, aligning with their phenotypic differences. The BTW group showed enrichment in metabolism and immunity-related GO terms, reflecting the heightened genetic regulation necessary for the growth and development of the main body. Conversely, the AC group exhibited enrichment in morphogenesis, organ development, and behavior-related terms, consistent with the functional and morphological roles of appendages. For instance, GO terms related to sex organ development in the AC group corroborate the sexual dimorphism observed in crab appendages, such as male copulatory organs and female reproductive structures.

KEGG analysis further emphasized shared pathways, including the Hippo signaling pathway, which is a crucial regulator of organ size, tissue growth, and regeneration [,]. Previous studies have linked the Hippo pathway to size regulation in Drosophila eyes [], wings [], and mouse liver [], underscoring its conserved role across species. Comparative genomic analysis of the giant isopod (Bathynomus jamesi) revealed significant expansions in growth-related pathways including the Hippo signaling pathway, which may underpin its enormous body size []. The presence of some expanded gene families enriched in the Hippo signaling pathway in the genome of the swimming crab (Portunus trituberculatus) may be related to salinity adaption and immune stress in this species []. Additionally, the phototransduction pathway, involved in vision, was also enriched in both groups, aligning with its importance in feeding and growth. Notably, the enrichment of the phototransduction pathway has also been observed in a previous transcriptomic differential analysis between fast-growing and slow-growing groups of the Pacific white shrimp (P. vannamei) []. Meanwhile, comparative analysis revealed distinct KEGG pathway profiles between two groups. The BTW group exhibited unique enrichment in metabolic and immune-related pathways (e.g., lipid metabolism, lysosome activity), potentially reflecting their critical roles in systemic energy regulation and body development. In contrast, the AC group demonstrated specialized vitamin metabolism pathways, particularly involving vitamins A–, C–, and B6–micronutrients with established roles in chitin remodeling and skeletal homeostasis, as supported by recent reviews on vitamin–bone interactions [,]. These differences suggest that during the developmental process of different body parts in mud crabs, distinct biological pathways and mechanisms may be required to regulate their growth and function.

Comparative analyses of candidate regions revealed significant enrichment of SNP in specific regions: chr25 and chr36 for the BTW group and chr22 and chr36 for the AC group. These shared chromosomal regions may contain pleiotropic loci or genes with shared genetic determinants that simultaneously influence multiple traits within each group, potentially reflecting overlapping biological pathways or regulatory mechanisms. In view of the current limitations in resolving gene functions in the mud crab, this study used a cross-species functional annotation strategy to infer biological functions of candidate genes in mud crab by using gene function data in model organisms and economic aquatic animals. Consistent with expectations, key genes related to growth, development and immunity were annotated in these regions. For instance, the chr25 region (BTW group) harbors the neuropeptide CCHamide-1 receptor gene, which regulates feeding and growth in arthropods [,]. For example, suppressing the expression of CCHa1 or CCHa1R through RNA interference technology leads to a significant reduction in feeding and impaired growth and development in the pea aphid Acyrthosiphon pisum []. On the other hand, the chr15 region common to the AC group contains the DCX-EMA gene, which has been shown to be involved in insect locomotion and mechanosensory transduction []. The chr18 region contains the MSTO1 gene, which is relevant to the regulation of mitochondrial distribution and morphology. Mutations in this gene can lead to clinical manifestations of mitochondrial dysfunction, including muscle weakness, short stature, and delayed motor development []. Furthermore, the chr22 region comprises the beta-glucan-binding protein 2 gene, which is one of the key categories of pathogen recognition receptors and plays an important role in the immune system of arthropods [].

Shared candidate genes across the groups are mainly related to growth, development and immunity. For example, the NVD gene, essential for ecdysteroid synthesis and growth []. Loss of NVD function results in arrested molting and growth during Drosophila development []. Additionally, 7-DHC, a product of NVD, is a precursor of vitamin D3, which impacts crustacean growth, molting, and the immune system []. The CYP307A1 gene belongs to the cytochrome P450 (CYP) superfamily of cytochromes, and the CYP family undergoes significant expansion through gene duplication (e.g., tandem duplication) in crustaceans [,]. CYP307A1 is another key gene in ecdysteroid synthesis [] that is potentially involved in the regulation of insect growth and development []. FGF1, an important growth factor gene, plays a pivotal role in various biological processes, including accelerating wound healing and promoting tissue regeneration [,].

Moreover, other shared regions also contain many functional genes. These include the Merlin gene encoding a key upstream regulatory factor in the Hippo signaling pathway. Loss of Merlin function leads to Hippo pathway dysregulation, affecting growth and development []. The DLX gene is crucial for development [] and can influence insect recognition of specific odors by regulating olfactory-related gene expression []. The HPGD gene participates in inflammatory response regulation by modulating prostaglandin levels []. RGS10, a central regulator of the G-protein signaling pathway, affects Th1/Th17-mediated immune responses by modulating STAT1/STAT3 phosphorylation in mammals, suggesting its potential role in innate immune regulation []. In crustaceans (e.g., Litopenaeus vannamei, Eriocheir sinensis, and Scylla paramamosain), the Toll, IMD, and JAK/STAT pathways are core immune regulatory networks [,,]. There is cross-synergy among these pathways: JAK/STAT activation by Toll/IMD pathways collectively regulates antimicrobial peptide synthesis [,,]. Although direct experimental evidence of RGS10 in crustaceans is lacking, its conserved function in STAT signaling (e.g., JAK/STAT-mediated antifungal/antiviral immunity [,]) implies that RGS10 may indirectly influence Toll/IMD signaling via JAK/STAT pathways. For instance, Toll/IMD pathways in decapods rely on NF-κB signaling [], while JAK/STAT interacts with Toll pathways through interferon-like regulation [], indicating RGS10’s potential regulatory role in such networks. Finally, the ANKRD52 gene, located in the region of chromosome 36, is associated with height and body mass index (BMI) [,]. The candidate genes shared across the five traits are primarily related to growth, development, and immunity. These genes may modulate holistic growth-related networks, thereby influencing weight-related traits in each body part. These findings not only deepen our understanding of modular growth in decapod crustaceans, but also support diverse breeding strategies for S. paramamosain by identifying both shared and unique phenotypic genes.

Weight-related traits represent classic complex traits governed by gene–environment interplay [,]. Although our study strictly controlled macro-environmental conditions through standardized rearing protocols (uniform larval size, pond systems, and feeding management), residual micro-environmental variations and individual physiological differences may still introduce unmeasured environmental noise into GWAS analyses. For instance, individual feeding behavior variations could affect weight traits through differential nutrient intake efficiency, while micro-fluctuations in temperature might interact with growth-related phenotypes via epigenetic modifications or metabolic pathway regulation. Future studies could adopt precision agriculture technologies, such as deploying real-time environmental monitoring sensors (e.g., water temperature/oxygen loggers) and individual behavioral tracking systems (e.g., RFID feeding monitors), to construct linear mixed models incorporating micro-environmental variables. This would enable the deeper dissection of genotype-by-environment interactions underlying complex traits.

The relatively low SNP-based heritability estimates (h² = 0.20–0.40) likely arise from multiple factors: unaccounted environmental noise, non-additive genetic effects not captured by additive SNP models, limited sample sizes, and phenotypic measurement errors [,]. Nevertheless, genome-wide significant loci (p < 1 × 10⁻⁵) show substantially higher PVE (6.24–10.97%; mean 7.05%) than the genome-wide background (mean PVE = 0.3%), indicating their substantial genetic contributions. While power analysis confirms that our sample size is modest, all significant loci exhibit PVE > 6.24%, demonstrating practical genetic significance []. Under the current sample size, the statistical power reaches 60% for variants with mean effect size (PVE = 7.05%), which remains practically valuable for candidate gene screening. For loci with greater effect sizes (PVE > 8.8%), the statistical power exceeds 80%, confirming the reliability of our detection capability for significant associations. It is also noteworthy that published GWASs in aquaculture genetics with smaller sample sizes (e.g., n < 400) have similarly gained scientific recognition [,,,]. This study provides foundational candidate genes for mud crab weight-related traits and critical data for genetic dissection of economically important traits. Future work should incorporate non-additive genetic models, expanded sample sizes, refined environmental covariates, and multi-temporal phenotypic assessments to better disentangle genetic effects from confounding factors and improve detection power.

GWAS-identified SNPs significantly associated with weight-related traits facilitate quantitative trait locus (QTL) mapping, enabling the prioritization of candidate or even causal genes underlying these five traits. Notably, these QTL regions may harbor variants within coding regions (e.g., non-synonymous mutations) or regulatory elements (e.g., promoters, enhancers), which modulate causal gene expression through cis-regulatory interactions, thereby providing direct mechanistic insights into phenotypic determination. Furthermore, in describing the functions of GWAS candidate genes, while cross-species gene function annotations offer plausible mechanistic hypotheses, we emphasize the necessity of direct functional validation in crustacean systems to substantiate these associations and determine causality. Moving forward, targeted experimental validation utilizing integrated multi-omics approaches (e.g., transcriptomics, epigenomics) and molecular biology techniques (e.g., CRISPR/Cas9 gene editing) will be essential to rigorously test the observed gene–phenotype linkages and elucidate their biological significance. Practically, validated SNPs may serve as molecular markers to facilitate early-stage selection in mud crabs, thereby expediting genetic improvement of growth traits through marker-assisted selection (MAS) strategies.

5. Conclusions

In this study, we performed genome-wide association analyses for five weight-related traits in the mud crab S. paramamosain using whole-genome resequencing data. The results unveiled shared and specific GO terms and KEGG pathways among the five traits, which highlighted their genetic similarities and unique characteristics. Furthermore, we pinpointed common genes associated with growth and immunity across all five traits, along with distinct candidate genes specific to the BTW and AC groups. These promising candidate genes provide a valuable theoretical foundation for the genetic enhancement of weight-related traits in S. paramamosain. Further research, including multi-omics studies and functional verification experiments, is imperative to validate and refine the outcomes of this study.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani15131829/s1, Figure S1: Correlation among the five traits of female and male; Figure S2: Sex difference analyses for five traits; Figure S3: Quantile–quantile (QQ) plots of GWAS results; Figure S4: Required sample sizes for linear multiple regression (fixed model) testing a single regression coefficient; Figure S5: LD block of candidate SNPs on chromosome 3; Table S1: Information on whole genome resequencing data; Table S2: Significant SNPs from GWAS for five traits; Table S3: Candidate genes identified by GWAS for five traits; Table S4: LD analysis of nine shared candidate SNPs; Table S5: GO analysis for candidate genes; Table S6: KEGG pathway analysis for candidate genes.

Author Contributions

Conceptualization, X.W. and H.Y.; software, L.C., Y.Z. and X.W.; validation, L.C., P.J., S.Z., Q.Q., W.Z. and K.H.; formal analysis, L.C., Y.Z., P.J., S.Z., Q.Q., W.Z. and K.H.; investigation, L.C., Y.Z., P.J., S.Z., Q.Q., W.Z. and K.H.; resources, X.W. and H.Y.; data curation, X.W.; writing—original draft preparation, L.C. and Y.Z.; writing—review and editing, X.W. and H.Y.; visualization, L.C. and Y.Z.; supervision, X.W. and H.Y.; funding acquisition, X.W. and H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No.: 42206132), the Natural Science Foundation of Fujian Province, China (Grant No.: 2023J05154) and Jimei University (No.: ZQ2022022).

Institutional Review Board Statement

All animal experiments in this study were approved by the Institutional Animal Care and Use Committee of the Fisheries College of Jimei University (approval code: 2021-04; approval date: 22 January 2021).

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon reasonable request from the corresponding author as they are currently being used in ongoing research projects.

Acknowledgments

The author extends heartfelt gratitude to the mud crab farming company and other personnel for their contributions in sample delivery and collection.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

Zhao, M.; Wang, W.; Zhang, F.; Ma, C.; Liu, Z.; Yang, M.H.; Chen, W.; Li, Q.; Cui, M.; Jiang, K.; et al. A chromosome-level genome of the mud crab (Scylla paramamosain estampador) provides insights into the evolution of chemical and light perception in this crustacean. Mol. Ecol. Resour. 2021, 21, 1299–1317. [Google Scholar] [CrossRef] [PubMed]
Keenan, C.; Davie, P.J.F.; Mann, D.L. A revision of the genus Scylla de Haan, 1833 (Crustacea: Decapoda: Brachyura: Portunidae). Raffles Bull. Zool. 1998, 46, 217–245. [Google Scholar]
Lovatelli, A.; Shelley, C.; Tobias-Quinitio, E.; Khor, W.; Chan, D. Status, technological innovations, and industry development needs of mud crab (Scylla spp.) aquaculture. In Proceedings of the FAO Expert Workshop, Singapore, 27–30 November 2023; FAO Fisheries and Aquaculture Proceedings, No. 73. FAO: Rome, Italy, 2023. [Google Scholar]
Li, Y.; Ai, C.; Liu, L. Mud crab, Scylla paramamosain China’s leading maricultured crab. In Aquaculture in China: Success Stories and Modern Trends; Wiley: Hoboken, NJ, USA, 2018; pp. 226–233. [Google Scholar]
FAO. FAO Yearbook of Fishery and Aquaculture Statistics. In Fishery and Aquaculture Statistics—Yearbook 2021; FAO: Rome, Italy, 2024. [Google Scholar] [CrossRef]
Syafaat, M.N.; Azra, M.N.; Waiho, K.; Fazhan, H.; Abol-Munafi, A.B.; Ishak, S.D.; Syahnon, M.; Ghazali, A.; Ma, H.; Ikhwanuddin, M. A review of the nursery culture of mud crabs, genus Scylla: Current progress and future directions. Animals 2021, 11, 2034. [Google Scholar] [CrossRef] [PubMed]
Reis Neto, R.V.; Yoshida, G.M.; Lhorente, J.P.; Yáñez, J.M. Genome-wide association analysis for body weight identifies candidate genes related to development and metabolism in rainbow trout (Oncorhynchus mykiss). Mol. Genet. Genom. 2019, 294, 563–571. [Google Scholar] [CrossRef]
Yu, Y.; Wan, S.M.; Zhang, S.M.; Liu, J.Q.; Sun, A.L.; Wang, Y.; Zhu, Y.F.; Gu, S.X.; Gao, Z.X. Identification of SNPs and candidate genes associated with growth using GWAS and transcriptome analysis in Coilia nasus. Aquaculture 2024, 586, 740777. [Google Scholar] [CrossRef]
Gutierrez, A.P.; Yáñez, J.M.; Fukui, S.; Swift, B.; Davidson, W.S. Genome-wide association study (GWAS) for growth rate and age at sexual maturation in Atlantic salmon (Salmo salar). PLoS ONE 2015, 10, e0119730. [Google Scholar] [CrossRef]
Leeds, T.D.; Vallejo, R.L.; Weber, G.M.; Gonzalez-Pena, D.; Silverstein, J.T. Response to five generations of selection for growth performance traits in rainbow trout (Oncorhynchus mykiss). Aquaculture 2016, 465, 341–351. [Google Scholar] [CrossRef]
Omeka, W.K.M.; Liyanage, D.S.; Lee, S.; Lim, C.; Yang, H.; Sandamalika, W.M.G.; Udayantha, H.M.V.; Kim, G.; Ganeshalingam, S.; Jeong, T. Genome-wide association study (GWAS) of growth traits in olive flounder (Paralichthys olivaceus). Aquaculture 2022, 555, 738257. [Google Scholar] [CrossRef]
Oikonomou, S.; Samaras, A.; Tekeoglou, M.; Loukovitis, D.; Dimitroglou, A.; Kottaras, L.; Papanna, K.; Papaharisis, L.; Tsigenopoulos, C.S.; Pavlidis, M. Genomic selection and genome-wide association analysis for stress response, disease resistance and body weight in European seabass. Animals 2022, 12, 277. [Google Scholar] [CrossRef]
Zhou, Z.; Shao, G.; Shen, Y.; He, F.; Tu, X.; Ji, J.; Ao, J.; Chen, X. Extreme-phenotype genome-wide association analysis for growth traits in spotted sea bass (Lateolabrax maculatus) using whole-genome resequencing. Animals 2024, 14, 2995. [Google Scholar] [CrossRef]
Yang, Y.; Wu, L.; Wu, X.; Li, B.; Huang, W.; Weng, Z.; Lin, Z.; Song, L.; Guo, Y.; Meng, Z. Identification of candidate growth-related SNPs and genes using GWAS in brown-marbled grouper (Epinephelus fuscoguttatus). Mar. Biotechnol. 2020, 22, 153–166. [Google Scholar] [CrossRef]
Houston, R.D.; Bean, T.P.; Macqueen, D.J.; Gundappa, M.K.; Jin, Y.H.; Jenkins, T.L.; Selly, S.L.C.; Martin, S.A.M.; Stevens, J.R.; Santos, E.M. Harnessing genomics to fast-track genetic improvement in aquaculture. Nat. Rev. Genet. 2020, 21, 389–409. [Google Scholar] [CrossRef] [PubMed]
Lyu, D.; Yu, Y.; Wang, Q.; Luo, Z.; Zhang, Q.; Zhang, X.; Xiang, J.; Li, F. Identification of growth-associated genes by genome-wide association study and their potential application in the breeding of Pacific white shrimp (Litopenaeus vannamei). Front. Genet. 2021, 12, 611570. [Google Scholar] [CrossRef] [PubMed]
Garcia, B.F.; Mastrochirico-Filho, V.A.; Gallardo-Hidalgo, J.; Campos-Montes, G.R.; Medrano-Mendoza, T.; Rivero-Martínez, P.V.; Caballero-Zamora, A.; Hashimoto, D.T.; Yáñez, J.M. A high-density linkage map and sex-determination loci in Pacific white shrimp (Litopenaeus vannamei). BMC Genom. 2024, 25, 565. [Google Scholar] [CrossRef]
Gao, Z.; Zhang, W.; Jiang, S.; Qiao, H.; Xiong, Y.; Jin, S.; Fu, H. Genome-wide association and transcriptomic analysis and the identification of growth-related genes in Macrobrachium nipponense. BMC Genom. 2024, 25, 1182. [Google Scholar] [CrossRef] [PubMed]
Yan, F.-Y.; Xu, Y.-F.; Feng, W.-R.; He, Q.-H.; Hua, G.-A.; Li, W.-J.; Xu, P.; Zhou, J.; Tang, Y.-K. Genomic analysis of hypoxia-tolerant population of the Chinese mitten crab (Eriocheir sinensis). Fish Shellfish Immunol. 2024, 154, 109931. [Google Scholar] [CrossRef]
Santos, J.L.; Nick, F.; Adhitama, N.; Fields, P.D.; Stillman, J.H.; Kato, Y.; Watanabe, H.; Ebert, D. Trehalose mediates salinity-stress tolerance in natural populations of a freshwater crustacean. Curr. Biol. 2024, 34, 4160–4169. [Google Scholar] [CrossRef]
Zhang, Y.; Yuan, Y.; Zhang, M.; Yu, X.; Qiu, B.; Wu, F.; Tocher, D.R.; Zhang, J.; Ye, S.; Cui, W. High-resolution chromosome-level genome of Scylla paramamosain provides molecular insights into adaptive evolution in crabs. BMC Biol. 2024, 22, 255. [Google Scholar] [CrossRef]
Ye, S.; Zhou, X.; Ouyang, M.; Cui, W.; Xiang, Z.; Zhang, Y.; Yuan, Y.; Ikhwanuddin, M.; Li, S.; Zheng, H. Development and validation of a 40 K liquid SNP array for the mud crab (Scylla paramamosain). Aquaculture 2025, 594, 741394. [Google Scholar] [CrossRef]
Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, S.; Li, X. rMVP: A memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genom. Proteom. Bioinform. 2021, 19, 619–628. [Google Scholar] [CrossRef]
Chen, S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. Imeta 2023, 2, e107. [Google Scholar] [CrossRef]
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar]
Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M. Twelve years of SAMtools and BCFtools. Gigascience 2021, 10, giab008. [Google Scholar] [CrossRef] [PubMed]
McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed]
Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef]
Chang, C.C.; Chow, C.C.; Tellier, L.C.A.M.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 2015, 4, s13742-015. [Google Scholar] [CrossRef]
Yin, L.; Zhang, H.; Tang, Z.; Yin, D.; Fu, Y.; Yuan, X.; Li, X.; Liu, X.; Zhao, S. HIBLUP: An integration of statistical models on the BLUP framework for efficient genetic evaluation using big genomic data. Nucleic Acids Res. 2023, 51, 3501–3512. [Google Scholar] [CrossRef]
Peterson, B.G.; Carl, P.; Boudt, K.; Bennett, R.; Ulrich, J.; Zivot, E.; Lestel, M.; Balkissoon, K.; Wuertz, D.; Christidis, A.A.; et al. PerformanceAnalytics: Econometric Tools for Performance and Risk Analysis. R Package Version 2.0.4. Available online: https://CRAN.R-project.org/package=PerformanceAnalytics (accessed on 16 June 2025).
Wickham, H.; Sievert, C. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016. [Google Scholar]
Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef]
Kolde, R. pheatmap: Pretty Heatmaps. R Package Version 1.0.12. Available online: https://CRAN.R-project.org/package=pheatmap (accessed on 16 June 2025).
Zheng, J.-S.; Lai, C.-Q.; Parnell, L.D.; Lee, Y.-C.; Shen, J.; Smith, C.E.; Casas-Agustench, P.; Richardson, K.; Li, D.; Noel, S.E. Genome-wide interaction of genotype by erythrocyte n-3 fatty acids contributes to phenotypic variance of diabetes-related traits. BMC Genom. 2014, 15, 781. [Google Scholar] [CrossRef]
Fang, L.; Wang, Q.; Hu, Y.; Jia, Y.; Chen, J.; Liu, B.; Zhang, Z.; Guan, X.; Chen, S.; Zhou, B. Genomic analyses in cotton identify signatures of selection and loci associated with fiber quality and yield traits. Nat. Genet. 2017, 49, 1089–1098. [Google Scholar] [CrossRef]
Naderi, E.; Crijns, A.P.G.; Steenbakkers, R.J.H.M.; van den Hoek, J.G.M.; Boezen, H.M.; Alizadeh, B.Z.; Langendijk, J.A. A two-stage genome-wide association study of radiation-induced acute toxicity in head and neck cancer. J. Transl. Med. 2021, 19, 481. [Google Scholar] [CrossRef]
Shim, H.; Chasman, D.I.; Smith, J.D.; Mora, S.; Ridker, P.M.; Nickerson, D.A.; Krauss, R.M.; Stephens, M. A multivariate genome-wide association analysis of 10 LDL subfractions, and their response to statin treatment, in 1868 Caucasians. PLoS ONE 2015, 10, e0120758. [Google Scholar] [CrossRef] [PubMed]
Faul, F.; Erdfelder, E.; Lang, A.-G.; Buchner, A. G* Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods 2007, 39, 175–191. [Google Scholar] [CrossRef] [PubMed]
Gatti, D.M.; Svenson, K.L.; Shabalin, A.; Wu, L.-Y.; Valdar, W.; Simecek, P.; Goodwin, N.; Cheng, R.; Pomp, D.; Palmer, A. Quantitative trait locus mapping methods for diversity outbred mice. G3 Genes Genomes Genet. 2014, 4, 1623–1633. [Google Scholar] [CrossRef] [PubMed]
Gao, J.; Wang, Y.; Liu, J.; Chen, F.; Guo, Y.; Ke, H.; Wang, X.; Luo, M.; Fu, S. Genome-wide association study reveals genomic loci of sex differentiation and gonadal development in Plectropomus leopardus. Front. Genet. 2023, 14, 1229242. [Google Scholar] [CrossRef]
Silva, E.F.P.; Gaia, R.C.; Mulim, H.A.; Pinto, L.F.B.; Iung, L.H.S.; Brito, L.F.; Pedrosa, V.B. Genome-wide association study of conformation traits in Brazilian holstein cattle. Animals 2024, 14, 2472. [Google Scholar] [CrossRef]
Fonseca, P.A.; Suárez-Vega, A.; Marras, G.; Cánovas, A. GALLO: An R package for genomic annotation and integration of multiple data sources in livestock for positional candidate loci. Gigascience 2020, 9, giaa149. [Google Scholar] [CrossRef]
Huerta-Cepas, J.; Szklarczyk, D.; Heller, D.; Hernández-Plaza, A.; Forslund, S.K.; Cook, H.; Mende, D.R.; Letunic, I.; Rattei, T.; Jensen, L.J. eggNOG 5.0: A hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019, 47, D309–D314. [Google Scholar] [CrossRef]
Carlson, M.; Pagès, H. AnnotationForge: Tools For Building SQLite-Based Annotation Data Packages. R Package Version 1.44.0. 2019. Available online: https://bioconductor.org/packages/AnnotationForge (accessed on 16 June 2025).
Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L.I. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef]
Browning, B.L.; Tian, X.; Zhou, Y.; Browning, S.R. Fast two-stage phasing of large-scale sequence data. Am. J. Hum. Genet. 2021, 108, 1880–1890. [Google Scholar] [CrossRef]
Smith, J.L.; Wilson, M.L.; Nilson, S.M.; Rowan, T.N.; Schnabel, R.D.; Decker, J.E.; Seabury, C.M. Genome-wide association and genotype by environment interactions for growth traits in US Red Angus cattle. BMC Genom. 2022, 23, 517. [Google Scholar] [CrossRef] [PubMed]
Tan, W.; Tang, Y.; Liu, F.; Lu, L.; Liu, A.; Ye, H. Evaluation of the Effect of Adipokinetic Hormone/Corazonin-Related Peptide (ACP) on Ovarian Development in the Mud Crab, Scylla paramamosain. Animals 2024, 14, 3706. [Google Scholar] [CrossRef]
Zhang, Y.; Wu, Q.; Fang, S.; Li, S.; Zheng, H.; Zhang, Y.; Ikhwanuddin, M.; Ma, H. mRNA profile provides novel insights into stress adaptation in mud crab megalopa, Scylla paramamosain after salinity stress. BMC Genom. 2020, 21, 559. [Google Scholar] [CrossRef] [PubMed]
Waiho, K.; Shi, X.; Fazhan, H.; Li, S.; Zhang, Y.; Zheng, H.; Liu, W.; Fang, S.; Ikhwanuddin, M.; Ma, H. High-density genetic linkage maps provide novel insights into ZW/ZZ sex determination system and growth performance in mud crab (Scylla paramamosain). Front. Genet. 2019, 10, 298. [Google Scholar] [CrossRef] [PubMed]
Zhao, M.; Wang, W.; Chen, W.; Ma, C.; Zhang, F.; Jiang, K.; Liu, J.; Diao, L.; Qian, H.; Zhao, J. Genome survey, high-resolution genetic linkage map construction, growth-related quantitative trait locus (QTL) identification and gene location in Scylla paramamosain. Sci. Rep. 2019, 9, 2910. [Google Scholar] [CrossRef]
Yengo, L.; Vedantam, S.; Marouli, E.; Sidorenko, J.; Bartell, E.; Sakaue, S.; Graff, M.; Eliasen, A.U.; Jiang, Y.; Raghavan, S. A saturated map of common genetic variants associated with human height. Nature 2022, 610, 704–712. [Google Scholar] [CrossRef]
Jonsdottir, A.B.; Sveinbjornsson, G.; Thorolfsdottir, R.B.; Tamlander, M.; Tragante, V.; Olafsdottir, T.; Rognvaldsson, S.; Sigurdsson, A.; Eggertsson, H.P.; Aegisdottir, H.M. Missense variants in FRS3 affect body mass index in populations of diverse ancestries. Nat. Commun. 2025, 16, 2694. [Google Scholar] [CrossRef]
Zhang, X.; Brody, J.A.; Graff, M.; Highland, H.M.; Chami, N.; Xu, H.; Wang, Z.; Ferrier, K.R.; Chittoor, G.; Josyula, N.S. Whole genome sequencing analysis of body mass index identifies novel African ancestry-specific risk allele. Nat. Commun. 2025, 16, 3470. [Google Scholar] [CrossRef]
Wang, J.; Fan, T.; Du, Z.; Xu, L.; Chen, Y.; Zhang, L.; Gao, H.; Li, J.; Ma, Y.; Gao, X. Genome-wide association analysis identifies the PMEL gene affecting coat color and birth weight in Simmental × Holstein. Animals 2023, 13, 3821. [Google Scholar] [CrossRef]
Han, M.; Wang, X.; Du, H.; Cao, Y.; Zhao, Z.; Niu, S.; Bao, X.; Rong, Y.; Ao, X.; Guo, F. Genome-wide association study identifies candidate genes affecting body conformation traits of Zhongwei goat. BMC Genom. 2025, 26, 37. [Google Scholar] [CrossRef]
Tu, T.-C.; Lin, C.-J.; Liu, M.-C.; Hsu, Z.-T.; Chen, C.-F. Genomic prediction and genome-wide association study for growth-related traits in taiwan country chicken. Animals 2025, 15, 376. [Google Scholar] [CrossRef] [PubMed]
Zhang, W.; Wang, H.; Brandt, D.Y.C.; Hu, B.; Sheng, J.; Wang, M.; Luo, H.; Li, Y.; Guo, S.; Sheng, B. The genetic architecture of phenotypic diversity in the Betta fish (Betta splendens). Sci. Adv. 2022, 8, eabm4955. [Google Scholar] [CrossRef] [PubMed]
Zhong, Z.; Jiao, Z.; Yu, F.-X. The Hippo signaling pathway in development and regeneration. Cell Rep. 2024, 43, 113926. [Google Scholar] [CrossRef]
Pan, D. Hippo signaling in organ size control. Genes Dev. 2007, 21, 886–897. [Google Scholar] [CrossRef] [PubMed]
Wu, S.; Huang, J.; Dong, J.; Pan, D. hippo encodes a Ste-20 family protein kinase that restricts cell proliferation and promotes apoptosis in conjunction with salvador and warts. Cell 2003, 114, 445–456. [Google Scholar] [CrossRef]
Liu, A.; O’Connell, J.; Wall, F.; Carthew, R.W. Scaling between cell cycle duration and wing growth is regulated by Fat-Dachsous signaling in Drosophila. eLife 2024, 12, RP91572. [Google Scholar] [CrossRef]
Dong, J.; Feldmann, G.; Huang, J.; Wu, S.; Zhang, N.; Comerford, S.A.; Gayyed, M.F.; Anders, R.A.; Maitra, A.; Pan, D. Elucidation of a universal size-control mechanism in Drosophila and mammals. Cell 2007, 130, 1120–1133. [Google Scholar] [CrossRef]
Yuan, J.; Zhang, X.; Kou, Q.; Sun, Y.; Liu, C.; Li, S.; Yu, Y.; Zhang, C.; Jin, S.; Xiang, J. Genome of a giant isopod, Bathynomus jamesi, provides insights into body size evolution and adaptation to deep-sea environment. BMC Biol. 2022, 20, 113. [Google Scholar] [CrossRef]
Lv, J.; Li, R.; Su, Z.; Gao, B.; Ti, X.; Yan, D.; Liu, G.; Liu, P.; Wang, C.; Li, J. A chromosome-level genome of Portunus trituberculatus provides insights into its evolution, salinity adaptation and sex determination. Mol. Ecol. Resour. 2022, 22, 1606–1625. [Google Scholar] [CrossRef]
Huang, Y.; Wang, G.; Liu, J.; Zhang, L.; Huang, S.; Wang, Y.; Yang, Z.; Ge, H. Analysis of transcriptome difference between rapid-growing and slow-growing in Penaeus vannamei. Gene 2021, 787, 145642. [Google Scholar] [CrossRef]
Skalny, A.V.; Aschner, M.; Tsatsakis, A.; Rocha, J.B.T.; Santamaria, A.; Spandidos, D.A.; Martins, A.C.; Lu, R.; Korobeinikova, T.V.; Chen, W. Role of vitamins beyond vitamin D 3 in bone health and osteoporosis. Int. J. Mol. Med. 2024, 53, 9. [Google Scholar] [CrossRef] [PubMed]
Fratoni, V.; Brandi, M.L. B vitamins, homocysteine and bone health. Nutrients 2015, 7, 2176–2192. [Google Scholar] [CrossRef]
Tan, J.; Neupert, S.; Paluzzi, J.-P. Functional characterization of CCHamides and deorphanization of their receptors in the yellow fever mosquito, Aedes aegypti. Gen. Comp. Endocrinol. 2024, 359, 114618. [Google Scholar] [CrossRef] [PubMed]
Shahid, S.; Amir, M.B.; Ding, T.B.; Liu, T.X.; Smagghe, G.; Shi, Y. RNAi of Neuropeptide CCHamide-1 and Its Receptor Indicates Role in Feeding Behavior in the Pea Aphid, Acyrthosiphon pisum. Insects 2024, 15, 939. [Google Scholar] [CrossRef] [PubMed]
Song, X.; Cui, L.; Wu, M.; Wang, S.; Song, Y.; Liu, Z.; Xue, Z.; Chen, W.; Zhang, Y.; Li, H. DCX-EMAP is a core organizer for the ultrastructure of Drosophila mechanosensory organelles. J. Cell Biol. 2023, 222, e202209116. [Google Scholar] [CrossRef]
Iwama, K.; Takaori, T.; Fukushima, A.; Tohyama, J.; Ishiyama, A.; Ohba, C.; Mitsuhashi, S.; Miyatake, S.; Takata, A.; Miyake, N. Novel recessive mutations in MSTO1 cause cerebellar atrophy with pigmentary retinopathy. J. Hum. Genet. 2018, 63, 263–270. [Google Scholar] [CrossRef]
Vargas-Albores, F.; Yepiz-Plascencia, G. Beta glucan binding protein and its role in shrimp immune response. Aquaculture 2000, 191, 13–21. [Google Scholar] [CrossRef]
Yoshiyama, T.; Namiki, T.; Mita, K.; Kataoka, H.; Niwa, R. Neverland is an evolutionally conserved Rieske-domain protein that is essential for ecdysone synthesis and insect growth. Development 2006, 133, 2565–2574. [Google Scholar] [CrossRef]
Liu, S.; Wang, X.; Bu, X.; Zhang, C.; Qiao, F.; Qin, C.; Li, E.; Qin, J.G.; Chen, L. Influences of dietary vitamin D3 on growth, antioxidant capacity, immunity and molting of Chinese mitten crab (Eriocheir sinensis) larvae. J. Steroid Biochem. Mol. Biol. 2021, 210, 105862. [Google Scholar] [CrossRef]
Han, J.; Kim, D.-H.; Kim, H.-S.; Nelson, D.R.; Lee, J.-S. Genome-wide identification of 52 cytochrome P450 (CYP) genes in the copepod Tigriopus japonicus and their B [α] P-induced expression patterns. Comp. Biochem. Physiol. Part D Genom. Proteom. 2017, 23, 49–57. [Google Scholar] [CrossRef]
Baldwin, W.S.; Marko, P.B.; Nelson, D.R. The cytochrome P450 (CYP) gene superfamily in Daphnia pulex. BMC Genom. 2009, 10, 169. [Google Scholar] [CrossRef] [PubMed]
Jin, X.; Ma, L.; Zhang, F.; Zhang, L.; Yin, J.; Wang, W.; Zhao, M. Identification and Evolution Analysis of the Genes Involved in the 20-Hydroxyecdysone Metabolism in the Mud Crab, Scylla paramamosain: A Preliminary Study. Genes 2024, 15, 1586. [Google Scholar] [CrossRef] [PubMed]
Zhang, L.; Lu, Y.; Xiang, M.; Shang, Q.; Gao, X. The retardant effect of 2-tridecanone, mediated by cytochrome P450, on the development of cotton bollworm, Helicoverpa armigera. BMC Genom. 2016, 17, 954. [Google Scholar] [CrossRef] [PubMed]
Yu, H.; Mo, H.; Gao, J.; Yao, M.; Du, Y.; Liu, K.; Zhang, Q.; Yu, J.; Li, Y.; Wang, L. Fibroblast growth factor 1 (FGF1) improves glucose homeostasis, modulates gut microbial composition, and reduces inflammatory responses in rainbow trout (Oncorhynchus mykiss) fed a high-fat diet. Int. J. Biol. Macromol. 2024, 281, 136226. [Google Scholar] [CrossRef]
Hsu, C.C.; Wu, K.L.H.; Peng, J.M.; Wu, Y.N.; Chen, H.T.; Lee, M.S.; Cheng, J.H. Low-energy extracorporeal shockwave therapy improves locomotor functions, tissue regeneration, and modulating the inflammation induced FGF1 and FGF2 signaling to protect damaged tissue in spinal cord injury of rat model: An experimental animal study. Int. J. Surg. 2024, 110, 7563–7572. [Google Scholar] [CrossRef]
Sherbet, G.V. Hippo Signalling in Cell Proliferation, Migration and Angiogenesis. In Molecular Approach to Cancer Management; Academic Press: Cambridge, MA, USA, 2017; pp. 68–80. [Google Scholar]
Rubenstein, J.L.; Nord, A.S.; Ekker, M. DLX genes and proteins in mammalian forebrain development. Development 2024, 151, dev202684. [Google Scholar] [CrossRef]
Duan, S.G.; Liu, A.; Wang, C.; Zhang, R.L.; Lu, J.; Wang, M.Q. Homeotic Protein Distal-Less Regulates NlObp8 and NlCsp10 to Impact the Recognition of Linalool in the Brown Planthopper Nilaparvata lugens. J. Agric. Food Chem. 2023, 71, 10291–10303. [Google Scholar] [CrossRef]
Tai, H.H.; Cho, H.; Tong, M.; Ding, Y. NAD+-linked 15-hydroxyprostaglandin dehydrogenase: Structure and biological functions. Curr. Pharm. Des. 2006, 12, 955–962. [Google Scholar] [CrossRef]
Yang, Y.; Shao, Y.; Gao, X.; Hu, Z.; Wang, Y.; Ma, C.; Jin, G.; Zhu, F.; Dong, G.; Zhou, G. RGS10 deficiency alleviated intestinal mucosal inflammation through suppression of Th1/Th17 cell immune responses in ulcerative colitis. Immunology 2025, 174, 139–152. [Google Scholar] [CrossRef]
Liu, Y.; He, Y.; Cao, J.; Lu, H.; Zou, R.; Zuo, Z.; Li, R.; Zhang, Y.; Sun, J. Correlative analysis of transcriptome and proteome in Penaeus vannamei reveals key signaling pathways are involved in IFN-like antiviral regulation mediated by interferon regulatory factor (PvIRF). Int. J. Biol. Macromol. 2023, 253, 127138. [Google Scholar] [CrossRef]
Wang, Y.; Yang, L.-G.; Feng, G.-P.; Yao, Z.-L.; Li, S.-H.; Zhou, J.-F.; Fang, W.-H.; Chen, Y.-H.; Li, X.-C. PvML1 suppresses bacterial infection by recognizing LPS and regulating AMP expression in shrimp. Front. Immunol. 2022, 13, 1088862. [Google Scholar] [CrossRef] [PubMed]
Betancourt, J.L.; Rodríguez-Ramos, T.; Dixon, B. Pattern recognition receptors in Crustacea: Immunological roles under environmental stress. Front. Immunol. 2024, 15, 1474512. [Google Scholar] [CrossRef]
Wang, Y.; Liu, A.; Huang, Y.; Lu, L.; Guo, S.; Ye, H. Role of crustacean female sex hormone in regulating immune response in the mud crab, Scylla paramamosain. Fish Shellfish Immunol. 2023, 142, 109094. [Google Scholar] [CrossRef]
Limkul, S.; Phiwthong, T.; Massu, A.; Jaree, P.; Thawonsuwan, J.; Teaumroong, N.; Boonanuntanasarn, S.; Somboonwiwat, K.; Boonchuen, P. The interferon-like proteins, Vagos, in Fenneropenaeus merguiensis elicit antimicrobial responses against WSSV and VPAHPND infection. Fish Shellfish Immunol. 2022, 131, 718–728. [Google Scholar] [CrossRef] [PubMed]
Kichaev, G.; Bhatia, G.; Loh, P.-R.; Gazal, S.; Burch, K.; Freund, M.K.; Schoech, A.; Pasaniuc, B.; Price, A.L. Leveraging polygenic functional enrichment to improve GWAS power. Am. J. Hum. Genet. 2019, 104, 65–75. [Google Scholar] [CrossRef]
Zhu, Z.; Guo, Y.; Shi, H.; Liu, C.-L.; Panganiban, R.A.; Chung, W.; O’Connor, L.J.; Himes, B.E.; Gazal, S.; Hasegawa, K. Shared genetic and experimental links between obesity-related traits and asthma subtypes in UK Biobank. J. Allergy Clin. Immunol. 2020, 145, 537–549. [Google Scholar] [CrossRef] [PubMed]
Ni, G.; Van Der Werf, J.; Zhou, X.; Hyppönen, E.; Wray, N.R.; Lee, S.H. Genotype–covariate correlation and interaction disentangled by a whole-genome multivariate reaction norm model. Nat. Commun. 2019, 10, 2239. [Google Scholar] [CrossRef]
de Kinderen, M.A.J.; Sölkner, J.; Mészáros, G.; Alemu, S.W.; Esatu, W.; Bastiaansen, J.W.M.; Komen, H.; Dessie, T. Genotype by environment interactions (G* E) of chickens tested in Ethiopia using body weight as a performance trait. Animals 2023, 13, 3121. [Google Scholar] [CrossRef]
Gamazon, E.R.; Park, D.S. SNP-based heritability estimation: Measurement noise, population stratification, and stability. In The Genetic Architecture of Neuropsychiatric Traits: Mechanism, Polygenicity, and Genome Gunction; Gamazon, E.R., Ed.; Universiteit van Amsterdam: Amsterdam, The Netherlands, 2016; pp. 277–294. [Google Scholar]
Zhu, H.; Zhou, X. Statistical methods for SNP heritability estimation and partition: A review. Comput. Struct. Biotechnol. J. 2020, 18, 1557–1568. [Google Scholar] [CrossRef]
Wang, M.; Xu, S. Statistical power in genome-wide association studies and quantitative trait locus mapping. Heredity 2019, 123, 287–306. [Google Scholar] [CrossRef]

Figure 1. Statistics of phenotypic and sequencing data. (A) Five weight-related traits of S. paramamosain. The abbreviations BW, WEC, TruW, AppW, and CheW correspond to body weight, weight excluding chelae, trunk weight, appendage weight, and cheliped weight, respectively. (B) Correlation among the five traits. *** indicates a p-value less than 0.001. (C) Density plot of SNP distribution across chromosomes based on sequencing data, generated using the R package CMplot (v4.5.1) []. (D) Distribution plot of the minor allele frequency (MAF) counts for SNPs. (E) Classification information of SNPs after annotation.

Figure 2. Population genetic structure analysis. (A) Principal component analysis; red and blue dots represent female and male individuals, respectively; (B) heatmap of the genetic relationship matrix for the tested population.

Figure 3. Manhattan plots from genome-wide association studies. (A) Body weight (BW); (B) trunk weight (TruW); (C) weight excluding chelae (WEC); (D) appendage weight (AppW); (E) cheliped weight (CheW). Red dashed lines indicate the significance threshold (p = 10⁻⁵). Chromosome 6 (highlighted in red) corresponds to the sex chromosome as previously reported [].

Figure 4. Venn diagram analysis of GWAS results. (A) Shared significant SNP loci across five traits. Numerical values within intersection areas denote the quantity of shared SNPs, with the central overlap indicating nine SNPs common to all five traits. (B) Shared candidate genes across five traits. Numerical values within intersection areas denote the quantity of shared genes, with the central overlap indicating 49 candidate genes common to all five traits. The full names corresponding to the five phenotypic abbreviations can be found in Table 1.

Figure 5. Functional enrichment analysis of candidate genes. The top 10 significant GO terms for biological process (BP), cellular component (CC), and molecular and function (MF) categories are presented for common candidate genes in the (A) BTW group, (B) AC group, and (C) the genes common to both BTW and AC groups. Additionally, the top 20 significant KEGG pathways are displayed for shared candidate genes in the (D) BTW group, (E) AC group, and (F) the genes common to both BTW and AC groups.

Figure 6. Genome-wide association analyses reveal pleiotropic loci for growth-related traits in mud crab. Manhattan plots highlighting significant genomic regions associated with phenotypic traits: (A) chromosome 25 (11.40–12.40 Mb), shared association with BW, TruW, and WEC; (B) chromosome 36 (5.60–5.80 Mb), shared association with BW, TruW, and WEC; (C) chromosome 18 (7.48–7.90 Mb), shared association with AppW and CheW; (D) chromosome 22 (8.70–9.44 Mb), shared association with AppW and CheW; (E) chromosome 3 (40.26–40.47 Mb), pleiotropic region influencing all five traits; (F) chromosome 36 (2.60–3.00 Mb), pleiotropic region influencing all five traits. Red dashed line indicates the significance threshold (p = 1 × 10⁻⁵). The full names corresponding to the five phenotypic abbreviations can be found in Table 1. Blue dashed lines demarcate the physical position interval in the Manhattan plot (upper panel), which directly corresponds to the physical position range displayed in the lower panel.

Table 1. Summary statistic of weight-related traits in Scylla paramamosain.

Trait ^a	No ^b	Mean (±SD) ^c/g	Female: Mean (±SD) ^c/g	Male: Mean (±SD) ^c/g	CV (%) ^d	Female: CV (%) ^d	Male: CV (%) ^d	h^{2 e}
BW	320	218.74 ± 74.37	213.52 ± 74.31	219.49 ± 66.93	34.00	34.80	30.49	0.32
TruW	317	132.07 ± 45.35	141.93 ± 48.44	119.66 ± 30.44	34.34	34.12	25.43	0.20
WEC	319	157.66 ± 53.28	168.15 ± 58.42	145.79 ± 39.14	33.79	34.74	26.84	0.25
AppW	317	86.67 ± 38.59	69.24 ± 25.97	97.23 ± 35.08	44.53	37.51	36.08	0.38
CheW	313	61.08 ± 30.90	44.56 ± 15.75	69.54 ± 25.52	50.59	35.35	36.70	0.40

^a Five weight-related traits: BW, body weight; TruW, trunk weight; WEC, weight excluding chelae; AppW, appendage weight; and CheW, cheliped weight. ^b Number of animals used for GWAS, ^c mean (±standard deviation), ^d coefficient of variation, ^e heritability value.

Table 2. Shared candidate SNPs and genes across all five traits.

Chr	Nsnp	QTL Region	Ngene
2	1	37,665,876–37,865,876	4
3	3	40,271,752–40,471,757	9
15	1	22,304,281–22,504,281	15
21	0	14,509,234–14,885,809	1
25	1	17,254,201–17,454,201	1
26	1	8,689,084–8,889,084	6
30	1	14,347,661–14,547,661	9
34	0	15,078,189–15,522,120	1
36	1	2,632,612–2,832,612	3

Chr, chromosome; Nsnp, number of shared candidate SNPs; QTL, quantitative trait locus; Ngene, number of shared candidate genes.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Genome-Wide Association Study for Weight-Related Traits in Scylla paramamosain Using Whole-Genome Resequencing

Simple Summary

Abstract

1. Introduction

2. Materials and Methods

2.1. Sample and Phenotype Collection

2.2. DNA Extraction, Sequencing, and Variant Calling

2.3. Phenotypic Heritability and Correlations

2.4. Population Structure Analysis

2.5. Genome-Wide Association Study

2.6. Gene Annotation and Functional Enrichment Analysis

3. Results

3.1. Measurement of Phenotypic and Genomic Data

3.2. Population Structure

3.3. GWAS Results

3.4. Functional Annotation

3.5. Key Candidate Area

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics