Next Article in Journal
Differential Metabolomics and Cardiac Function in Trained vs. Untrained Yili Performance Horses
Previous Article in Journal
ECR-MobileNet: An Imbalanced Largemouth Bass Parameter Prediction Model with Adaptive Contrastive Regression and Dependency-Graph Pruning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complex Sex Determination in the Grey Mullet Mugil cephalus Suggested by Individual Whole Genome Sequence Data

by
Mbarsid Racaku
1,*,
Serena Ferraresso
1,
Massimiliano Babbucci
1,
Andres Blanco
2,
Costas S. Tsigenopoulos
3,
Tereza Manousaki
3,
Jelena Radojicic
3,
Vasileios Papadogiannis
3,
Paulino Martínez
2,
Luca Bargelloni
1 and
Tomaso Patarnello
1
1
Department of Comparative Biomedicine and Food Science, University of Padova, 35020 Legnaro, Padua, Italy
2
Department of Zoology, Genetics and Physical Anthropology, Faculty of Veterinary, Universidade de Santiago de Compostela, Campus Terra, 27002 Lugo, Spain
3
Institute of Marine Biology, Biotechnology and Aquaculture, Hellenic Centre for Marine Research, P.O. Box 2214, 71003 Heraklion, Crete, Greece
*
Author to whom correspondence should be addressed.
Animals 2025, 15(16), 2445; https://doi.org/10.3390/ani15162445
Submission received: 16 June 2025 / Revised: 13 August 2025 / Accepted: 19 August 2025 / Published: 20 August 2025

Simple Summary

Determining whether a fish will develop as male or female is a crucial issue for aquaculture, as it can significantly impact growth rates, reproduction, and production planning. For the flathead grey mullet (Mugil cephalus), a commercially valuable fish, the genetic factors that control sex determination are still unclear. In this study, we generated a new genome assembly and compared genomic data of male and female fish from two distinct Mediterranean populations (Aegean and Tyrrhenian Seas). We found that a previously known sex-related gene was relevant in the Tyrrhenian population but not the Aegean one, and we also identified other putative sex-associated variants which might play a role in sex determination. These results suggest that sex in the flathead grey mullet is influenced by several genetic factors and probably also by the environment. Our works provides new insight for the better understating and management of reproduction in this species.

Abstract

Mugil cephalus is a cosmopolitan marine fish highly relevant from ecological and economic perspectives. Previous studies identified sex-associated variants in the follicle-stimulating hormone receptor (fshr) gene following an XX/XY sex determination (SD) system. However, these variants could not be fully associated with sex in all samples. This suggests other genes and/or environmental factors may be involved in the SD of this species, denoting intraspecific variation. In this study, we constructed a new high-quality genome assembly of M. cephalus. We then re-sequenced the whole genome in males and females from two divergent Mediterranean populations to ascertain whether other genetic variants could also be involved in SD. fshr gene variants showed to only partially explain SD, while a new intronic variant in the sestd1 gene appeared to be associated with SD following a ZZ/ZW system. The presence of other putative candidate SD variants showing significant differences between the two populations suggested a regional pattern of variation in SD in the Mediterranean Sea. The incomplete association of all the identified variants also pointed to a potential role for environmental factors.

1. Introduction

The flathead grey mullet, Mugil cephalus, is a teleost belonging to the family Mugilidae, which comprises 25 genera and 76 species [1]. It is a cosmopolitan marine fish highly significant both from ecological and economic perspectives. M. cephalus has been a mainstay of traditional aquaculture for centuries, especially in the Mediterranean region. The traditional Italian delicacy “bottarga” is the primary factor contributing to the commercial relevance of M. cephalus. This is a product made from salted and dried female gonads that is considered a luxury food (the price is more than 200 EUR/kg) in numerous regions of the world [2]. Mullet roe is a costly product used to supplement fish flesh, particularly in Asian and Mediterranean markets [3].
The flathead grey mullet is a species without obvious sexual dimorphism, although females grow slightly faster than males and reach bigger sizes. Its aquaculture production increased from 25,600 tonnes in 1997 to 147,000 tonnes in 2003 [4], with growing interest also motivated by mullets being a low-trophic species. Furthermore, different studies consider M. cephalus an excellent sentinel for environmental health, particularly for hypoxia and heavy metal pollution [5]. High levels of heavy metal pollution have been found in the gonads of this species but also in fish organs where metals accumulate during crucial stages of embryonic development, negatively affecting embryogenesis [6].
Teleost fish exhibit remarkable diversity in physiology, behaviour, and developmental processes, including sex determination mechanisms, which show huge variation across the different taxonomic levels [7,8]. Unlike the heteromorphic and highly differentiated sex chromosomes of mammals and birds, the sex chromosome pair of most fishes is homomorphic and slightly differentiated [9]. Indeed, only around 5–10% of fish species was shown to possess heteromorphic sex chromosomes [10,11]. In fish, sex chromosomes have undergone quick evolutionary changes characterised by transitions between SD master genes (Master SD, MSD, genes) and different SD systems [12].
Sex determination in teleosts can range from strictly genetic sex determination (GSD) to environmental sex determination (ESD) mechanisms [13]. This plasticity may allow for rapid adaptation to changing environmental conditions, contributing to the evolutionary success of teleosts [14], as demonstrated by the nearly 30,000 species described for this taxonomic group accounting for half of all extant vertebrates and approximately 98% of all ray-finned fish species. While several genetic loci underlying SD have been reported, environmental factors such as temperature, photoperiod, social interaction, pH, and hypoxia also play a significant role in many species [15]. Within GSD systems, fish display diverse mechanisms including male heterogamety (XY system), female heterogamety (ZW system), and polygenic SD with or without multiple sex chromosomes [12,16].
The increasing information on SD mechanism in teleosts has demonstrated a rapid evolutionary turnover, divergence, and diversity, even between closely related species [17]. The genetic architecture of SD in fish often resembles a complex trait, involving major and minor genetic factors, environmental influence, and potential epistatic interactions between genes [12]. GSD can range from fully heteromorphic chromosomes to cryptic differences that are only detectable at the molecular level. The same SD genes have been recruited repeatedly along evolution, mostly belonging to specific developmental pathways. These include transcription factors at the top of the sex differentiation cascade, the transforming growth factor beta (TGF-β) pathway and, more recently, genes implicated in the steroidogenesis pathway. Up to now, more than 21 distinct SD genes have been found across 114 fish species spanning 18 orders, most of which are connected to these particular pathways [17].
Previous studies on M. cephalus identified several single-nucleotide polymorphisms (SNP) associated with sex in the follicle-stimulating hormone receptor (fshr) gene [18,19]. Subsequently, the fshr gene was also reported in Solea senegalensis as the MSD gene following an XX/XY SD system [20]. Ferraresso et al. [18] identified three fshr SNP variants associated with sex in exon 14 (Muce179, Muce206, and Muce322), the first two corresponding to missense mutations and the third one to a synonymous variant. Later, Curzon et al. [19] and Anderson et al. [21] reported fshr SNP variants in both exon 1 and 14; those in exon 14 (c.1732G>A and c.1759T>G) correspond to the Muce179 and Muce206 variants from Ferraresso et al. [18].
However, these two main variants were not completely associated with phenotypic sex in all samples. Ferraresso et al. [18] studied two Mediterranean regions, the Aegean and the Tyrrhenian, and revealed variable associations of fshr genotypes with sex, especially for males. In fact, in the Aegean region the association between phenotype and genotype at fshr in males was around 50% (heterozygous WT/mut1: wild type/alternative variant), while in the Tyrrhenian region, it was approximately 86%, very close to the 90% reported in an Israeli Mediterranean estuary by Curzon et al. [19]. These observations suggest that other genes and/or environmental factors may be involved in the SD of this species, denoting intraspecific SD variation. Intraspecific SD variation has been documented in different fish species, such as Northern pike (Esox lucius) or Atlantic silverside (Menidia menidia) [22].
In Ferraresso et al. [18], a pool-sequencing approach was used on a Tyrrhenian population sample to identify putative SD loci against a draft mullet genome. Nucleotide variants at the most significant locus, fshr, were tested using targeted amplicon sequencing on the Tyrrhenian individual samples and a second population from the Aegean Sea. In this study, we assembled the whole genome of grey mullet to provide a robust genomic framework to identify genetic variants putatively associated association with sex. A subset of samples from the two populations analysed in Ferraresso et al. [18] was re-sequenced at the whole genome level to assess whether other genetic loci in addition to fshr might be involved, thus providing a more comprehensive explanation for SD in the species.

2. Materials and Methods

2.1. Genome Assembly

2.1.1. DNA Extraction and Nanopore Sequencing

One juvenile specimen of flathead grey mullet (M. cephalus), captured in December 2020, was used for genome assembly. DNA extraction were performed using Miller salt extraction [23] with RNase digestion and chloroform steps added, followed by final elution in 5 mM TRIS/HCl pH 8.5 buffer. This DNA was further purified using QIAGEN Genomic-tip 20/G (Cat. No.: 10223, QIAGEN, Hilden, Germany), with elution in 55–60 µL TE. The purification process involved two rounds of pooling ten separate DNA extractions, resulting in approximately 40 µg of DNA for genome sequencing and assembly.
After genomic tip purification, the DNA was size-selected following centrifugation with PEG8000 and NaCl, which depletes DNA below 4–6 Kb. A total amount of 12.4 µg of HMW DNA was finally obtained as measured in a Qubit fluorometer and NanoDrop microvolume spectrophotometer.
Four LSK109 libraries were constructed using the Ligation sequencing kit (SQK-LSK109) and deployed across five 5 flow cells on the MinION and MinION Mk1C portable sequencers (Oxford Nanopore Technologies, Oxford, UK). The end-prep step was extended for 35 min, enhancing the proportion of longer reads rendering a coverage of 142×. MinION and the MinION Mk1C were operated in parallel and displayed comparable performance. Raw sequencing data from Illumina sequencing yielded an average mean depth ~48×.

2.1.2. Genome Assembly

For genome assembly, a combination of bioinformatic tools was employed to process and analyse both short and long-read sequencing data. Trimming for Illumina reads was performed with Trimmomatic v0.39 [24], and for MinION reads with Porechop v0.2.4 [25]. Quality control was performed with FastQC v0.11.8 [26] for Illumina data, and Nanoplot v1.33.0 [27] for MinION data, ensuring the integrity of input sequences.
The initial assembly was performed with long-read data using the Flye assembler v2.8.1 [28] with the parameter “--scaffold” for self-scaffolding, which is specifically designed to handle these data efficiently. To improve the accuracy of this initial assembly, we implemented two steps of the polishing process. First, we used Racon v1.4.3 [29] and Medaka v1.6.1 [30] to polish the long-read assembly, leveraging the length and coverage of these reads. Subsequently, further refinement of the assembly was performed with Pilon v1.24 [31], which incorporates the high accuracy of short-read data to polish errors.
To assess the quality of our final assembly, we employed QUAST v5.0.2 [32] for evaluating contiguity metrics and BUSCO 5.4.7 [33] for assessing completeness, providing a comprehensive evaluation of our genome assembly.
The mitochondrial DNA genome was retrieved from the assembly using GetOrganelle v1.7.6.1 [34] and its annotation was performed with MITOS v2.1.9 [35]. This genome was removed for further analysis.

2.1.3. Genome Annotation

Annotation was performed previously masking the genome following the user guide of FasTE [36]. First, a transposable element (TE) library was constructed using EDTA [37] for their annotation in the target genome. Using the --sensitive parameter, EDTA uses RepeatModeler v2.0.1 [38] to identify remaining TEs.
Subsequently, the EDTA library was classified with DeepTE.py [39], to be used as an input for RepeatMasker [38], which employs a library of TEs to filter repetitive DNA sequences, masking them. The library used for genome masking was a combination of the de novo TE library created from the genome of M. cephalus and FishTEDB [40], a fish-specific TE library.
De novo gene prediction was performed using the soft-masked genome with MAKER v3.01.03 [41] pipeline. First, an ab initio gene prediction was performed using SNAP [42], GeneMark-ES v4.70 [43], and Augustus [44]. Genome functional annotation was performed from BLAST+ v 2.9.0 [45] queries of extracted protein sequences against Unitprot [46] and the InterPro database [47]. The final GFF produced was analysed with AGAT v1.0.0 [48] to fix, check, and add missing information to create a complete, sorted, and standardised gff3 format.

2.1.4. Alignment of M. cephalus Genome Assemblies

M. cephalus genome assembly was mapped against the chromosome-level genome assembly of M. cephalus constructed from a Chennai Indian sample (CIBA_Mcephalus_1.1, GenBank GCA_022458985.1) [49] using minimap2 v2.25-r1173 [50]. For comparative analysis, 155 contigs and 26 scaffolds, representing ~91.2% of our total genome assembly, were homologous to the CIBA_Mcephalus_1.1 reference genome, indicating the high genomic correspondence between them (Table S1). Despite the high similarity between the two genomes, we decided to employ the genome assembly generated in the present study for SNP calling, considering the genetic differentiation that might exist between Indian and Mediterranean populations, as also documented in a recent work by Thieme and colleagues [51].

2.2. Whole Genome Sequencing (WGS) SNP Genotyping

2.2.1. Sample Collection and Genotyping

The M. cephalus samples came from two distinct Mediterranean areas, the Aegean and the Tyrrhenian (West Mediterranean) Seas (Figure 1). The Aegean samples were collected in Kavala (Northern Greece), whereas in the Tyrrhenian Sea, samples were collected at Cabras, Orbetello, and Tortolì (Italy).
The sampling, DNA extraction, and fshr genotyping of these samples were previously conducted [18]. From this dataset, we selected a subset of individuals for whole genome sequencing intending to explore other genomic regions putatively involved in SD that could explain the lack of full association of fshr with sex in the species.

2.2.2. Sample Sequencing

Genomic libraries for paired-end whole genome sequencing (WGS) were prepared according to Illumina DNA Prep Tagmentation (cat#20018704, San Diego, CA, USA) protocol. The quality of the libraries was assessed using Agilent 2100 Bioanalyzer (Palo Alto, CA, USA) and each DNA library was quantified using the Qubit® dsDNA HS Assay Kit (Invitrogen, Carlsbad, CA, USA). Sequencing was performed on the Novaseq6000 machine (Oslo, Norway) with an average mean depth ~28× per sample.
A total of 72 individuals were selected for WGS, namely 32 females and 40 males. The criterion for sample selection from Ferraresso et al. [18] was to include individuals with expected and unexpected genotypes according to the fshr information to identify other putative loci underlying SD in M. cephalus. Thus, for both geographic regions, 16 females were selected, with 12 of them showing the expected WT/WT fshr genotype, but 4 showing WT/mut1. Additionally, of the 20 males from each population, 11 were had the WT/mut1 genotype and 9 had the unexpected WT/WT (Table S2).

2.2.3. Raw Reads Processing and SNP Calling

Raw sequencing data from Illumina sequencing were checked for quality using FastQC software v0.11.5. Then, raw reads were trimmed with Trim Galore! v0.6.10 [52] to remove adaptors and low-quality bases; default parameters were applied except for length (set to 70) and q (25), and the option “nextera” was applied to remove adaptors.
The trimmed reads were mapped against our reference genome using the BWA version 0.7.17-r1188 [53]. To ensure that subsequent analyses with PicardTools [54] and variant calling were compatible, the -c 1 and -M parameters were used. Every SAM file produced from BWA was converted and sorted into a BAM file using SAMtools [55]. Finally, we identified and removed any duplicate reads with PicardTools v3.0.0 using default mode.
The reference genome and the final read alignment were used as input for the variant calling BCFtools v1.17 [55] using mpileup, call, and filter commands. To avoid keeping alignments with low mapQ, we set -q to 20 for the mpileup command and the -m parameter for call. Filtering was performed with BCFtools to remove indels setting -g to 5. Finally, VCFtools v0.1.16 [56] was used to retain only bi-allelic sites with minor allele frequency (MAF) > 0.15 and to remove sites that contained any indel or missing data. The same workflow for mapping and SNP calling was performed using the CIBA_Mcephalus_1.1 genome [49].
The VCF data contained genetic information from 1275 contigs and 29 scaffolds. To refine the analysis, the CIBA_Mcephalus_1.1 genome was used, selecting the 155 contigs and 26 scaffolds outlined before. Subsequently, the main VCF file was reorganised by grouping these selected contigs into the 23 reconstructed chromosome VCF file. This approach allowed to anchor the genetic data to the chromosomes of the species, aiding meaningful genomic analyses.

2.3. Sex Determination (SD) in M. cephalus

2.3.1. Genetic Differentiation

Since no significant genetic differentiation was detected among the three Tyrrhenian locations studied (see Section 3), they were grouped in a single population. Genetic differentiation between the two main geographical areas studied (Tyrrhenian and Aegean) was performed as a necessary reference for further analyses on intraspecific SD variation in M. cephalus. The relative component of genetic differentiation (FST) was estimated with Genepop 4.7 [57]. A discriminant analysis of principal components (DAPC) was also performed for an integral visualisation of genetic differentiation using the two most informative components [58]. Based on this information, the Tyrrhenian and Aegean samples were analysed as two differentiated regions (see Section 3). Then, the analysis with VCFtools v0.1.16 was repeated starting from raw data to avoid losing any region-specific genetic variants.

2.3.2. Identification of the SD Regions

Each reconstructed chromosome was screened to estimate the relative component of genetic differentiation between males and females (FST) and the intrapopulation (sex) fixation index (FIS) [59] using adegenet packages v2.1.10 [60] from each vcf chromosome file. FST and FIS were explored to identify genomic regions associated with sex, consistent with a ZZ/ZW or XX/XY SD system (FST = 0.5; FIS = −1), reflecting allele fixation in the homogametic sex and full heterozygosity in the heterogametic sex. Candidate SNPs with associations were also selected based on their associated p-values (p < 0.005), calculated with Genepop 4.7 [57].

3. Results

3.1. Genome Sequencing

The Oxford Nanopore Technologies (ONT) MinION sequencing of the juvenile M. cephalus used for genome assembly produced 13,613,901 raw reads, amounting to a total of 100,081,010,194 bp with an N50 of 13,358 bp. After trimming, the MinION data resulted in a slightly increased read count of 13,626,708, which can be attributed to the computational splitting of reads containing internal adapters, thus comprehending 99,527,883,358 bp and very similar N50. All data suggested high-quality sequencing with minimal adapter contamination or low-quality regions. Illumina 150 bp pair-end (PE) sequencing of the same individual generated 126,368,599 raw reads, 109,184,810 of them being retained after trimming.

3.2. Genome Assembly and Annotation

The final assembly consisted of 1275 contigs and 29 scaffolds, a total length of 658,521,108 bp, and a contig N50 of 8.38 Mb with a maximum contig length of 30.5 Mb. The completeness results from BUSCO, using the Actinopterygii database, yielded 98.2%, with 97.6% single-copy and 0.6% duplicates. The identification of transposable element (TE) resulted in 31.9% of the total ONT-generated genome (~211 Mb) (Table S3) within the range reported for other teleost species (1.6% to 37.1%) [61].
After masking TEs in the genome assembly, a total of 26,536 protein-coding genes were predicted from the MAKER pipeline. The BUSCO completeness of gene annotation using the same Actinopterygii database was 92.1%, with 90.7% single-copy and 1.4% duplicates.
The mitochondrial genome was obtained as a single contig encompassing 16,706 bp and including 37 annotated coding genes (Figure S1).

3.3. SNP Calling

The average number of 150 bp PE Illumina reads per sample was 74.2 million, with 72.7 million being retained after trimming. The raw read count per sample ranged from a minimum of 56.7 million (54.7 million after trimming) to a maximum of 104.6 million (102.2 million after trimming) (Table S4). The percentage of mapped reads in the reference genome was 99%. BCFtools identified a total of 2,835,032 SNPs after filtering (MAF > 0.15) from the total number of aligned reads from the 72 samples against our genome assembly.

3.4. Comparison CIBA_Mcephalus_1.1

The alignment between the CIBA_Mcephalus_1.1 genome against our genome assembly rendered 99.8% sequence homology. Although the BUSCO metrics of our assembly were similar to CIBA_Mcephalus_1.1, the contiguity was significantly lower. By selecting only the 155 contigs and 26 scaffolds that matched the CIBA_Mcephalus_1.1, the contiguity of our genome assembly was improved (Table S5). However, a portion of some contigs were found to be present on both chromosome 2 and the very small chromosome 24. We decided to maintain those contigs on the bigger chromosome 2.
We compared the number of SNPs identified in our 72 individuals using the two reference genomes. This analysis revealed 4,813,525 SNPs with the CIBA_Mcephalus_1.1 genome, significantly higher than the 2,835,021 SNPs identified when using present study’s genome assembly as the reference. The increase in the number of SNPs detected (58.9%) might be explained by the high geographical distance between the populations to which the individuals used for assembly belong, and also by the higher completeness of the CIBA_Mcephalus_1.1 genome. However, considering the genetic divergence between the Mediterranean Sea and the Indian Ocean samples, we opted to use our assembly as a reference for SNP calling and genotyping.

3.5. Genetic Structure

Genetic differentiation between Tyrrhenian samples was low and not significant (p > 0.05), thus they were subsequently treated as a single population (Figure S2). Considering the low but significant genetic differentiation between Tyrrhenian and Aegean regions (FST = 0.0041, p < 0.05; Figure 2), we performed SNP calling within each region to avoid losing genetic variants exceeding the MAF threshold only in one region.
Following the outlined procedure, 2,814,512 SNPs were called in the Aegean population, and 3,013,979 SNPs in the Tyrrhenian one. We should note that the reference genome was assembled using an individual belonging to the Aegean region. Additionally, the main 155 contigs and 26 scaffolds previously selected which matched the CIBA_Mcephalus_1.1 genome were subsequently extracted from the VCF file, and the number of SNPs retained was similar to those observed in the original VCF file. A total of 2,871,004 SNPs were called for the Tyrrhenian population, and 2,676,984 SNPs were called for the Aegean population.

3.6. Screening the Genome for Sex-Associated SNPs

The number of SNPs examined for genome screening in the Aegean and Tyrrhenian populations ranged from 65,293 and 69,320 SNPs in C23 to 217,950 and 231,210 in C1, respectively (Table S6).
The analysis of single FIS and FST values allowed us to identify several variants potentially associated with SD, according to the criteria established (FIS < −0.5 and FST > 0.3). Only five variants met those criteria for the Aegean population (Figure 3), while nine variants were found for the Tyrrhenian population (Figure 4). All the candidate variants were in different contigs and positions in both populations, except for an intron of SEC14 and spectrin domains 1 (sestd1) gene from the scaffold_2829 p412770 shared by both geographical regions (Table S7).

3.7. SD Candidate Variants

Of the five sex-associated variants detected in the Aegean region, four were intergenic and one was intronic. The association of genotypes with sex-adjusted either to a ZW/ZZ system for three SNPs or to a XX/XY system for another two SNPs (Table 1; Figure 3). Focusing on the four intergenic variants, two were consistent with an XY sex-determining system, where the Y-allele frequency in males ranged between 0.375 and 0.400. Conversely, the other two intergenic variants were consistent with a ZW SD system, where the W-allele frequency in females ranged from 0.370 and 0.43.
In the Tyrrhenian population, three analyses with different datasets were performed. A first analysis was initially performed on a subset of samples that reflected natural frequencies. The frequencies of males with the expected fshr genotype (WT/mut1), males with a non-concordant genotype (WT/WT), females with the expected fshr genotype (WT/WT), and females with a non-concordant genotype (WT/mut1) were balanced based on the work reported in Ferraresso et al. [18]. Despite the small sample size, one fshr variant showed a significant association with phenotypic sex, confirming the results reported in Ferraresso et al. [18]. Moreover, other loci showing possible sex-associated were detected.
A second comparison was carried out using only males with a non-concordant genotype (WT/WT) versus females with the expected fshr genotype (WT/WT)). In this case, several variants, but especially the intronic variant on the sestd1 gene, were found.
In the third comparison, all available individuals were used, 12 females with the expected WT/WT fshr genotype, 4 females with genotype WT/mut1, 11 males were with the WT/mut1 genotype, and 9 males with the genotype WT/WT. Only nine variants with a sex-association were found, and these included the aforementioned intronic variant on sestd1 (Table 2; Figure 4).
After identifying the sex-associated SNPs, the flanking regions within ±100 kb of each variant were analysed, several genes were detected within these intervals. Only a few flanking genes could be linked to biological pathways or processes related to sex determination (Supplementary Materials, Tables S2 and S3) [62,63,64,65,66,67,68,69,70,71,72,73,74,75,76].

4. Discussion

The mechanism underlying SD in teleosts is known to depend on different genetic and environmental factors. Several studies have reported the presence of male heterogamety (XY system), female heterogamety (ZW system), or polygenic SD with or without multiple sex chromosomes [12,16]. SD might also show intraspecific variation [22]. Various environmental factors might be involved by influencing, partially or totally, the gonad differentiation cascade at the top or bottom level in different stages of growth [15]. Genetic and environmental factors may also interact to determine phenotypic sex. For instance, in the pejerrey (Odontesthes bonariensis), amhy has been consistently identified as a MSD gene [77]. However, in this species, SD is also strongly influenced by temperature. If the temperature during a critical stage is around 17 °C, a 100% female sex ratio is observed. Conversely, if the temperature is around 29 °C, a 100% male sex ratio is achieved [78]. Another well documented case is the European sea bass (Dicentrarchus labrax), where early exposure to low temperature (16 °C) increases the proportion of females, but prolonged exposure leads to a sex ratio skewed towards males by up to 90% [79,80]
The flathead grey mullet apparently shows a complex mechanism of SD with several of the features summarised above. The present study builds upon the work of Ferraresso et al. [18], utilising whole-genome re-sequencing on individual samples and extending the analysis on the Aegean population that was previously analysed only with a targeted locus approach. Single-individual WGS data were analysed against a novel and more complete genome reference, allowing for greater accuracy in SNP calling and mapping and a much higher resolution genome scan to identify potential sex-linked variants. Although the fshr gene has previously been reported to be the putative MSD gene in M. cephalus, not all individuals showed the expected correspondence between genotype and phenotype [18,19,21]. Despite the limited sample size, the individual whole-genome re-sequencing approach confirmed the potential role of fshr in the Tyrrhenian population with the natural frequencies. Likewise, the already known variants at this locus showed no significant association with sex in the Aegean population, as already reported in [18]. Extending the analysis to other SNPs within the fshr region clearly showed that this locus is not significantly involved in sex determination in the Aegean population sample. For this population, five SNPs, located on different contigs, all in non-coding regions, showed a significant association with phenotypic sex. The most interesting one is the intronic variant located within the sestd1 gene. The encoded protein has been shown to interact with key components of the β-catenin-dependent and independent Wnt signalling pathways [81,82], which is known to have an important role in sex determination and gonadal development in vertebrates. The finding that putative SD SNPs are located in non-coding regions, either in introns or intergenic regions, at variance with the evidence for fshr might be explained with the hypothesis that they are involved the regulation of gene expression. It is well documented that intronic variants may play a relevant role in mRNA transcription [83,84,85,86]. They can activate cryptic splice sites (CSS), resulting in the inclusion of intronic sequence or the partial exclusion of exons in the mature mRNA [87], leading to an altered protein with possible effects on its function [88,89]. Moreover, many enhancers are located within introns and intergenic regions [90,91]. Understanding the effects of these non-coding variants and identifying their target gene represents a significant challenge [88].
Considering the limited sample size, these putative SD loci remain to be fully validated on a larger set of animals. However, it is already evident, even with the current sample size, that all putative SD variants in the Aegean population showed only incomplete association with phenotypic sex, therefore it seems that in this population there is not an equivalent of the fshr locus and an MSD may not exist at all.
The evidence of incomplete association between any identified genetic variants and phenotypic sex points toward not only the absence of an MSD for the flathead grey mullet, but also to the possible role of multiple loci that differ across populations. A polygenic genetic architecture has been already proposed for other species such as the European sea bass [92], and the same species was reported to show variation in GSD across populations [93]. A second, not mutually exclusive hypothesis is that environmental factors might reduce the effect of genetic ones. It should also be considered that genetic variants might determine the response to environmental variables as in the case of temperature for the European sea bass [94].
In the case of M. cephalus, the environmental component might explain the incomplete concordance between fshr genotypes and phenotypic sex observed in the Tyrrhenian and, the absence of putative MSD in the Aegean. In the European sea bass and the tongue sole (Cynoglossus semilaevis), temperature can modify the final sex via epigenetic regulation and/or modulating the stress hormone-mediated pathway [95,96]. It is tempting to speculate that temperature could play a role in mullet SD as well, which might explain the differences observed between the Aegean and the Tyrrhenian populations. The Aegean Sea is typically warmer than the Tyrrhenian Sea and a long-term trend analysis suggests that the eastern Mediterranean is experiencing faster warming rates compared to the western part [97,98,99].

5. Conclusions

While the individual WGS allowed a genome scan for SD variants at single nucleotide resolution, the current sample size was limited and the putative SD loci remain to be fully confirmed. On the other hand, even with the highest possible resolution at the whole genome level there was no evidence for an MSD locus, especially for the Aegean population. This suggests that either several genetic loci contribute to SD in the grey mullet in a population-specific manner or local environmental factors are at play. As mentioned, these two hypotheses are not mutually exclusive, but both require a much more challenging experimental design to be fully tested. In the first case, a genome-wide association study (GWAS) on a much larger experimental population raised in the same controlled environment would be necessary for the GWAS to reliably identify major and minor loci that contribute to phenotypic sex and to assess the relative contribution. To perform a robust GWAS, it is necessary to produce a large number of families under controlled conditions, using parents from each population under study, if differences between populations are the objective. In the case of environmental factors, hatchery-produced mullets need to be raised from the early larval stages under controlled conditions testing the influence of specific external variables (e.g., high or low temperatures) on sex ratios in a trial-and-error process. Both approaches are currently quite challenging as artificial reproduction in the grey mullet is not as advanced as in other well-studied species, such as the European sea bass.
Finally, considering the complexity of SD in the flathead grey mullet and in fish in general, it would be interesting to study SD in other mullet species of the Mugilidae family, as has already been done in other teleost taxonomic groups such as flatfishes [20].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani15162445/s1. Table S1 Chromosome associations between the CIBA_Mcephalus_1.1 reference genome and the reduced assembled genome; Table S2 Genotype distribution of fshr in 72 individuals (32 females, 40 males) from two geographic regions; Table S3 Transposable element results; Table S4 Summary of Illumina sequencing read counts; Table S5 Comparative report of genome contiguity and completeness; Table S6 Number of SNPs per chromosome in each population; Table S7 Summary of candidate variants potentially linked to SD across populations; Figure S1 Location of mitochondrial protein genes M. cephalus mitochondrial genome; Figure S2 Scatterplot of DAPC analysis for Tyrrhenian population using 2,835,032 SNPs; Table S8 Metrics results from BUSCO and QUAST results for the genome assembly; Table S9 Intergenic variants identified in the Aegean population and annotation of genes present within ±100 kb of each variant’s position. NA (not available). Table S10 Intergenic variants identified in the Tyrrhenian population and annotation of genes present within ±100 kb of each variant’s position. NA (not available).

Author Contributions

M.R.: conceptualization, data curation, formal analysis, visualisation, writing—original draft. S.F.: investigation, resources. M.B.: formal analysis, resources. A.B.: formal analysis. C.S.T.: conceptualization, funding acquisition. T.M.: formal analysis. J.R.: resources. V.P.: formal analysis. P.M.: conceptualization, writing—review and editing. L.B.: supervision, writing—review and editing. T.P.: conceptualization, funding acquisition, project administration, supervision, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Recovery and Resilience Plan (NRRP), Mission 4, Component 2, Investment 1.4—funded by the European Union through NextGeneration EU—award number: project code CN_00000033, Italian Ministry of University and Research, “National Biodiversity Future Centre—NBFC”. This study received also funding from Regional Government Xunta de Galicia (Spain) (grant number ED431C 2022/33). This study received also funding from the European Union’s Horizon 2020 through the AQUAEXCEL 2020 project [grant no. 652831, TNA AE14008]. This research was supported in part through computational resources provided by the Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC) of the Hellenic Centre for Marine Research (HCMR). Funding for establishing the IMBBC HPC was received from the MARBIGEN (EU Regpot) project, LifeWatchGreece RI, and the CMBR (Centre for the study and sustainable exploitation of Marine Biological Resources) RI, implemented under the Action “Reinforcement of the Research and Innovation Infrastructure”, funded by the Operational Programme “Competitiveness, Entrepreneurship and Innovation” (NSRF 2014-2020), and co-financed by Greece and the European Union (European Regional Development Fund).

Institutional Review Board Statement

Mugil cephalus samples were received from commercial catches in the Mediterranean sea. No fish were handled while alive for the purpose of this project. All fish were dead when they were selected for the study. Thus, the research did not involve animal experimentation or harm, and required no ethical permits.

Data Availability Statement

In present study, all the data were deposited in the two NCBI BioProject, accession number PRJNA1221841 for Illumina data and accession number PRJNA1220361 for genome data.

Acknowledgments

We would like to thank Adrián Casanova for his help in performing and understanding DAPC analysis. We also thank Jon B. Kristoffersen for help with the MinION sequencing experiments and Alexia Alevra for providing samples of M. cephalus.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Fricke, R.; Eschmeyer, W.N.; Van der Laan, R. (2024) Eschmeyer’s Catalog of Fishes: Genera, Species, References. Available online: http://researcharchive.calacademy.org/research/ichthyology/catalog/fishcatmain.asp (accessed on 7 November 2024).
  2. Crosetti, D.; Blaber, S.J.M. (Eds.) Biology, Ecology and Culture of Grey Mullets (Mugilidae); CRC Press: Boca Raton, FL, USA, 2016; ISBN 978-0-429-17475-9. [Google Scholar]
  3. Aizen, J.; Meiri, I.; Tzchori, I.; Levavi-Sivan, B.; Rosenfeld, H. Enhancing Spawning in the Grey Mullet (Mugil cephalus) by Removal of Dopaminergic Inhibition. Gen. Comp. Endocrinol. 2005, 142, 212–221. [Google Scholar] [CrossRef]
  4. FAO. Mugil cephalus. In Cultured Aquatic Species Fact Sheets. Text by Saleh, M.A. Edited and Compiled by Valerio Crespi and Michael. New.CD-ROM (Multilingual). 2009. Available online: https://www.fao.org/fishery/docs/CDrom/aquaculture/I1129m/file/es/es_flatheadgreymullet.htm (accessed on 6 November 2024).
  5. Waltham, N.J.; Teasdale, P.R.; Connolly, R.M. Use of Flathead Mullet (Mugil cephalus) in Coastal Biomonitor Studies: Review and Recommendations for Future Studies. Mar. Pollut. Bull. 2013, 69, 195–205. [Google Scholar] [CrossRef]
  6. Jones, J.C.; Reynolds, J.D. Effects of Pollution on Reproductive Behaviour of Fishes. Rev. Fish Biol. Fish. 1997, 7, 463–491. [Google Scholar] [CrossRef]
  7. Kikuchi, K.; Hamaguchi, S. Novel Sex-Determining Genes in Fish and Sex Chromosome Evolution. Dev. Dyn. 2013, 242, 339–353. [Google Scholar] [CrossRef]
  8. Mank, J.E.; Avise, J.C. Evolutionary Diversity and Turn-Over of Sex Determination in Teleost Fishes. Sex. Dev. 2009, 3, 60–67. [Google Scholar] [CrossRef] [PubMed]
  9. Charlesworth, D.; Charlesworth, B.; Marais, G. Steps in the Evolution of Heteromorphic Sex Chromosomes. Heredity 2005, 95, 118–128. [Google Scholar] [CrossRef] [PubMed]
  10. Devlin, R.H.; Nagahama, Y. Sex Determination and Sex Differentiation in Fish: An Overview of Genetic, Physiological, and Environmental Influences. Aquaculture 2002, 208, 191–364. [Google Scholar] [CrossRef]
  11. Oliveira, C.; Foresti, F.; Hilsdorf, A.W.S. Genetics of Neotropical Fish: From Chromosomes to Populations. Fish Physiol. Biochem. 2009, 35, 81–100. [Google Scholar] [CrossRef]
  12. Martínez, P.; Viñas, A.M.; Sánchez, L.; Díaz, N.; Ribas, L.; Piferrer, F. Genetic Architecture of Sex Determination in Fish: Applications to Sex Ratio Control in Aquaculture. Front. Genet. 2014, 5, 340. [Google Scholar] [CrossRef]
  13. Penman, D.J.; Piferrer, F. Fish Gonadogenesis. Part I: Genetic and Environmental Mechanisms of Sex Determination. Rev. Fish. Sci. 2008, 16, 16–34. [Google Scholar] [CrossRef]
  14. Mank, J.E.; Promislow, D.E.L.; Avise, J.C. Evolution of Alternative Sex-Determining Mechanisms in Teleost Fishes. Biol. J. Linn. Soc. 2006, 87, 83–93. [Google Scholar] [CrossRef]
  15. Baroiller, J.F.; D’Cotta, H.; Saillant, E. Environmental Effects on Fish Sex Determination and Differentiation. Sex. Dev. 2009, 3, 118–135. [Google Scholar] [CrossRef] [PubMed]
  16. Moore, E.C.; Roberts, R.B. Polygenic Sex Determination. Curr. Biol. 2013, 23, R510–R512. [Google Scholar] [CrossRef] [PubMed]
  17. Kitano, J.; Ansai, S.; Takehana, Y.; Yamamoto, Y. Diversity and Convergence of Sex-Determination Mechanisms in Teleost Fish. Annu. Rev. Anim. Biosci. 2024, 12, 233–259. [Google Scholar] [CrossRef]
  18. Ferraresso, S.; Bargelloni, L.; Babbucci, M.; Cannas, R.; Follesa, M.C.; Carugati, L.; Melis, R.; Cau, A.; Koutrakis, M.; Sapounidis, A.; et al. Fshr: A Fish Sex-Determining Locus Shows Variable Incomplete Penetrance across Flathead Grey Mullet Populations. iScience 2020, 24, 101886. [Google Scholar] [CrossRef]
  19. Curzon, A.Y.; Dor, L.; Shirak, A.; Meiri-Ashkenazi, I.; Rosenfeld, H.; Ron, M.; Seroussi, E. A Novel c.1759T>G Variant in Follicle-Stimulating Hormone-Receptor Gene Is Concordant with Male Determination in the Flathead Grey Mullet (Mugil cephalus). G3 Genes Genomes Genet. 2021, 11, jkaa044. [Google Scholar] [CrossRef]
  20. de la Herrán, R.; Hermida, M.; Rubiolo, J.A.; Gómez-Garrido, J.; Cruz, F.; Robles, F.; Navajas-Pérez, R.; Blanco, A.; Villamayor, P.R.; Torres, D.; et al. A Chromosome-Level Genome Assembly Enables the Identification of the Follicule Stimulating Hormone Receptor as the Master Sex-Determining Gene in the Flatfish Solea Senegalensis. Mol. Ecol. Resour. 2023, 23, 886–904. [Google Scholar] [CrossRef]
  21. Pan, Q.; Feron, R.; Jouanno, E.; Darras, H.; Herpin, A.; Koop, B.; Rondeau, E.; Goetz, F.W.; Larson, W.A.; Bernatchez, L.; et al. The Rise and Fall of the Ancient Northern Pike Master Sex-Determining Gene. eLife 2021, 10, e62858. [Google Scholar] [CrossRef]
  22. Miller, S.A.; Dykes, D.D.; Polesky, H.F. A Simple Salting out Procedure for Extracting DNA from Human Nucleated Cells. Nucleic Acids Res. 1988, 16, 1215. [Google Scholar] [CrossRef]
  23. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  24. Wick, R.R.; Judd, L.M.; Gorrie, C.L.; Holt, K.E. Completing Bacterial Genome Assemblies with Multiplex MinION Sequencing. Microb. Genom. 2017, 3, e000132. [Google Scholar] [CrossRef] [PubMed]
  25. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 2 December 2024).
  26. De Coster, W.; Rademakers, R. NanoPack2: Population-Scale Evaluation of Long-Read Sequencing Data. Bioinformatics 2023, 39, btad311. [Google Scholar] [CrossRef]
  27. Kolmogorov, M.; Yuan, J.; Lin, Y.; Pevzner, P.A. Assembly of Long, Error-Prone Reads Using Repeat Graphs. Nat. Biotechnol. 2019, 37, 540–546. [Google Scholar] [CrossRef] [PubMed]
  28. Vaser, R.; Sović, I.; Nagarajan, N.; Šikić, M. Fast and Accurate de Novo Genome Assembly from Long Uncorrected Reads. Genome Res. 2017, 27, 737–746. [Google Scholar] [CrossRef]
  29. Available online: https://github.com/nanoporetech/medaka (accessed on 2 December 2024).
  30. Walker, B.J.; Abeel, T.; Shea, T.; Priest, M.; Abouelliel, A.; Sakthikumar, S.; Cuomo, C.A.; Zeng, Q.; Wortman, J.; Young, S.K.; et al. Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLoS ONE 2014, 9, e112963. [Google Scholar] [CrossRef]
  31. Gurevich, A.; Saveliev, V.; Vyahhi, N.; Tesler, G. QUAST: Quality Assessment Tool for Genome Assemblies. Bioinformatics 2013, 29, 1072–1075. [Google Scholar] [CrossRef]
  32. Manni, M.; Berkeley, M.R.; Seppey, M.; Simão, F.A.; Zdobnov, E.M. BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Mol. Biol. Evol. 2021, 38, 4647–4654. [Google Scholar] [CrossRef]
  33. Jin, J.-J.; Yu, W.-B.; Yang, J.-B.; Song, Y.; de Pamphilis, C.W.; Yi, T.-S.; Li, D.-Z. GetOrganelle: A Fast and Versatile Toolkit for Accurate de Novo Assembly of Organelle Genomes. Genome Biol. 2020, 21, 241. [Google Scholar] [CrossRef]
  34. Bernt, M.; Donath, A.; Jühling, F.; Externbrink, F.; Florentz, C.; Fritzsch, G.; Pütz, J.; Middendorf, M.; Stadler, P.F. MITOS: Improved De Novo Metazoan Mitochondrial Genome Annotation. Mol. Phylogenet. Evol. 2013, 69, 313–319. [Google Scholar] [CrossRef]
  35. Bell, E.A.; Butler, C.L.; Oliveira, C.; Marburger, S.; Yant, L.; Taylor, M.I. Transposable Element Annotation in Non-Model Species: The Benefits of Species-Specific Repeat Libraries Using Semi-Automated EDTA and DeepTE De Novo Pipelines. Mol. Ecol. Resour. 2022, 22, 823–833. [Google Scholar] [CrossRef] [PubMed]
  36. Ou, S.; Su, W.; Liao, Y.; Chougule, K.; Agda, J.R.A.; Hellinga, A.J.; Lugo, C.S.B.; Elliott, T.A.; Ware, D.; Peterson, T.; et al. Benchmarking Transposable Element Annotation Methods for Creation of a Streamlined, Comprehensive Pipeline. Genome Biol. 2019, 20, 275. [Google Scholar] [CrossRef]
  37. Flynn, J.M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A.G.; Feschotte, C.; Smit, A.F. RepeatModeler2: Automated Genomic Discovery of Transposable Element Families. Proc. Natl. Acad. Sci. USA 2019, 117, 9451–9457. [Google Scholar] [CrossRef] [PubMed]
  38. Yan, H.; Bombarely, A.; Li, S. DeepTE: A Computational Method for de Novo Classification of Transposons with Convolutional Neural Network. Bioinformatics 2020, 36, 4269–4275. [Google Scholar] [CrossRef] [PubMed]
  39. Shao, F.; Wang, J.; Xu, H.; Peng, Z. FishTEDB: A Collective Database of Transposable Elements Identified in the Complete Genomes of Fish. Database 2018, 2018, bax106. [Google Scholar] [CrossRef]
  40. Holt, C.; Yandell, M. MAKER2: An Annotation Pipeline and Genome-Database Management Tool for Second-Generation Genome Projects. BMC Bioinform. 2011, 12, 491. [Google Scholar] [CrossRef] [PubMed]
  41. Korf, I. Gene Finding in Novel Genomes. BMC Bioinform. 2004, 5, 59. [Google Scholar] [CrossRef]
  42. Lomsadze, A.; Ter-Hovhannisyan, V.; Chernoff, Y.O.; Borodovsky, M. Gene Identification in Novel Eukaryotic Genomes by Self-Training Algorithm. Nucleic Acids Res. 2005, 33, 6494–6506. [Google Scholar] [CrossRef]
  43. Stanke, M.; Waack, S. Gene Prediction with a Hidden Markov Model and a New Intron Submodel. Bioinformatics 2003, 19, ii215–ii225. [Google Scholar] [CrossRef]
  44. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and Applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef]
  45. The UniProt Consortium. UniProt: The Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2023, 51, D523–D531. [Google Scholar] [CrossRef]
  46. Blum, M.; Chang, H.-Y.; Chuguransky, S.; Grego, T.; Kandasaamy, S.; Mitchell, A.; Nuka, G.; Paysan-Lafosse, T.; Qureshi, M.; Raj, S.; et al. The InterPro Protein Families and Domains Database: 20 Years On. Nucleic Acids Res. 2021, 49, D344–D354. [Google Scholar] [CrossRef]
  47. Dainat, J.; Hereñú, D.; Davis, E.; Crouch, K.; LucileSol; Agostinho, N.; pascal-git; tayyrov. NBISweden/AGAT: AGAT-v0.9.2. 2022. Available online: https://zenodo.org/records/6621429 (accessed on 2 December 2024).
  48. Shekhar, M.S.; Katneni, V.K.; Jangam, A.K.; Krishnan, K.; Prabhudas, S.K.; Jani Angel, J.R.; Sukumaran, K.; Kailasam, M.; Jena, J. First Report of Chromosome-Level Genome Assembly for Flathead Grey Mullet, Mugil Cephalus (Linnaeus, 1758). Front. Genet. 2022, 13, 911446. [Google Scholar] [CrossRef]
  49. Li, H. Minimap2: Pairwise Alignment for Nucleotide Sequences. Bioinformatics 2018, 34, 3094–3100. [Google Scholar] [CrossRef]
  50. Thieme, P.; Reisser, C.; Bouvier, C.; Rieuvilleneuve, F.; Béarez, P.; Coleman, R.R.; Anissa Volanandiana, J.J.; Pereira, E.; Nirchio–Tursellino, M.; Roldán, M.I.; et al. Historical Biogeography of the Mugil cephalus Species Complex and Its Rapid Global Colonization. Mol. Phylogenet. Evol. 2025, 205, 108296. [Google Scholar] [CrossRef] [PubMed]
  51. Krueger, F.; James, F.; Ewels, P.; Afyounian, E.; Weinstein, M.; Schuster-Boeckler, B.; Hulselmans, G. sclamons. FelixKrueger/TrimGalore: V0.6.10—Add Default Decompression Path. 2023. Available online: https://zenodo.org/records/7598955 (accessed on 2 December 2024).
  52. Li, H. Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM 2013. arXiv 2013, arXiv:1303.3997. [Google Scholar]
  53. Picard Tools—Broad Institute. Available online: https://broadinstitute.github.io/picard/ (accessed on 14 March 2024).
  54. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve Years of SAMtools and BCFtools. GigaScience 2021, 10, giab008. [Google Scholar] [CrossRef]
  55. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The Variant Call Format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [PubMed]
  56. Raymond, M.; Rousset, F. GENEPOP (Version 1.2): Population Genetics Software for Exact Tests and Ecumenicism. J. Hered. 1995, 86, 248–249. [Google Scholar] [CrossRef]
  57. Jombart, T.; Devillard, S.; Balloux, F. Discriminant Analysis of Principal Components: A New Method for the Analysis of Genetically Structured Populations. BMC Genet. 2010, 11, 94. [Google Scholar] [CrossRef]
  58. Wright, S. The Genetical Structure of Populations. Ann. Eugen. 1949, 15, 323–354. [Google Scholar] [CrossRef]
  59. Jombart, T.; Ahmed, I. Adegenet 1.3-1: New Tools for the Analysis of Genome-Wide SNP Data. Bioinformatics 2011, 27, 3070–3071. [Google Scholar] [CrossRef]
  60. Reinar, W.B.; Tørresen, O.K.; Nederbragt, A.J.; Matschiner, M.; Jentoft, S.; Jakobsen, K.S. Teleost Genomic Repeat Landscapes in Light of Diversification Rates and Ecology. Mob. DNA 2023, 14, 14. [Google Scholar] [CrossRef]
  61. Zhang, Y.; Cao, X.; Zou, Y.; Yan, Z.; Huang, Y.; Zhu, Y.; Gao, J. De Novo Gonad Transcriptome Analysis of Elongate Loach (Leptobotia Elongata) Provides Novel Insights into Sex-Related Genes. Comp. Biochem. Physiol. Part D Genom. Proteom. 2022, 42, 100962. [Google Scholar] [CrossRef]
  62. Meng, F.; Sun, S.; Xu, X.; Yu, W.; Gan, R.; Zhang, L.; Zhang, W. Transcriptomic Analysis Provides Insights into the Growth and Maturation of Ovarian Follicles in the Ricefield Eel (Monopterus albus). Aquaculture 2022, 555, 738251. [Google Scholar] [CrossRef]
  63. Li, J.; Bai, L.; Liu, Z.; Wang, W. Dual Roles of PDE9a in Meiotic Maturation of Zebrafish Oocytes. Biochem. Biophys. Res. Commun. 2020, 532, 40–46. [Google Scholar] [CrossRef] [PubMed]
  64. Evsiukova, V.S.; Kulikova, E.A.; Kulikov, A.V. Age-Related Alterations in the Behavior and Serotonin-Related Gene mRNA Levels in the Brain of Males and Females of Short-Lived Turquoise Killifish (Nothobranchius furzeri). Biomolecules 2021, 11, 1421. [Google Scholar] [CrossRef] [PubMed]
  65. Qu, J.; Li, R.; Xie, Y.; Liu, Y.; Liu, J.; Zhang, Q. Differential Transcriptomic Profiling Provides New Insights into Oocyte Development and Lipid Droplet Formation in Japanese Flounder (Paralichthys olivaceus). Aquaculture 2022, 550, 737843. [Google Scholar] [CrossRef]
  66. He, Z.; Deng, F.; Yang, D.; He, Z.; Hu, J.; Ma, Z.; Zhang, Q.; He, J.; Ye, L.; Chen, H.; et al. Crosstalk between Sex-Related Genes and Apoptosis Signaling Reveals Molecular Insights into Sex Change in a Protogynous Hermaphroditic Teleost Fish, Ricefield Eel Monopterus albus. Aquaculture 2022, 552, 737918. [Google Scholar] [CrossRef]
  67. He, L.; Wang, Q.; Jin, X.; Wang, Y.; Chen, L.; Liu, L.; Wang, Y. Transcriptome Profiling of Testis during Sexual Maturation Stages in Eriocheir Sinensis Using Illumina Sequencing. PLoS ONE 2012, 7, e33735. [Google Scholar] [CrossRef]
  68. Pauletto, M.; Milan, M.; Huvet, A.; Corporeau, C.; Suquet, M.; Planas, J.V.; Moreira, R.; Figueras, A.; Novoa, B.; Patarnello, T.; et al. Transcriptomic Features of Pecten Maximus Oocyte Quality and Maturation. PLoS ONE 2017, 12, e0172805. [Google Scholar] [CrossRef]
  69. Sreenivasan, R.; Jiang, J.; Wang, X.; Bártfai, R.; Kwan, H.Y.; Christoffels, A.; Orbán, L. Gonad Differentiation in Zebrafish Is Regulated by the Canonical Wnt Signaling Pathway1. Biol. Reprod. 2014, 90, 45. [Google Scholar] [CrossRef]
  70. Wang, C.; Yang, L.; Xiao, T.; Li, J.; Liu, Q.; Xiong, S. Identification and Expression Analysis of Zebrafish Gnaq in the Hypothalamic–Pituitary–Gonadal Axis. Front. Genet. 2022, 13, 1015796. [Google Scholar] [CrossRef]
  71. Kubo, S.; Black, C.S.; Joachimiak, E.; Yang, S.K.; Legal, T.; Peri, K.; Khalifa, A.A.Z.; Ghanaeian, A.; McCafferty, C.L.; Valente-Paterno, M.; et al. Native Doublet Microtubules from Tetrahymena Thermophila Reveal the Importance of Outer Junction Proteins. Nat. Commun. 2023, 14, 2168. [Google Scholar] [CrossRef]
  72. Hu, L.; Chen, W.; Qian, A.; Li, Y.-P. Wnt/β-Catenin Signaling Components and Mechanisms in Bone Formation, Homeostasis, and Disease. Bone Res. 2024, 12, 39. [Google Scholar] [CrossRef]
  73. Kratzer, M.-C.; England, L.; Apel, D.; Hassel, M.; Borchers, A. Evolution of the Rho Guanine Nucleotide Exchange Factors Kalirin and Trio and Their Gene Expression in Xenopus Development. Gene Expr. Patterns 2019, 32, 18–27. [Google Scholar] [CrossRef] [PubMed]
  74. Beal, A.P.; Martin, F.D.; Hale, M.C. Using RNA-Seq to Determine Patterns of Sex-Bias in Gene Expression in the Brain of the Sex-Role Reversed Gulf Pipefish (Syngnathus scovelli). Mar. Genom. 2018, 37, 120–127. [Google Scholar] [CrossRef]
  75. Wang, M.; Chen, L.; Zhou, Z.; Xiao, J.; Chen, B.; Huang, P.; Li, C.; Xue, Y.; Liu, R.; Bai, Y.; et al. Comparative Transcriptome Analysis of Early Sexual Differentiation in the Male and Female Gonads of Common Carp (Cyprinus carpio). Aquaculture 2023, 563, 738984. [Google Scholar] [CrossRef]
  76. Zhang, Y.; Hattori, R.S.; Sarida, M.; García, E.L.; Strüssmann, C.A.; Yamamoto, Y. Expression Profiles of Amhy and Major Sex-Related Genes during Gonadal Sex Differentiation and Their Relation with Genotypic and Temperature-Dependent Sex Determination in Pejerrey Odontesthes bonariensis. Gen. Comp. Endocrinol. 2018, 265, 196–201. [Google Scholar] [CrossRef] [PubMed]
  77. Ito, L.S.; Yamashita, M.; Takashima, F.; Strüssmann, C.A. Dynamics and Histological Characteristics of Gonadal Sex Differentiation in Pejerrey (Odontesthes bonariensis) at Feminizing and Masculinizing Temperatures. J. Exp. Zoolog. A Comp. Exp. Biol. 2005, 303A, 504–514. [Google Scholar] [CrossRef]
  78. Vandeputte, M.; Piferrer, F. Genetic and Environmental Components of Sex Determination in the European Sea Bass. In Sex Control in Aquaculture; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2018; pp. 305–325. ISBN 978-1-119-12729-1. [Google Scholar]
  79. Vandeputte, M.; Clota, F.; Sadoul, B.; Blanc, M.-O.; Blondeau-Bidet, E.; Bégout, M.-L.; Cousin, X.; Geffroy, B. Low Temperature Has Opposite Effects on Sex Determination in a Marine Fish at the Larval/Postlarval and Juvenile Stages. Ecol. Evol. 2020, 10, 13825–13835. [Google Scholar] [CrossRef] [PubMed]
  80. Anderson, K.C.; Morgan, J.A.T.; Goulden, E.F. A New Sex-Specific Genetic Marker (Fshr 1834G>T) for Flathead Grey Mullet, Mugil cephalus, in Queensland, Australia. Aquac. Rep. 2023, 33, 101858. [Google Scholar] [CrossRef]
  81. Yang, X.; Fisher, D.A.; Cheyette, B.N. SEC14 and Spectrin Domains 1 (Sestd1), Dishevelled 2 (Dvl2) and Dapper Antagonist of Catenin-1 (Dact1) Co-Regulate the Wnt/Planar Cell Polarity (PCP) Pathway during Mammalian Development. Commun. Integr. Biol. 2013, 6, e26834. [Google Scholar] [CrossRef]
  82. Yang, X.; Cheyette, B.N.R. SEC14 and Spectrin Domains 1 (Sestd1) and Dapper Antagonist of Catenin 1 (Dact1) Scaffold Proteins Cooperatively Regulate the Van Gogh-like 2 (Vangl2) Four-Pass Transmembrane Protein and Planar Cell Polarity (PCP) Pathway during Embryonic Development in Mice. J. Biol. Chem. 2013, 288, 20111–20120. [Google Scholar] [CrossRef] [PubMed]
  83. Pagani, F.; Baralle, F.E. Genomic Variants in Exons and Introns: Identifying the Splicing Spoilers. Nat. Rev. Genet. 2004, 5, 389–396. [Google Scholar] [CrossRef]
  84. Law, A.J.; Kleinman, J.E.; Weinberger, D.R.; Weickert, C.S. Disease-Associated Intronic Variants in the ErbB4 Gene Are Related to Altered ErbB4 Splice-Variant Expression in the Brain in Schizophrenia. Hum. Mol. Genet. 2007, 16, 129–141. [Google Scholar] [CrossRef]
  85. Scotti, M.M.; Swanson, M.S. RNA Mis-Splicing in Disease. Nat. Rev. Genet. 2016, 17, 19–32. [Google Scholar] [CrossRef]
  86. Douglas, A.G.L.; Wood, M.J.A. RNA Splicing: Disease and Therapy. Brief. Funct. Genom. 2011, 10, 151–164. [Google Scholar] [CrossRef] [PubMed]
  87. Kwan, T.; Benovoy, D.; Dias, C.; Gurd, S.; Provencher, C.; Beaulieu, P.; Hudson, T.J.; Sladek, R.; Majewski, J. Genome-Wide Analysis of Transcript Isoform Variation in Humans. Nat. Genet. 2008, 40, 225–231. [Google Scholar] [CrossRef]
  88. Paul, D.S.; Soranzo, N.; Beck, S. Functional Interpretation of Non-Coding Sequence Variation: Concepts and Challenges. BioEssays 2014, 36, 191–199. [Google Scholar] [CrossRef] [PubMed]
  89. Lieberman-Aiden, E.; van Berkum, N.L.; Williams, L.; Imakaev, M.; Ragoczy, T.; Telling, A.; Amit, I.; Lajoie, B.R.; Sabo, P.J.; Dorschner, M.O.; et al. Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome. Science 2009, 326, 289–293. [Google Scholar] [CrossRef]
  90. Coulombe-Huntington, J.; Lam, K.C.L.; Dias, C.; Majewski, J. Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals. PLoS Genet. 2009, 5, e1000766. [Google Scholar] [CrossRef]
  91. Mattick, J.S.; Amaral, P.P.; Carninci, P.; Carpenter, S.; Chang, H.Y.; Chen, L.-L.; Chen, R.; Dean, C.; Dinger, M.E.; Fitzgerald, K.A.; et al. Long Non-Coding RNAs: Definitions, Functions, Challenges and Recommendations. Nat. Rev. Mol. Cell Biol. 2023, 24, 430–447. [Google Scholar] [CrossRef]
  92. Vandeputte, M.; Dupont-Nivet, M.; Chavanne, H.; Chatain, B. A Polygenic Hypothesis for Sex Determination in the European Sea Bass Dicentrarchus labrax. Genetics 2007, 176, 1049–1057. [Google Scholar] [CrossRef]
  93. Faggion, S.; Vandeputte, M.; Chatain, B.; Gagnaire, P.-A.; Allal, F. Population-Specific Variations of the Genetic Architecture of Sex Determination in Wild European Sea Bass Dicentrarchus labrax L. Heredity 2019, 122, 612–621. [Google Scholar] [CrossRef] [PubMed]
  94. Geffroy, B.; Besson, M.; Sánchez-Baizán, N.; Clota, F.; Goikoetxea, A.; Sadoul, B.; Ruelle, F.; Blanc, M.-O.; Parrinello, H.; Hermet, S.; et al. Unraveling the Genotype by Environment Interaction in a Thermosensitive Fish with a Polygenic Sex Determination System. Proc. Natl. Acad. Sci. USA 2021, 118, e2112660118. [Google Scholar] [CrossRef] [PubMed]
  95. Chen, S.; Zhang, G.; Shao, C.; Huang, Q.; Liu, G.; Zhang, P.; Song, W.; An, N.; Chalopin, D.; Volff, J.-N.; et al. Whole-Genome Sequence of a Flatfish Provides Insights into ZW Sex Chromosome Evolution and Adaptation to a Benthic Lifestyle. Nat. Genet. 2014, 46, 253–260. [Google Scholar] [CrossRef] [PubMed]
  96. Navarro-Martín, L.; Viñas, J.; Ribas, L.; Díaz, N.; Gutiérrez, A.; Croce, L.D.; Piferrer, F. DNA Methylation of the Gonadal Aromatase (Cyp19a) Promoter Is Involved in Temperature-Dependent Sex Ratio Shifts in the European Sea Bass. PLoS Genet. 2011, 7, e1002447. [Google Scholar] [CrossRef]
  97. Martínez, J.; Leonelli, F.E.; García-Ladona, E.; Garrabou, J.; Kersting, D.K.; Bensoussan, N.; Pisano, A. Evolution of Marine Heatwaves in Warming Seas: The Mediterranean Sea Case Study. Front. Mar. Sci. 2023, 10, 1193164. [Google Scholar] [CrossRef]
  98. Pastor, F.; Valiente, J.A.; Khodayar, S. A Warming Mediterranean: 38 Years of Increasing Sea Surface Temperature. Remote Sens. 2020, 12, 2687. [Google Scholar] [CrossRef]
  99. Pisano, A.; Marullo, S.; Artale, V.; Falcini, F.; Yang, C.; Leonelli, F.E.; Santoleri, R.; Buongiorno Nardelli, B. New Evidence of Mediterranean Climate Change and Variability from Sea Surface Temperature Observations. Remote Sens. 2020, 12, 132. [Google Scholar] [CrossRef]
Figure 1. Sampling sites of M. cephalus (ORB: Orbetello Lagoon; CAB: Cabras Lagoon; TOR: Tortolì Lagoon; KAV: Kavala) from Ferraresso et al. [18].
Figure 1. Sampling sites of M. cephalus (ORB: Orbetello Lagoon; CAB: Cabras Lagoon; TOR: Tortolì Lagoon; KAV: Kavala) from Ferraresso et al. [18].
Animals 15 02445 g001
Figure 2. Scatterplot of DAPC analysis in the M. cephalus using 2,835,032 SNPs. Clusters and inertia ellipses for each sample are shown in different colours (ORB: Orbetello; CAB: Cabras; TOR: Tortolì Lagoon; KAV: Kavala).
Figure 2. Scatterplot of DAPC analysis in the M. cephalus using 2,835,032 SNPs. Clusters and inertia ellipses for each sample are shown in different colours (ORB: Orbetello; CAB: Cabras; TOR: Tortolì Lagoon; KAV: Kavala).
Animals 15 02445 g002
Figure 3. Diagram showing the allele frequency of the allele of the heterogametic sex (Y or W) for the four SNPs meeting the sex-association criteria (FIS < −0.5 and FST > 0.3) in the Aegean population.
Figure 3. Diagram showing the allele frequency of the allele of the heterogametic sex (Y or W) for the four SNPs meeting the sex-association criteria (FIS < −0.5 and FST > 0.3) in the Aegean population.
Animals 15 02445 g003
Figure 4. Diagram showing the genotypes of males and females for the SNPs showing FIS < −0.5 and FST > 0.3 in the Tyrrhenian population.
Figure 4. Diagram showing the genotypes of males and females for the SNPs showing FIS < −0.5 and FST > 0.3 in the Tyrrhenian population.
Animals 15 02445 g004
Table 1. Main features of SNPs associated with sex in the Aegean population. AN (total number of alleles called), AC (allele count in genotypes), AF (allele frequency), MDP (mean read depth) and NA (not available).
Table 1. Main features of SNPs associated with sex in the Aegean population. AN (total number of alleles called), AC (allele count in genotypes), AF (allele frequency), MDP (mean read depth) and NA (not available).
Aegean PopulationFemaleMale
CIBA_Chr.ContigPos. in bpANACAFANACAFSystemLocationGeneMDPp-Value
18contig_3267427998320040160.4XYIntergenicNA26.783.2 × 10−5
16contig_761372832140.43754020.05ZWIntergenicNA187.926.2 × 10−5
19contig_9481665937320040150.375XYIntergenicNA24.753 × 10−5
12scaffold_2829412177032110.343754000ZWIntronicsestd121.144.6 × 10−5
9scaffold_5201510136532120.3754010.025ZWIntergenicNA25.690.000342
Table 2. Main features of SNPs associated with sex in the Tyrrhenian population. AN (total number of alleles called), AC (allele count in genotypes), AF (allele frequency), MDP (mean read depth) and NA (not available).
Table 2. Main features of SNPs associated with sex in the Tyrrhenian population. AN (total number of alleles called), AC (allele count in genotypes), AF (allele frequency), MDP (mean read depth) and NA (not available).
Tyrrhenian PopulationFemaleMale
CIBA_Chr.ContigPos. in bpANACAFANACAFSystemLocationGeneMDPp-Value
12contig_1367584196320040150.375XYIntronickalrn_214.395.2 × 10−5
11contig_2266559320040150.375XYIntergenicNA182.312 × 10−5
21contig_285837872032120.3754010.025ZWIntergenicNA6.280.000288
3contig_434470899032110.3444000ZWIntergenicNA194 × 10−6
16contig_57247384932130.4064020.05ZWIntronicmpp324.610.000268
8contig_892447709632140.4384000ZWIntronicgnaq_220.610
19contig_9485446550320040150.375XYIntergenicNA11.581.4 × 10−5
5scaffold_22651047690832120.3754010.25ZWIntroniclimch1_216.690.000304
12scaffold_2829412177032110.3444000ZWIntronicsestd120.082.4 × 10−5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Racaku, M.; Ferraresso, S.; Babbucci, M.; Blanco, A.; Tsigenopoulos, C.S.; Manousaki, T.; Radojicic, J.; Papadogiannis, V.; Martínez, P.; Bargelloni, L.; et al. Complex Sex Determination in the Grey Mullet Mugil cephalus Suggested by Individual Whole Genome Sequence Data. Animals 2025, 15, 2445. https://doi.org/10.3390/ani15162445

AMA Style

Racaku M, Ferraresso S, Babbucci M, Blanco A, Tsigenopoulos CS, Manousaki T, Radojicic J, Papadogiannis V, Martínez P, Bargelloni L, et al. Complex Sex Determination in the Grey Mullet Mugil cephalus Suggested by Individual Whole Genome Sequence Data. Animals. 2025; 15(16):2445. https://doi.org/10.3390/ani15162445

Chicago/Turabian Style

Racaku, Mbarsid, Serena Ferraresso, Massimiliano Babbucci, Andres Blanco, Costas S. Tsigenopoulos, Tereza Manousaki, Jelena Radojicic, Vasileios Papadogiannis, Paulino Martínez, Luca Bargelloni, and et al. 2025. "Complex Sex Determination in the Grey Mullet Mugil cephalus Suggested by Individual Whole Genome Sequence Data" Animals 15, no. 16: 2445. https://doi.org/10.3390/ani15162445

APA Style

Racaku, M., Ferraresso, S., Babbucci, M., Blanco, A., Tsigenopoulos, C. S., Manousaki, T., Radojicic, J., Papadogiannis, V., Martínez, P., Bargelloni, L., & Patarnello, T. (2025). Complex Sex Determination in the Grey Mullet Mugil cephalus Suggested by Individual Whole Genome Sequence Data. Animals, 15(16), 2445. https://doi.org/10.3390/ani15162445

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop