Next Article in Journal
Effect of Holstein Genotype and Energy Balance on Lipids, Carnitine and Short-Chain Carboxylic Acids in Milk During Early Lactation
Previous Article in Journal
Universal Paediatric and Newborn Screening for Familial Hypercholesterolaemia—Challenges and Opportunities: An Australian Perspective
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integration of Global Lipidomics and Gonad Histological Analysis via Multivariate Chemometrics and Machine Learning: Identification of Potential Lipid Markers of Ovarian Development in the Blue Mussel (Mytilus edulis)

by
Vincenzo Alessandro Laudicella
1,2,*,
Stefano Carboni
3,4,
Cinzia De Vittor
2,
Phillip D. Whitfield
5,6,
Mary K. Doherty
5 and
Adam D. Hughes
1
1
Scottish Association for Marine Sciences, Dunstaffnage Marine Laboratory, Oban PA34 1QA, UK
2
National Institute of Oceanography and Applied Geophysics (OGS), 34010 Trieste, Italy
3
Institute of Aquaculture, Faculty of Natural Sciences, University of Stirling, Stirling FK9 4LA, UK
4
International Marine Center Foundation, Località Sa Mardini, 09170 Oristano, Italy
5
Division of Biomedical Sciences, Centre for Health Sciences, University of the Highlands and Islands, Inverness IV2 3JH, UK
6
Glasgow Polyomics, College of Medical, Veterinary and Life Sciences, Garscube Campus, University of Glasgow, Glasgow G61 1QH, UK
*
Author to whom correspondence should be addressed.
Lipidology 2025, 2(1), 5; https://doi.org/10.3390/lipidology2010005
Submission received: 16 December 2024 / Revised: 3 February 2025 / Accepted: 3 March 2025 / Published: 10 March 2025

Abstract

:
Background/Objectives: Gonad histological analysis (GHA) is the traditional method for assessing the gonad maturation status of blue mussels (Mytilus edulis). GHA has some operational disadvantages, such as limited processing outputs, subjectivity in the assessment of transitional stages of gonadal maturation and the need for experienced and trained operators. Lipids could become important indicators of gonadal maturation as they cover many essential functions during such processes in mussels. In this work, blue mussel ovary (BMO) ultrastructure is integrated with liquid chromatography coupled with mass spectrometry (LC-MS) lipidomics fingerprinting to identify suitable markers for ovarian maturation through the application of chemometrics and machine learning approaches. Methods: BMOs are classified here as ripe or non-ripe by means of GHA and the gamete volume fraction (GVF). Receiving operating characteristic (ROC) curves were used to classify the results of the different statistics according to their area under the curve (AUC), and the functional role of important lipids was assessed by lipid ontology enrichment (LiOn) analysis. Results: This approach allowed for the selection of a panel of 35 lipid molecules (AUC > 0.8) that can distinguish non-ripe from ripe BMOs. Ceramide phosphoethanolamine (CerPE) 40:2 was the molecule with the highest classification ability (AUC 0.905), whereas glycerophosphoserine (PS) was the class mostly changing between the two groups. LiOn analysis indicated significant differences in the functional roles of these lipids, highlighting enrichment terms associated with membrane lipids, lysosomes and highly unsaturated triglycerides (TGs) in non-ripe ovaries, whereas terms associated with storage lipids and low-saturated TG characterised ripe BMOs.

Graphical Abstract

1. Introduction

Mussels are important species for marine ecosystems as they are recognised as habitat engineers and provide valuable ecosystem services for coastal areas and their economies [1,2]. Mussels are valuable aquaculture resources characterised by an extremely low environmental footprint [3,4] and high nutritional properties [5], and their demand as food products is projected to increase in the future’s net-zero society [6]. Unfortunately, despite their prospective importance, mussel production in Europe has stagnated since 1999 [7], and the expansion of the mussel aquaculture sector is hampered by various factors, including climate change [8]. Therefore, it is important to increase our knowledge of mussel physiology and biochemistry and to identify further useful tools to monitor and predict the ability of mussels to adapt and thrive in future environmental scenarios.
Gonadal development is the main physiological process that runs through the annual life cycle of temperate mussels [9]. Blue mussels (Mytilus edulis) have an annual reproductive cycle with an accumulation of energy resources in late summer and autumn, a gonadal maturation process that extends through winter, and one or more spawning events in spring and early summer [10,11,12,13,14]. During maturation, gonads proliferate on the mantle, which undergoes a series of ultrastructural and biochemical reorganisations. In mature mussels, the mantle can account for 50% of the total body dry weight [14]. In resting females, the mantle consists of a single-layered cuboidal epithelium that is ciliated on the inner surface and composed of adipogranular (ADG) and vesicular connective tissue (VCT) cells [15]. ADG and VCT cells support oocyte development by providing glycogen and lipids, respectively [12]. The oocytes increase in numbers and mature, accumulating lipid droplets that form the yolk reserves of the embryos [16]. Glycogen is consumed during the development process, while lipids are accumulated in the oocytes [17,18]. This process leads to the annual accumulation patterns of lipids in several bivalve species, including Magallana gigas [19,20,21], Ostrea edulis [22], M. edulis [17,23], Mytilus galloprovincialis [24], Modiolus barbatus [25], Mimachlamys crassicostata [26], Ruditapes decussatus and Ruditapes philippinarum [27].
Detailed information about the maturation of the gonads is important for the industrial production of mussels in hatcheries, where the mussels are artificially conditioned (so-called broodstock conditioning) to maximise gamete production. Mussels do not store their oocytes for prolonged periods of time, as it is their physiology to partially or completely spawn within a few weeks of sexual maturity [28]. Missing the right time window for gonad development can lead to unnecessary resource consumption, uncontrolled spawning or the occurrence of pre-spawning atresia [28,29,30].
Various histological, biochemical and molecular methods have been used to assess gonadal maturation in bivalves. Historical approaches to study gonad maturation in mussels included the use of allometric ratios as a measure of condition. The condition index, which measures the ratio of soft tissue to shell [31], and the gonadosomatic index, which measures the ratio of mantle to soft tissue [32], are two commonly cited examples in the literature. Traditional techniques also include gonadal smears and gonadal histological analysis (GHA) [33,34]. The fastest method to obtain information on the gonadal status of mussels is the gonadal smear, although the loss of tissue structure due to sample preparation makes it difficult to distinguish between different stages of maturity. The ultrastructure of the tissue is well preserved in the GHA, making it the most important technique for analysing the reproductive cycle in bivalves [10,12,30,34,35]. The limitations of GHA are the limited sample size (e.g., the number of samples that can be processed per time unit), the difficulty in distinguishing transitional stages of development, and a certain degree of subjectivity that requires experienced operators in classifying the degree of development of gonads [36,37]. GHA has also been coupled with stereological indices, leading to the development of several quantitative indices of gonadal maturation, such as gamete volume fraction (GVF) or oocyte cytoplasmic area, overcoming some of the limitations of classical GHA [11,38,39,40,41]. Hines and co-workers compared classical histological approaches with metabolic and molecular techniques and showed that the latter two methods are advantageous in distinguishing sex in dormant or spent mussels [42].
Lipids play an important role in mussel gonad maturation, as several authors have reported direct changes in lipid composition as a result of the maturation process [18,21,43,44,45]. Triglycerides (TGs) were the major lipid class that changed during gonadal development in the ovaries of both Pecten maximus and M. gigas, accumulating in mature individuals [21,43]. Martinez-Pita and co-workers found no changes in the composition of lipid classes between the different stages of ovarian development in M. galloprovincialis and Donax trunculus [18,44]. When analysing fatty acids (FA), these authors observed an increase in FA18:4n-3 in the mature females of both species, and of FA18:0, FA18:3n-3 and docosahexaenoic acid (DHA, FA22:6n-3) specifically in mature female mussels. The complexity of observing a direct effect of gamete development on gonad lipid composition may be related to the seasonal behaviour of the gonad development in bivalves, which tends to span over several months [11,12,13,32]. Furthermore, gonads are also a non-uniform tissue, as several developmental stages (or regressions) can occur simultaneously in the same organism [18,46], which greatly increases the difficulty of such a task.
Lipidomics and the use of liquid chromatography coupled with mass spectrometry (LC-MS) could offer several advantages over conventional lipid analysis techniques when analysing a complex tissue such as bivalve gonads. First, the study of lipids at the molecule level combines information from FA and lipid class composition analysis, revealing patterns that are otherwise lost when relying on only one of these two traditional lipid profiling techniques [47]. Second, modern MS platforms are characterised by higher sensitivity than traditional detectors used in lipid class analysis, providing an opportunity to evidence and extend existing knowledge on the role covered by minor, uncommon and poorly understood lipid classes in the ovarian maturation of mussels. To date, rare are the studies investigating gamete development in marine invertebrates using LC-MS. One of the first examples of this type of study comes from Chansela and coworkers who investigated the lipid composition of Paeneus merguirensis ovaries using a mass spectrometry imaging approach [48]. The authors showed how follicles from the same histological preparation, but with different maturation stages, differed in their composition of major phosphocholine (PC) and TG molecular species. The results of this study demonstrate the presence of lowly unsaturated PC species (PC 32:1) and highly unsaturated TGs (58:9) especially in early-development gonadal areas. However, PC and TG molecular species associated with essential polyunsaturated FA such as eicosapentaenoic acid (EPA, FA20:5n-3), arachidonic acid (AA, FA20:4n-6) and DHA were only abundant in mature follicles.
More recently, the lipidome of the gonads of M. edulis has been characterised in detail, focusing on sex-specific differences between testis and ovaries [49]. In the present study, we aimed to build on this previous work and continue here by analysing the blue mussel ovarian (BMO) lipidome by integrating GHA with LC-MS global untargeted lipidomics through a comprehensive statistical approach, incorporating unsupervised and supervised multivariate chemometrics and machine learning (RandomForest—RF [50]), applied here to highlight potential lipid markers that distinguish between histologically ripe (stage III) and non-ripe BMOs (stages I–II).

2. Materials and Methods

2.1. Mussel Collection and Sample Preparation

Commercial-size blue mussels (Mytilus edulis, average shell size: 5.6 ± 0.6 cm) were obtained from local growers on the West Coast of Scotland (Inverlussa Marine Services—www.inverlussa.com, Loch Spelve, Isle of Mull, UK) in February 2017. The mussel rearing and processing (sizing, feeding, aquarium rearing conditions) used in this study has been described in a previous publication [49], and details are provided in Method Supplement S1. Each adult was opened ‘book-like’ by cutting the hinge and abductor muscle with a scalpel and carefully removing the gills with bow scissors. The left valve mantle lobe was dissected and placed in a 1.5 mL tube (Eppendorf), snap-frozen in liquid nitrogen and stored at −80 °C for lipid analysis. The right mantle lobe was excised and stored in 7 mL Bijoux tubes (VWR) in 4% buffered seawater formalin for GHA.

2.2. Histological Analyses

The sex of each adult animal was determined by GHA after haematoxylin and eosin (H&E) staining, the standard method for sex determination and assessment of gonadal maturation [30,35,42,51]. H&E-stained slides were photographed with an Axioshop 2 inverted microscope equipped with an AxioCam MRc5 camera (CarlZeiss, Cambridge, UK) and acquired with the corresponding AxioVision 4.7 software. GHA scores were assigned according to the ultrastructure of the gonads using a 4-staging classification system [10,12,52]. The reference samples used in the gonad classification system are shown in Method Supplement S2a. Three different sections (anterior, medial and posterior) of the mantle of each mussel were scored to distinguish between ripe (stage III, N = 25) and non-ripe (stages I-II, N = 26) ovaries. A distinction between vitellogenic and atretic/degenerating gonads was made according to the criteria reported in [16,28] as described in Method Supplement S2b. When different stages occurred simultaneously in a single section, the staging decision was based on the condition of the largest part of the preparation [18]. A stereological variable with which to quantify the percentage of mantle area covered by gametes (gamete volume fraction—GVF) was calculated through an image analysis of the GHA preparations using ImageJ software (version 1.52r; www.imagej.nih.gov) via the colour threshold method (see Method Supplement S2c for details). The GVF was calculated as the percentage between the total mantle surface area and the mantle surface area covered with gametes. The relationship between GHA and the GVF is shown in Appendix A (Figure A1). All GHA and GVF scores are provided in Supplementary Data S1.

2.3. Lipid Extraction

Frozen and dissected BMOs were freeze-dried (ẞ18 LO plus, Christ) and ground to a fine powder in liquid nitrogen; ≈0.05 g samples of dry BMO powder were individually homogenised in HPLC water (Fisherbrand) for 1 min in glass potters (1:8 weight–volume) on ice. Aliquots of the homogenate (20 µL) from each gonad were pooled and used to form the biological quality control (BQC) samples. Lipid extraction was performed according to [53] as described in [49]. Total lipids were extracted from 50 µL of BMO homogenate and BQC with 6 mL of 2:1 chloroform–methanol (both Fisherbrand) in pre-combusted (450 °C, 5 h) 20 mL screw-cap vials (Pyrex). The mixture was shaken vigorously, each vial was flushed with nitrogen, and the mixture was then extracted on ice for 1 hr under dark conditions. The organic extract was separated from the pellet by centrifugation (350× g, 5 min, 4 °C) and collected in a clean tube. The pellet was re-extracted with a additional 3 mL of chloroform–methanol. A 0.9% KCl solution (VWR) was added to the organic extract to obtain a final ratio of 2:1:0.8 chloroform–methanol–KCl, and the polar and organic phases were separated after a second centrifugation step (350× g, 5 min, 4 °C). The organic layer was recovered and evaporated under nitrogen flow (NVap, Organomation). The dried lipid extracts were weighed (0.0001 g accuracy) and resuspended in 0.5 mL chloroform, yielding the BMO total lipid extract (TLE). The fatty acid analysis of the BMOs is reported in Supplementary Data S2a.

2.4. LC-MS Untargeted Lipidomics

Global lipidomics analyses of BMO TLE were performed by liquid chromatography coupled with high-resolution mass spectrometry (LC-MS) via a binary ultra-high pressure liquid chromatography (UHPLC) system (1250 Accela, ThermoFisher) connected to an electron spray ionisation (ESI) source and an Orbitrap mass analyser (Exactive, ThermoFisher). Separation was performed on a C18 Hypersil Gold, with a 100 × 2.1 mm column and 1.9 nm particle size (ThermoFisher, Waltham, MA, USA), maintained at 50 °C. The binary solvent system comprised a constant flow rate of 400 μL min−1 with a gradient as described in Supplementary Table S1. Water and acetonitrile were HPLC-grade and purchased from Fisherbrand; isopropanol was LC-MS-grade (Hypergrade LiChrosolv, Merck, Gilligham, UK), while ammonium formate and formic acid were both LC-MS-grade and purchased from Sigma-Aldrich (St. Louis, MI, USA). TLEs were resuspended in 3:1 methanol–chloroform at a concentration of 1 mg mL−1 for a 3 μL injection volume.
All samples were analysed in positive (POS) and negative (NEG) ion mode over the m/z range of 250–2000 Da at a resolution of 100,000 FWHM and a scan speed of 1 Hz. Mass error was kept below 5 ppm by routine calibrations for both polarities using a calibration solution (Pierce™ LTQ ESI calibration solutions, ThermoFisher). Chromatograms and mass spectra were reviewed and integrated using Xcalibur software (version 3.2, ThermoFisher, Waltham, MA, USA).

2.5. Data Processing

Raw LC-MS data were processed using Progenesis QI (Nonlinear Dynamics, Waters). Lipids were identified using the exact mass of the precursor ion (MS’, 5 ppm Δm/z), and with the exception of CAEPs (ceramide aminoethyl phosphonates [54]), lipid IDs were reported according to the LIPID MAPs nomenclature standard system [55,56]. Raw data from LC-MS lipid adducts for POS and NEG were experimentally evaluated using the lipid standard mixture (Supplementary Table S2) purchased from Avanti Polar Lipids (www.avantilipids.com) and Matreya (www.matreya.com). Lipid IDs were obtained by searching for exact masses in the lipid mass spectra libraries LIPID MAPS (www.lipidmaps.org—accessed on 10 October 2024), the Human Metabolome Data Base (HMDB) (www.hmdb.ca—accessed on 10 October 2024) and Metlin (www.metlin.scripps.edu—accessed on 13 March 2020), as well as from an in-house lipid database derived from lipidomics studies on bivalves and other marine invertebrates, which served as a tool for the identification of unusual marine lipids [54,57,58,59,60,61,62,63,64,65]. Isomeric lipid molecules resolved by reverse phase separation (same exact mass of 5 Δppm, but with a different retention time) are distinguished with a lowercase letter after the lipid ID (i.e., PC 38:5a and PC 38:5b). The full set of lipids identified in the BMO and used in this study is included in Supplementary Data S1. All LC-MS raw data files and experimental metadata were deposited in the Metabolights [66] data repository (www.ebi.ac.uk/metabolights/MTBLS8492—accessed on 10 October 2024) and are available under study identifier MTBLS8492.

2.6. Data Analysis

Statistical analysis was performed with the statistical platform R (version 4.2.0). Data are presented as mean ± standard deviation unless otherwise stated. Statistical differences were considered significant at p < 0.05.
Peak intensity tables (PITs), created from the LC-MS data obtained by POS and NEG ionisation modalities, were manually sorted to remove contaminant masses. Further data filtering included removing features with >30% missing values, with the remaining missing values replaced with a small value (half of the minimum intensity value) and characteristics with low repeatability removed by filtering using BQC and the interquartile range. The filtering of the raw data was performed using the filtering functions of the R-based package ‘MetaboAnalystR’ [67]. At this stage, POS and NEG data were merged into a single dataset to avoid redundancy in the analysis. Duplicate lipids (i.e., observed in the POS and NEG data) were manually checked for their retention time between the different samples; if they matched, the ionisation polarity that gave the best results in terms of signal for that specific lipid class was retained, assessed using as reference the lipid standard mixture.
A PIT was autoscaled (mean centred and divided by each lipid standard deviation), and the preliminary approach used to highlight lipids that differ between ripe and non-ripe BMOs involved principal component analysis (PCA) obtained from the R package ‘factoextra’ [68] as an unsupervised statistical model [69]. This approach was used to highlight the presence of outliers; a brief discussion of the results can be found in Appendix A. Other important markers were identified with supervised statistical methods on Pareto-scaled data (mean centred and divided by the square root of the standard deviation of each lipid). The Pareto scaling of the data was used to include information regarding molecule abundances that would otherwise be lost by autoscaling [70]. The first approach used in biomarker discovery involved the identification of significant variables via volcano plots. A volcano plot was calculated via the R package ‘MetaboAnalystR’ [67] and plotted via the package ‘EnhancedVolcano’ [71]. For the volcano plot, the lipids were filtered by a fold change (FC) > 1.3 following a Wilcoxon rank-sum test with a false discovery rate (FDR [72]) and adjusted p-value < 0.05. Orthogonal partial least squares discriminant analysis (OPLS-DA, [73]), from the ‘MetaboAnalystR’ package, was the second supervised method used to identify lipids associated with the two gonad maturation conditions. The performance of OPLS-DA was assessed by 10-fold leave-one-out cross-validation (LOOCV), and model fit was tested for using a 2000-fold permutation test (see Appendix A [74]). Important lipids were identified in a Sigma plot, and lipids characterised by a p(corr) < −0.6 and >0.6 were retained and considered important in distinguishing ripe and non-ripe BMOs. RandomForest (RF) [50] was the last method used here. RF is a machine learning algorithm that was trained as shown in [75] and was obtained from the package ‘RandomForest’ [76]. In this study, RF was used as a classification method (RFclass) using the grouping variables (ripe/non-ripe) and as a regression method (RFreg) considering the GVF. In both cases, the best 10% of RF classification (lowest out-of-bag error rate—OOB-ER) and RF regression (highest percentage of variability explained—PVE) were considered important markers. A backward purging approach for the top 5% of lipids for both RFclass and RFreg was used to identify the most important molecules detected by the two approaches (see Method Supplement S2d for details on model training). The results of all four statistical approaches were compared using a Venn diagram and ranked according to their ROC (receiving operating characteristic) curves and AUC (area under the curve) values. ROC and AUC values were calculated using the biomarker discovery function of the MetaboAnalyst package. Details of the full ROC and AUC analyses of all biomarkers identified in this section can be found in Table A1 (Appendix A).
Lipid ontology (LiOn) enrichment analysis terms were calculated and analysed via the LiOn web platform [77]. PCA of LiOn terms was visualised via GO-PCA [78] and presented as a heatmap plot using Euclid as the distance matrix and Complete as the clustering algorithm. Enriched LiOn terms were ordered by p-value (calculated with a t-test), and terms were considered significantly different if the FDR-adj p-value was <0.05 [72].

3. Results and Discussion

In this study, a total of 51 BMOs were analysed using LC-MS global untargeted lipidomics. Ovaries were categorised as ripe (N = 25) or not ripe (N = 26) based on their GHA scores and GVF (Supplementary Data S1). Stereological analyses such as of the GVF are less prone to subjectivity bias than GHA, especially when assessing histological preparation, which indicates the presence of different gonadal stages in the same tissue [79]. In contrast, gonadal atresia and gamete degeneration are hardly recognised when using GVF alone [28]. Therefore, the two approaches offer the best performance in combination. GVF increased significantly in ripe BMOs, providing higher confidence in the criteria used for GHA classification (Figure A1).
During ovarian maturation, the area covered by follicles increases due to oocyte proliferation, with the volume covered by energy support cells decreasing [10,15,16]. As the oocytes proliferate, lipids are accumulated in the gonads at the expense of glycogen content [17,18,40]. This usually leads to the reported strong seasonal fluctuations in the major lipid classes of bivalves [19,23,27,43,80]. Here, BMO lipidomics profiles were integrated with GHA and the GVF through various chemometric and machine learning approaches to identify specific lipid molecules associated with the maturation process of BMOs.

3.1. Multivariate Chemometrics

Our investigation started with the application of a multivariate unsupervised chemometric approach such as PCA (Figure A2). As an unsupervised approach (e.g., samples are plotted without ‘a priori’ consideration of grouping variables), PCA does not suffer from overfitting problems; however, PCA can be difficult to interpret when distinguishing highly similar and heterogeneous groups with small multivariate differences, with a large number of features and a small amount of variability explained by the main axes [81]. Selecting the correct orthogonal axes that best explain the differences between ripe and immature BMO was problematic through PCA, because a high percentage of variability is explained by the first four pairs of principal components of the PCA model (>10%, Figure A2A). Indeed, in all the PCA plots showing the combination of the main five principal components, the two BMO groups did not evidence marked differences in their lipidomic profiles, which is an expected observation given the heterogeneous origin of the different gonad samples considered in this study (see Method Supplement S1).
To highlight specific lipids linked to the two BMO groups, we continued the biomarker discovery approach using more powerful supervised statistical models such as volcano plots, OPLS-DA and RF. Through volcano analysis, important lipids distinguishing ripe from undeveloped BMOs were highlighted based on their fold change (FC; here, 1.3 was considered the cut-off value) and their statistical significance (FDR p < 0.05, calculated according to a non-parametric Wilcoxon rank test). The resulting plot is shown in Figure 1A and led to the selection of 25 significant features (Table A1). Volcano plots are a quick and reliable approach for identifying markers that define a particular condition, although this statistical approach is limited to pairwise comparisons and often lacks statistical power, especially when analysing typical omics, datasets where the number of variables is far greater than the number of observations. In these cases, correction factors for the false discovery rate are usually applied, which reduce the statistical power of the test [82,83], and indeed, volcano analysis turned out to be one the most restrictive (lowest number of important feature highlighted) methods discussed in this work.
OPLS-DA is an extension of partial least squares regression and an approach characterised by a fast computation time and a decomposition of the variance of the samples into uncorrelated (orthogonal to the groups, orthogonal T-score) and correlated (predictive, T-score) to the grouping factor [73]. The results of OPLS-DA can be found in Figure 1B and show that 10.2% of the variance within our observation is correlated with the two BMO maturation classes (T-score). OPLS-DA as a supervised method may suffer from overfitting problems, especially when the number of observations is small. Therefore, an assessment of model performance (cross-validation) and permutation tests (time-consuming) are required to obtain a measure of model fit [74]. Model performances and fitting were evaluated, respectively, with a 10-fold LOOCV (predicted: R2x: 0.102, R2y: 0.5, Q2: 0.363; orthogonal: R2x: 0.167, R2y: 0.17, Q2: 0.07; Figure A3A) and 2000-fold permutation (Q2: 0.436 p < 0.001 and R2y: 0.669 p < 0.05, Figure A3B). From the resulting S-Plot, a set of 40 important lipids (Table A1) correlated with the grouping variable (ripe and non-ripe BMOs) were selected, setting an arbitrary threshold for the p(corr) value < −0.6 and >0.6 (Figure 1C).
The final approach used in this study was RF, a machine learning method that creates a series of classification (or regression) trees (a ‘forest’) by continuously bootstrapping the sample set. For each bootstrapping, the dataset is split into a test sample (66% of the samples) and an out-of-bag sample (training data with 33% of the observations). As a bootstrapping method, RF is also robust to several attributes that may constrain other statistical approaches (e.g., zero values, normality and homoscedasticity assumptions), while in classification mode, it assumes a balanced number of observations [50,75], and compared to the previous statistical approaches, RF is far more computationally intensive [75]. From the training data, the algorithm recursively selects a random set of variables whose ability to predict a categorical response (ripe or non-ripe BMO) or its correlation with a continuous response variable (such as the GVF) is evaluated using training data and tested on OOB samples.
Partitioning is based on the identification of the predictor that minimises either the within-group variance (categorical response variable) or the error in the regression against the response, in the case of a continuous response variable [50,75]. The optimal predictor becomes the best node of the forest, and the process is repeated for all the number of trees specified in the function. In RF classification (RFclass), the variables with the lowest OOB error rate (OOB-ER) are the best predictors for the categorical variables, with the lower OOB-ER indicating the higher predictive significance of a given molecule. In RF regression (RFreg), the importance of each predictor is evaluated as the proportion of variation explained (PVE), a measure of the reduction in the predictive ability of the tree when that variable is removed. A backward purging (BP) approach is used to select the most important features in the RF. BP consists of performing multiple RFs on the most informative subset of variables (here, the top 5% of features were selected from both RFclass and RFreg, Figure 2) and recording the performance of each variable (in terms of OOB-ER or PVE). The results of the BP process of the RFclass data showed that the saturated PC 28:0 (OOB-ER 0.12) was the best predictor of the ripe/non-ripe condition (Figure 2A). As for RFreg, the BP process yielded unsaturated phosphatidic acid (PA) 44:4 (PVE 0.70, Figure 2B), followed by saturated ether-linked PC such as PC(O-30:1/P-30:0), PC(O-30:0) and PC(30:0), all with a PVE of 0.69 like most important molecules. All the saturated plasmanyl/plasmenyl and diacyl PC molecules and PA(44:4) are included in the panel of 35 lipids reported in Table 1, and will be discussed later in the text. In the next sections, we compare the different roles of important lipids (through LiOn enrichment analysis) and the results of the four different statistical approaches here described. In aiming to maintain the maximum amount of meaningful information [75], it was agreed on to select the top 10% of molecules (with a threshold over which model performances substantially improved; see Method Supplement S2d) as important for RF (≈83 molecules for RFclass and RFreg, Table A1).

3.2. Comparison of the Outputs Obtained with the Different Statisatical Approaches

The four different statistical models applied in this study allowed for the selection of a total of 123 relevant lipid molecules (respectively, 25, 40, 72 and 92 via volcano plot, OPLS-DA, RFclass and RFreg, Table A1) differentiating ripe from non-ripe BMOs from 838 molecules present in the PIT (Supplementary Data S2b). It should be noted that none of the PCs, glycerophosphoethanolamines (PEs), glycerophosphoinositols (PIs) and triglycerides (TGs) detected by the present statistical approach was a major component of the blue mussel gonad lipidome [49], supporting also the previous observation made on the PCA plots.
Key features highlighted by the different approaches are compared using a Venn diagram (Figure 3). RFreg and RFclass yielded the largest number of unique molecules (respectively, 41 and 12 molecules, i.e., 33.3% and 9.8% of the total), while only four variables were exclusive to OPLS-DA (3.3% of the total), and two variables were detected only by volcano analysis (1.6%). Ten variables were common for all four approaches (8.1%). The comparison between classification (volcano, OPLS-DA and RFclass) and regression approaches (RFreg) shows that 51 (41.5% of the total) molecules (out the 92 considered important and highlighted by RFreg) were common with one of more of the models here evaluated. This, again, indicates that there was a large consensus between the GVF and the classification criteria used to define the two BMO groups.
The functional role of the lipids that differentiated undeveloped from ripe BMOs was investigated via LiOn enrichment analysis (Figure 4). Figure 4A shows the PCA for LiOn terms, which reveals a clear distinction of the function of those molecules since 84% of the samples (43/51) could be correctly clustered according to the functional role of the highlighted lipids. Terms related to storage lipids, lipid bodies and TGs were enriched in ripe BMOs, which could be attributed to the vitellogenesis and accumulation of lipid bodies in the yolk granules [16]. Interestingly, TG species, identified by the various statistical methods, showed an inverse pattern according to their unsaturation levels: TG molecules bearing highly unsaturated acyl residuals (DB > 10) were found to be inversely correlated with the GVF and also more abundant in undeveloped BMOs, with the opposite pattern observed for TGs with <10 DB (Table A1). This observation could be the result of the rearrangment and/or resorbition of PUFA during ovarian maturation/regression [84]. Although increased content of unsaturated TG was reported in ripe ovarian follicles of P. merguensis [48], others have reported the dominance of saturated and monounsaturated acyl moyeties in TGs of mud crab ovaries, Scylla paramamosain [85] and sea sponges [86].
In the non-ripe BMO group, significant up-regulation of ‘membrane components’, an enrichment term associated with membrane lipids, was observed instead (Figure 4B). The previous term is mainly linked to the larger abundance in undeveloped ovaries of diacyl and ether-linked (associated with the term ‘plasmalogen’) PC and PE molecules. This observation could be related to the greater surface area of the gonad covered by support cells, such as ADG and VCT, that are characteristic of early maturation stages of bivalve ovaries [15], which is also supported by the reduction in the GVF found in non-ripe BMOs compared to mature ones (Figure A1).
At the same time, many of such membrane lipids included saturated and short-chain molecules, features connected with the up-regulation of terms including ‘below average transition temperature’ and ‘below average bilayer thickens’. On the contrary, longer and more unsaturated PCs and PEs were more abundant in ripe BMOs, leading to the up-regulation of ‘PUFA’ and ‘fatty acids with more than 18C’ terms for this group.
In addition, terms related to ‘lysosomes’, the ‘Golgi apparatus’ and ‘mitochondrion’ were also up-regulated in the immature BMO group. The former two of these terms are connected with sphingolipids (ceramides—Cer, CAEP and CerPE), since these classes of lipids are produced in the Golgi apparatus and act in autolysis and atretic processes [87]. The higher content of such lipids in undeveloped BMOs (Table A1) could be related to gonadal degeneration and atresia, as the presence of lysosomes and vacuoles together with macrophage infiltration into follicles characterises atresia in mussel ovaries [28]. Interestingly, opposite patterns could be observed between Cer, which is higher on ripe BMOs and correlated with the GVF, CAEP and CerPE. Such a pattern could be related to the role of Cer as stabilisers in adipose tissue, and hence could be again related to the accumulation of yolk reserves in ripe ovaries [88]. The mitochondrion term is associated with the detection via the RFreg of a cardiolipin (CL) 88:21, which was also found to be weakly, inversely correlated with the GVF (Spearman rank r = −0.05, Table A1). The reason behind this observation is unclear, since the same molecule did not exhibit significantly different content between the two BMO groups (FDR p > 0.05). CLs are fundamental lipids found in the inner mitochondrial membrane, participating in cellular respiration [89]. Highly unsaturated CLs were found predominant in mussel gonads [49] and generally distinguish mussels from other groups of bivalves such as clams [90]. CL(88:21) must contain four 22C acyl residuals, since 24C FA detected in blue mussel gonads were saturated or monounsaturated species [49]. Therefore, it is likely that this molecule included three DHA and one FA22:3 non-methylene-interrupted dienoic (NMID). Nevertheless, the connection of the aforementioned molecule with mussel ovarian development is unclear if we exclude the structural role of NMID FA in mussel gonads [91]. An increase in CL(88:24) has been observed in quiescent scallops (Placopecten magellanicus), whereas larger extents of CL containing 18C FA (FA18:0 and FA18:1) were found in ripe (pre-spawning) individuals [92]. Given the different biology of the two species (dioecious for mussels and hermaphrodite for scallops), it becomes more difficult to directly compare our results with those of the previous study.

3.3. Ranking the Panel of Potential Biomarkers

ROC curves were generated for all important variables selected by each statistical method to distinguish ripe from non-ripe BMOs, and these were ranked using the AUC (Table A1). ROC curves are the graphical representation between the sensitivity (true positive) and specificity (true negative) of a biomarker [93]. AUC can range between 0.5 (diagonal line, random effect, 50% specific and 50% sensitive) and 1 (perfect biomarker, 100% specific and sensitive). The optimal cut-off value (maximum specificity and sensitivity) of a specific biomarker is calculated using the maximum distance between the ROC curve and the diagonal line (Youden index [94]). Molecules with AUC values between 0.8 and 0.9 have been considered potential biomarker candidates for various human diseases in the past [95,96]. Out of all the important variables highlighted by the statistical approach used in this study, 35 had an AUC value > 0.8 and are listed in Table 1. From the AUC > 0.8 variables (35/123), 10 were detected by all four statistical approaches, 17 by a combination of three of these approaches, 6 by two models and just 2 by a single model. This observation emphasises the importance of combining multiple orthogonal statistical approaches, as any biomarker discovery method will inevitably have some limitations if unaccompanied.
In the present study, no lipid was found to be a perfect marker of the two different gonadal stages (e.g., AUC = 1). One molecule showed an AUC > 0.9; CerPE(40:2), which had an AUC of 0.905 (95%CI: 0.799–0.981, Figure 5A) and was more abundant in undeveloped ovaries (Figure 5B). CerPE(40:2), was characterised by a sensitivity of 80% (true positive rate) and a specificity of 100% (true negative rate). Three other CerPEs ranked among AUC > 0.8 molecules (Table 1): CerPE(35:3/2), CerPE(34:3)a and CerPE(40:1), with AUC values of 0.874, 0.845 and 0.806, respectively. Other ceramide lipids important for distinguishing between ripe and immature BMOs included CAEP 42:2 and 42:3 with AUC values of 0.808 and 0.804, respectively. All the previous CerPE and CAEP molecules were minor components of these classes detected in Mytilus gonads [49] and were all found mainly in undeveloped BMOs; the role of these sphingolipid classes in marine invertebrates is not yet fully understood. CerPE and CAEP are mainly found in invertebrate membranes [54,97,98,99,100] and are synthesised from ceramide precursors through the action of a specific CDP ethanolamine:ceramide phosphotransferase localised in the Golgi apparatus [101]. In the scientific literature, there are scarce reports on the role of CerPE and CAEP in bivalve ovarian maturation. Pauletto and coworkers [102] reported a differential expression for genes involved in complex ceramide metabolism between ovarian oocytes and spawned oocytes in the clam R. decussatus. Other authors hypothesised a similar function for the CerPE and CAEP of the vertebrate analogue sphingomyelin (SM), which can form specific domains of the cell membrane defined as membrane rafts [100]. The decrease in CerPE and CAEP observed in ripe females could also be a consequence of the gamete maturation process, as a decrease in SM was observed in Xenopus laevis during oocyte maturation under the influence of progesterone [103].
Glycerophosphoserines (PSs) represented the lipid class that, overall, changed mostly between the two BMO groups (Table 1). PS(38:5) was the second most important biomarker of BMO maturity (AUC 0.894), and several of the other nine PS molecules with AUC > 0.8 (36:2, 38:5, 38:6a, 39:6, 40:2, 40:5, 40:6 and O-38:5/P-38:4 and O-38:6/P-38:5) were between the main diacyl and plasmanyl/plasmenyl PS species found in Mytilus gonads [49]. All AUC > 0.8 PSs were more abundant in undeveloped BMOs. PSs are secondary components of cell membranes, found mainly on the cytosolic side of the lipid bilayer [104]. Their relevance as markers of BMO development could be linked to different reasons. PSs may be converted into different glycerophospholipids, through the action for phosphotransferases, during ovarian maturation, as observed in largemouth gudgeon (Coreius guichenoti) ovaries [105]. On the contrary, the larger content of PSs in undeveloped BMOs could also be linked to apoptotic processes in degenerating gonads, since the exteriorisation of PSs is considered to be one of the ‘triggers’ for the phagocytosis of apoptotic cells [106,107], as observed in UV-exposed O. edulis haemocytes [108] and in apoptotic ovarian cells of Drosophila melanogaster [109,110].
The third molecule for prerelevance as a biomarker evidenced in this study is the ether-linked PE O-36:2/P-36:1c (AUC: 0.875; Table 1). All PE molecules with an AUC > 0.8 were plasmanyl/plasmenyl forms (O-34:3/P-34:2, O-36:1/P-36:0b, O-36:2/P-36:1b, O-36:3/P-36:2, O-37:2/P-37:1, O-37:3/P-37:2, O-37:4/P-37:3 and O-39:3/P-39:4) but did not include any of the principal PE forms described in blue mussel gonads [49], increasing the complexity of explaining the physiological motivation behind our observation. All PEs with an AUC > 0.8 were found to be mainly abundant in undeveloped BMOs, and this could possibly be related to the synthesis of PCs during ovarian development, thorugh the sequential methylation of PE [111]. A total of seven PC molecules had an AUC > 0.8; six of which were saturated molecules (28:0, 30:0 and the plasmanyl/plasmenyl O-30.0, O-31:0 and O-32:0; Table 1) and found to be distincly abundant in non-ripe BMOs. The long-chain and highly unsaturated ether-linked PC(46:10/P-46:9)a characterised, instead, ripe BMOs. This trend is in agreement with that reported by other authors on the PC composition of ovaries in the shrimp P. merguiensis: saturated PC molecules were found in undeveloped regions of the ovary, whereas highly unsaturated PCs (containing 20-22C PUFAs) had a higher coverage of ripe ovarian parts [48]. PCs and PEs are the two major membrane lipids in bivalves [21,49,65,112,113]. Given the low unsaturation of PCs and PEs with an AUC > 0.8 and abundance in non-ripe BMOs, it is likely that these molecules carried saturated or monounsaturated FA rather than long-chain PUFA. Acyl residues of membrane lipids represent energy sources during embryonic development in invertebrates [114,115], so their enrichment with PUFA at later developmental stages may be important for the proper development of the offspring. To this extent, PUFA enrichment patterns have been observed for PC and PE fractions in maturating scallop ovaries [43].
Lastly, four PA molecules (38:6, 40:6, 44:4 and the plasmalogen O-38:6/P-38:5), one glycerophosphoglycerol (PG) 34:1 and lyso-glycerophosphocholine (LPC) 22:2 completed the panel of 35 molecules with an AUC > 0.8 in distinguishing ripe from non-ripe BMOs (Table 1). All these lipid species were more abuntand in undeveloped ovaries, with the exception of PA(44:4) and LPC(22:2). There are not many reports on the role of such minor glycerophospholipid classes in bivalve or invertebrate ovarian maturation, due to their low abundance, which is hardly observable through traditional lipid analysis methods [47]. Therefore, we can only speculate about their role in the development of BMOs. PAs and PGs are intermediates of glycerophospholipid biosynthesis, each forming the precursors of the major membrane lipid classes and of cardiolipins [104]. PA concentration increases in Xenopus oocytes as a response to insulin treatment, as a result of phospholipase D, which produces lyso-glycerophosphate (LPA), which participates in the lipid signalling of oocyte maturation [111]. A reduction in most of PG was instead observed in sea sponges in reproductive individuals [86], which is an observation that could be in agreement with data discussed in the present study. LPCs are metabolites of PC formed by the Lands cycle through the action of a phospholipase that cleaves one of the FA residues [116]. LPC(22:2) is a molecule that is likely to bear a 22:2 NMID FA [49]. NMID FAs are endogenous lipids of mussels that are preferentially incorporated into the mantle, competing with C20 and C22 PUFA under reproductive demands [91], which can suggest the a role for LPC(22:2) in the restructuring of membrane lipids in ripe BMOs. Nevertheless, especially for these minor lipids, further evidence is needed to obtain a definitive overview of the system.

4. Conclusions

Rapid and informative tools for measuring the maturity stage of female mussels are useful both for industrial applications, as they can be used to monitor gamete maturation in hatcheries to reduce problems with uncontrolled spawning and pre-spawning atresia, and for monitoring the reproductive status of native mussel populations. The high resolution and ability to work at the lipid molecule level make LC-MS lipidomics an important platform for studying lipid biomarkers that can assess maturation status in BMOs. LC-MS analysis has produced a very complex and convoluted dataset that requires experienced data analysis skills to extract meaningful information from it. To fulfil this task, in this work we applied different supervised statistical approaches that were more powerful than classical unsupervised approaches (PCA) to highlight the differences between the studied BMO groups. In combining the four supervised statistical approaches used here (volcano, OPLS-DA, RFclass and RFreg), it was possible to highlight a panel of 35 lipid species that differ to a high degree (AUC > 0.8) between ripe and undeveloped BMOs. These potential markers include CerPE, CAEP, LPC, PA, PC, PE, PG and PS. In many cases, however, the exact role and reason for the importance of AUC > 0.8 lipids for classification could only be speculated upon, based on similarities reported by others in different model organisms. Future studies should aim to better understand the role played by lipids during gamete development in marine invertebrates such as bivalves, focussing on less studied lipid classes.
The set of potential lipid biomarkers should be validated and possibly quantified using further independent gonad samples, and if confirmed, the protocol should be improved and refined to develop a non-lethal assay. The method should be more fully developed if the lipid panel presented here is validated by moving to a targeted lipid quantification approach, which will significantly reduce the complexity of the dataset and the amount of data analysis required to obtain information on these important lipids. GHA is considered a traditional approach for monitoring gonadal development in marine mussels [117] and is an important tool for investigating the reproductive physiology, toxicology, life cycle and population structure of bivalves [10,11,16,28,32,34,36,118]. In the future, GHA will continue to be a necessary diagnostic tool, but technological advances and the application of state-of-the-art high-throughput omics techniques should support and complement traditional techniques to increase our knowledge of the physiology of marine organisms and their ability to adapt to a changing environment.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/lipidology2010005/s1: Method Supplement S1: Mussel rearing conditions; Method Supplement S2: Histological analyses and machine learning model training; Supplementary Tables: LC-MS conditions and lipid molecular standard mix; Supplementary Data S1: Raw peak intensity table and molecule ID details; Supplementary Data S2: List of all lipid and fatty acids species identified in the blue mussel ovaries.

Author Contributions

Conceptualisation, V.A.L.; methodology, V.A.L. and P.D.W.; software, V.A.L. and M.K.D.; formal analysis, V.A.L.; investigation, V.A.L.; data curation, V.A.L.; writing—original draft preparation, V.A.L.; writing—review and editing, S.C., A.D.H. and C.D.V.; visualisation, V.A.L.; supervision, M.K.D., P.D.W., S.C. and A.D.H.; project administration, A.D.H.; funding acquisition, A.D.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the European Social Fund and Scottish Funding Council as part of Developing Scotland’s Workforce in the Scotland 2014–2020 European Structural and Investment programme (Ref: UHI_SAMS_DSW_PGR_AY16/17).

Institutional Review Board Statement

The animal study protocol was approved by the Institutional Review Board (or Ethics Committee) of the University of the Highlands and Islands with protocol code ETH884 (Lipidomics and Proteomics investigation of Commercial Bivalve production in Scotland on the 6th February 2017).

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available through the various Supplementary Materials enclosed with the manuscript, and all LC-MS raw data files and experimental metadata have been deposited in the Metabolights [66] data repository (www.ebi.ac.uk/metabolights/MTBLS8492) and are available under study identifier MTBLS8492.

Acknowledgments

The authors would like to thank Inverlussa Marine Services (www.inverlussa.com) for providing their top-quality mussels for our experiment; Christine Beveridge (Scottish Association for Marine Sciences—SAMS) for her fantastic expertise in aquarium setup; Debbie Falchney (Institute of Aquaculture, University of Stirling) for her tutorial on gonad histological preparation; and Seshu Tammireddy (University of the Highlands and Islands) for those many hours discussing LC-MS and lipid HPLC separation. V.A.L. also thanks Holger Husi for long chats on data analysis and machine learning, which helped a lot in the conceptualisation of this work.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The relation between the ripe and non-ripe female GVF is shown in Figure A1. Ripe females were characterised by a significantly higher GVF (Welch’s t-test t: −7.865; df:49; p < 0.001) compared with non-ripe ones (the average GVF was 60.73 ± 11.01% in ripe and 36.01 ± 11.44% in non-ripe females).
Figure A1. Gonad histology analysis (GHA) and gamete volume fraction (GVF): differences between non-ripe and ripe females. n = 25 for ripe and n = 26 for non-ripe females. The marker in each box plot indicates the group average; the band inside the box indicates the group median; the box includes the observation between the 1st (25th percentile) and 3rd (75th percentile) quartile; whiskers represent values within ±1.5 × interquartile range (IQR); observations over ±1.5 × IQR are reported as outliers (black dots); violin shapes represent the variable distribution in each sample group. Plotted via R package ‘ggplot2’.
Figure A1. Gonad histology analysis (GHA) and gamete volume fraction (GVF): differences between non-ripe and ripe females. n = 25 for ripe and n = 26 for non-ripe females. The marker in each box plot indicates the group average; the band inside the box indicates the group median; the box includes the observation between the 1st (25th percentile) and 3rd (75th percentile) quartile; whiskers represent values within ±1.5 × interquartile range (IQR); observations over ±1.5 × IQR are reported as outliers (black dots); violin shapes represent the variable distribution in each sample group. Plotted via R package ‘ggplot2’.
Lipidology 02 00005 g0a1
The score plots representing principal component analysis are reported in Figure A2. In the plots, the observations are colour-coded according to the BMO ripening groups, while shapes are coded according to the different origins of the BMOs included in this study. There are no marked differences in the lipidome between the two BMO groups or according to the different origins of the samples. The clustering in principal components one and two accounted for 31% of the variability explained (Figure A2A); components two and three, 21.2% (Figure A2B); and components three and four, 16.9% (Figure A2C), whereas the fourth and fifth principal components taken together explained 13.4% of the multivariate differences (Figure A2D). These latter two components possibly resulted in the best separation of the two BMO ripening groups, still with an observed large overlapping of the confidence intervals.
Figure A2. Principal component analysis (PCA) of global lipidomics data in relation to ripe and non-ripe females (colour-coded) and gamete volume fraction (GVF, blue stacked arrow). Different shapes indicate the origin of the BMOs analysed in this study. Scree plot representing main principal components explaining the variability of lipidomics data. (A) PCA score plot for 1st and 2nd components; (B) PCA score plot for 2nd and 3rd components; (C) PCA score plot for 3rd and 4th components; (D) PCA score plot for 4th and 5th components. Individual samples are reported as single markers in the plot; ellipses indicate the 95% confidence interval. Computed and plotted via R package ‘factoextra’ [68].
Figure A2. Principal component analysis (PCA) of global lipidomics data in relation to ripe and non-ripe females (colour-coded) and gamete volume fraction (GVF, blue stacked arrow). Different shapes indicate the origin of the BMOs analysed in this study. Scree plot representing main principal components explaining the variability of lipidomics data. (A) PCA score plot for 1st and 2nd components; (B) PCA score plot for 2nd and 3rd components; (C) PCA score plot for 3rd and 4th components; (D) PCA score plot for 4th and 5th components. Individual samples are reported as single markers in the plot; ellipses indicate the 95% confidence interval. Computed and plotted via R package ‘factoextra’ [68].
Lipidology 02 00005 g0a2
Figure A3. Orthogonal partial least squares discriminant analysis (OPLS-DA) model validation: for discrimination between ripe and non-ripe females. (A) Ten-fold leave-one-out cross-validation (LOOCV); (B) 2000-fold permutation test. Computed via ‘MetaboAnalystR’.
Figure A3. Orthogonal partial least squares discriminant analysis (OPLS-DA) model validation: for discrimination between ripe and non-ripe females. (A) Ten-fold leave-one-out cross-validation (LOOCV); (B) 2000-fold permutation test. Computed via ‘MetaboAnalystR’.
Lipidology 02 00005 g0a3
The LOOCV model (Figure A3A) can be interpreted as 10.2% of the variance being predicted by the model (p1R2x), while 16.7% of the variance is uncorrelated with the grouping variable (oR2x). R2y instead provides information on the variance within the two groups, which is correlated with the grouping variable (p1R2y) and uncorrelated with the grouping variable (oR2y). The model accuracy (goodness of prediction, Q2) is close to the value of 0.4 accepted for metabolomics data, and it depends, also, on the number of variables correlated with the grouping factor. The permutation test (Figure A3B) shows that the two BMO groups were correctly separated by the model, as the permuted distribution—the true one with the ‘correct labels’—was significantly different from the permuted one.
Table A1. Lipid molecule markers of gonad ripeness identified witg the supervised statistical approaches in BMO and ranked according to receiving operating curves (ROCs) and area under the curve (AUC). Isomeric lipid molecules resolved by reverse phase separation (same exact mass of 5 Δppm, but different retention time) are distinguished with a lowercase letter after the lipid ID. Pval: Wilcoxon rank test with p-value adjusted for false discovery rate [72]; FC: fold change; r: Spearman rank correlation coefficient calculated on gamete volume fraction (GVF). Computed via R package ‘MetaboanalystR’. Average calculated on Pareto-scaled data ± 95% CI (N = 26, non-ripe and N = 25, ripe). In Bold: panel of potential lipid biomarkers (AUC > 0.8).
Table A1. Lipid molecule markers of gonad ripeness identified witg the supervised statistical approaches in BMO and ranked according to receiving operating curves (ROCs) and area under the curve (AUC). Isomeric lipid molecules resolved by reverse phase separation (same exact mass of 5 Δppm, but different retention time) are distinguished with a lowercase letter after the lipid ID. Pval: Wilcoxon rank test with p-value adjusted for false discovery rate [72]; FC: fold change; r: Spearman rank correlation coefficient calculated on gamete volume fraction (GVF). Computed via R package ‘MetaboanalystR’. Average calculated on Pareto-scaled data ± 95% CI (N = 26, non-ripe and N = 25, ripe). In Bold: panel of potential lipid biomarkers (AUC > 0.8).
Lipid IDAUCPvalFCrVolcanoOPLS-DARFclassRFregNot RipeRipe
Average ± CI (95%)Average ± CI (95%)
CAEP.35.30.74<0.010.39−0.48 x x1855.96 ± 247.941393.62 ± 125.7
CAEP.42.20.81<0.011.58−0.60 xxx47.59 ± 9.0620.67 ± 7.5
CAEP.42.30.8<0.013.13−0.56x xx36.95 ± 11.2511.19 ± 6.08
Cer.36.2a0.580.090.21−0.10 x 267.81 ± 60.62224.45 ± 23.52
Cer.38.10.76<0.01−0.990.38 x 47.24 ± 13.1875.37 ± 14.06
Cer.42.00.76<0.01−0.50.51 x 61.06 ± 8.3287.2 ± 11.11
CerPE.34.10.76<0.010.57−0.43 x 450.48 ± 85.69277.56 ± 41.11
CerPE.34.2b0.75<0.010.58−0.54 x128.16 ± 22.5884.28 ± 13.44
CerPE.34.3a0.84<0.010.98−0.67 xxx95.62 ± 20.8448.43 ± 8.13
CerPE.35.3/20.87<0.011.08−0.67 xxx132.82 ± 29.8255.95 ± 7.87
CerPE.40.10.81<0.011.17−0.57 xxx34.15 ± 7.0213.71 ± 5.25
CerPE.40.20.9<0.011.48−0.74xxxx46.4 ± 7.8817.69 ± 4.48
CerPE.42.10.80.015.5−0.53x xx7.66 ± 5.080.23 ± 0.45
CerPE.d44.20.79<0.010.73−0.52 x 52.96 ± 10.3631.51 ± 4.19
CL.88.210.560.68−0.3−0.05 x39.63 ± 15.3244.23 ± 15.59
DG.32.10.590.2−0.210.17 x1223.06 ± 217.881396.08 ± 272.03
DG.33.20.560.43−0.130.18 x254.51 ± 47.85270.76 ± 52.22
LPC.22.20.81<0.01−1.630.41x x 115.47 ± 39.6308.4 ± 101.03
LPC.22.50.76<0.01−2.390.42x 65.26 ± 37.91244.7 ± 114.32
LPC.22.60.72<0.01−1.860.41x x 131.28 ± 61.4497.37 ± 246.18
LPE.22.60.610.22−0.730.22x x 27 ± 10.7938.01 ± 18.64
PA.38.60.82<0.015.66−0.54x xx12.45 ± 6.381.46 ± 1.54
PA.40.60.86<0.012.85−0.70xxxx33.14 ± 12.058.31 ± 3.64
PA.44.40.81<0.01−1.760.63 xx102.52 ± 41.73226.25 ± 58.24
PA.O-38.2/P-38.10.740.011.36−0.48 xx32.04 ± 6.920.2 ± 7.86
PA.O-38.5/P-38.40.72<0.014.13−0.41xx 6.66 ± 3.711.22 ± 1.53
PA.O-38.6/P-38.50.85<0.012.12−0.68xxxx46.5 ± 13.0915.64 ± 4.86
PC.28.00.84<0.011.35−0.55xxxx426.29 ± 119.56140.95 ± 36.64
PC.29.00.79<0.010.94−0.42 x 326.9 ± 78.46150.35 ± 29.08
PC.30.00.85<0.011.37−0.60xxxx1149.26 ± 291.11396.62 ± 108.82
PC.31.00.76<0.010.66−0.36 xx 706.06 ± 121.33419.14 ± 60.12
PC.32.00.78<0.010.95−0.51 xx 1181.4 ± 230.72572.94 ± 110.41
PC.42.30.730.1−0.490.40 xx435.88 ± 121.3582.86 ± 82.68
PC.42.40.72<0.01−0.530.30 xx 76.01 ± 13.45104.97 ± 12.52
PC.O-29.00.8<0.011.57−0.51xxx 274.36 ± 86.8997.24 ± 36.48
PC.O-30.00.8<0.011.48−0.54xxx 862.53 ± 256.36318.65 ± 112.49
PC.O-30.1/P-30.00.72<0.010.6−0.39 xxx479.19 ± 98.68292.4 ± 56.3
PC.O-30.2/P-30.10.69<0.010.68−0.31 x 489.48 ± 140.93197.21 ± 62.1
PC.O-31.00.8<0.011.37−0.55 xxx68.9 ± 21.2541.64 ± 9.75
PC.O-31.1/P-31.00.75<0.010.85−0.45 x 313.33 ± 66.93170.76 ± 36.29
PC.O-32.00.8<0.011.35−0.56xxxx923.78 ± 291.49351.64 ± 111.32
PC.O-38.1/P-38.00.76<0.01−0.690.51 x400.67 ± 74.02634.91 ± 112.22
PC.O-38.5/P-38.40.680.02−0.370.33 x3158.87 ± 452.944126.82 ± 599.14
PC.O-39.2/P-39.10.710.02−0.50.41 x587.31 ± 109.84964.5 ± 162.84
PC.O-40.2/P-40.10.78<0.01−0.730.51 xx587.31 ± 109.84964.5 ± 162.84
PC.O-42.10/P-42.9a0.81<0.01−1.430.14xxxx145.61 ± 41.99341.56 ± 83.65
PE.36.3a0.650.050.160.26 x 33 ± 6.8943.8 ± 10.85
PE.38.4a0.570.280.1−0.42 xx102.72 ± 14.6296.79 ± 11.17
PE.37.40.720.011.19−0.37 x33.96 ± 9.7917.15 ± 5.86
PE.39.50.76<0.010.72−0.48 xx75.17 ± 13.7739.58 ± 7.07
PE.41.50.640.010.62−0.28 x33.31 ± 12.7616.49 ± 4.91
PE.42.9c0.560.59−0.220.23 x185.63 ± 49.07193.23 ± 41.74
PE.O-34.3/P-34.20.830.010.98−0.51 xx53.51 ± 20.0124.5 ± 4.84
PE.O-35.3/P-35.20.770.010.97−0.46 x40.95 ± 14.5218.98 ± 4.37
PE.O-36.1/P-36.0b0.87<0.011.63−0.74xxxx65.23 ± 19.6521.09 ± 4.88
PE.O-36.1/P-36.0c0.77<0.010.88−0.58 x91.33 ± 25.8341.6 ± 7.31
PE.O-36.2/P-36.1b0.85<0.010.96−0.62 xxx94.97 ± 22.7647.13 ± 7.8
PE.O-36.2/P-36.1c0.88<0.010.97−0.73 xx251.77 ± 50.85121.82 ± 15.8
PE.O-36.3/P-36.20.85<0.010.61−0.58 xx308.04 ± 55.87193.04 ± 19.3
PE.O-37.2/P-37.10.86<0.010.99−0.71 xxx231.1 ± 39.56121.75 ± 21.86
PE.O-37.3/P-37.20.84<0.010.51−0.55 xxx428.34 ± 49.7300.34 ± 26.36
PE.O-37.4/P-37.30.87<0.010.74−0.40 xxx84.17 ± 14.0550.25 ± 6.82
PE.O-38.2/P-38.10.8<0.010.63−0.62 xx627.28 ± 92.01407.74 ± 58.2
PE.O-38.3/P-38.20.78<0.010.38−0.52 x 1411.67 ± 151.081084.61 ± 86.66
PE.O-39.2/P-39.10.73<0.010.51−0.47 x125.68 ± 18.8492.57 ± 15.07
PE.O-39.3/P-39.20.8<0.010.45−0.56 xx360.03 ± 39.86264.36 ± 23.89
PE.O-39.4/P-39.30.73<0.010.38−0.38 xx194.69 ± 27146.64 ± 13.67
PE.O-40.7/P-40.6b0.670.07−0.240.35 x886.89 ± 125.561034.58 ± 117.48
PE.O-41.4/P-41.30.650.040.25−0.33 x110.58 ± 15.7492.1 ± 9.6
PG.34.10.86<0.012.19−0.70xxxx25.61 ± 8.87.32 ± 2.83
PI.40.30.620.05−0.280.21 x 294.18 ± 38.48350.62 ± 36.02
PI.40.40.650.04−0.390.32 x 266.66 ± 45.88331.8 ± 43.8
PI.40.80.710.01−0.390.47 xx118.42 ± 14.83147.88 ± 16.72
PI.41.50.60.26−0.20.33 x102.64 ± 16.24115.25 ± 13.04
PI.41.60.74<0.01−0.490.54 x103.73 ± 14.79140.82 ± 13.95
PI.42.30.79<0.01−0.950.53 xx45.08 ± 10.0578.71 ± 12.29
PI.42.10a0.710.02−0.710.43 x27.67 ± 6.8441.65 ± 10.88
PI.44.70.610.03−0.610.22 x11.63 ± 4.0626.72 ± 12.24
PI.O-38.5/P-38.4a0.76<0.011.03−0.59 x55.34 ± 11.2730.46 ± 8.17
PI.O-40.2/P-40.10.730.01−1.370.35x 16.8 ± 6.2544.17 ± 17.7
PS.34.10.8<0.016.82−0.54x xx13.17 ± 7.650.53 ± 0.75
PS.36.20.81<0.017.06−0.56x xx16.21 ± 10.110.14 ± 0.22
PS.38.2a0.640.1−0.420.32 x55.04 ± 12.8968.49 ± 13.29
PS.38.40.640.031.55−0.35 x27.56 ± 19.085.6 ± 1.94
PS.38.50.89<0.011.61−0.71xxxx215.46 ± 51.566.38 ± 17.26
PS.38.6a0.84<0.011.23−0.60 xxx164.04 ± 40.3966.94 ± 16.21
PS.39.60.82<0.011.21−0.55 xx96.45 ± 35.1839.88 ± 10.52
PS.40.20.83<0.010.75−0.68 xx157.85 ± 32.7791.97 ± 13.65
PS.40.50.82<0.011.52−0.67 x41.86 ± 11.0518.17 ± 5.55
PS.40.60.89<0.011.06−0.74 xxx329.08 ± 66.25155.36 ± 27.89
PS.40.8a0.78<0.014−0.46xx 46.14 ± 17.079.88 ± 4.45
PS.42.3b0.60.260.11−0.25 x82.48 ± 17.2572.17 ± 12.55
PS.42.40.610.030.3−0.23 x265.25 ± 70.08188.69 ± 29.82
PS.O-34.00.78<0.011.45−0.58xx 41.47 ± 14.2315.36 ± 4.76
PS.O-38.1/P-38.00.580.15−0.410.03 x 49.83 ± 10.0159.14 ± 5.8
PS.O-38.2/P-38.10.70.040.41−0.44 x403.79 ± 54.5327.41 ± 58.89
PS.O-38.5/P-38.40.88<0.010.82−0.74 xxx163.08 ± 24.4494.22 ± 13.23
PS.O-38.6/P-38.50.86<0.010.99−0.67 xxx361.75 ± 74.27168.69 ± 30.8
PS.O-40.00.70.01−0.60.36 x66.34 ± 12.9788.35 ± 13.27
PS.O-40.6/P-40.50.75<0.010.36−0.46 xx372.13 ± 29.19294.14 ± 34.19
PS.O-42.5/P-42.4b0.79<0.010.44−0.57 xxx313.53 ± 31.31233.07 ± 24.54
PS.O-42.6/P-42.5b0.79<0.010.45−0.52 xx 226.36 ± 26.28165.82 ± 14.05
TG.44.00.76<0.01−0.770.49 x 371.86 ± 72.65620.47 ± 120.92
TG.46.40.78<0.01−0.910.52 xx389.78 ± 87.09641.69 ± 122.92
TG.47.40.78<0.01−1.010.56 x233.64 ± 57.24399.27 ± 78.14
TG.49.40.740.01−0.610.55 x898.38 ± 176.021280.61 ± 184.12
TG.49.50.730.01−0.580.51 x967.3 ± 181.851356.58 ± 186.94
TG.50.10.730.01−0.560.46 x3679.66 ± 783.324970.1 ± 532.03
TG.51.70.680.05−0.430.46 x803.73 ± 148.861007.24 ± 129.46
TG.52.1b0.72<0.01−0.620.45 x2349.98 ± 497.123209.19 ± 324.5
TG.52.20.730.01−0.50.47 x5644.84 ± 1125.167359.71 ± 769.69
TG.52.30.670.08−0.410.38 x5871.8 ± 1225.937209.96 ± 967.84
TG.54.1b0.75<0.01−0.690.52 x 1119.48 ± 233.541650.3 ± 207.51
TG.54.60.710.02−0.440.51 x10750.34 ± 1936.8313813.28 ± 1289.65
TG.55.50.690.03−0.410.48 x1918.8 ± 352.62479.95 ± 257.98
TG.56.80.76<0.01−0.980.58 xx 150.5 ± 34.36263.58 ± 60.64
TG.58.2a0.74<0.01−0.690.35 x350.25 ± 73.99476.86 ± 48.03
TG.62.130.640.030.34−0.21 x 3162.55 ± 736.722296.29 ± 325.35
TG.63.130.640.030.53−0.24 xx539.74 ± 170.8348.6 ± 78.85
TG.64.16b0.70.010.58−0.33 x740.92 ± 199.04464.71 ± 88.7
TG.66.150.610.071.33−0.22 x332.46 ± 148.58180.21 ± 61.03
TG.66.180.620.030.47−0.22 x910.13 ± 314.41524.17 ± 151.27
TG.O-52.10.76<0.01−0.930.46 xx369.81 ± 102.77637.06 ± 140.8

References

  1. van den Burg, S.W.K.; Termeer, E.E.W.; Skirtun, M.; Poelman, M.; Veraart, J.A.; Selnes, T. Exploring mechanisms to pay for ecosystem services provided by mussels, oysters and seaweeds. Ecosyst. Serv. 2022, 54, 101407. [Google Scholar] [CrossRef]
  2. Sea, M.A.; Hillman, J.R.; Thrush, S.F. The influence of mussel restoration on coastal carbon cycling. Glob. Chang. Biol. 2022, 28, 5269–5282. [Google Scholar] [CrossRef] [PubMed]
  3. Nijdam, D.; Rood, T.; Westhoek, H. The price of protein: Review of land use and carbon footprints from life cycle assessments of animal food products and their substitutes. Food Policy 2012, 37, 760–770. [Google Scholar] [CrossRef]
  4. Yaghubi, E.; Carboni, S.; Snipe, R.M.J.; Shaw, C.S.; Fyfe, J.J.; Smith, C.M.; Kaur, G.; Tan, S.Y.; Hamilton, D.L. Farmed Mussels: A Nutritive Protein Source, Rich in Omega-3 Fatty Acids, with a Low Environmental Footprint. Nutrients 2021, 13, 1124. [Google Scholar] [CrossRef] [PubMed]
  5. Lopez, A.; Bellagamba, F.; Moretti, V.M. Nutritional quality traits of Mediterranean mussels (Mytilus galloprovincialis): A sustainable aquatic food product available on Italian market all year round. Food Sci. Technol. Int. 2022, 29, 718–728. [Google Scholar] [CrossRef]
  6. Krause, G.; Le Vay, L.; Buck, B.H.; Costa-Pierce, B.A.; Dewhurst, T.; Heasman, K.G.; Nevejan, N.; Nielsen, P.; Nielsen, K.N.; Park, K.; et al. Prospects of Low Trophic Marine Aquaculture Contributing to Food Security in a Net Zero-Carbon World. Front. Sustain. Food Syst. 2022, 6, 875509. [Google Scholar] [CrossRef]
  7. Avdelas, L.; Avdic-Mravlje, E.; Borges Marques, A.C.; Cano, S.; Capelle, J.J.; Carvalho, N.; Cozzolino, M.; Dennis, J.; Ellis, T.; Fernández Polanco, J.M.; et al. The decline of mussel aquaculture in the European Union: Causes, economic impacts and opportunities. Rev. Aquac. 2020, 13, 91–118. [Google Scholar] [CrossRef]
  8. Stewart-Sinclair, P.J.; Last, K.S.; Payne, B.L.; Wilding, T.A. A global assessment of the vulnerability of shellfish aquaculture to climate change and ocean acidification. Ecol. Evol. 2020, 10, 3518–3534. [Google Scholar] [CrossRef]
  9. Navarro, E.; Iglesias, J.I.P. Energetics of reproduction related to evironmental variability in bivalve molluscs. Haliotis 1995, 24, 43–55. [Google Scholar]
  10. Duinker, A.; Håland, L.; Hovgaard, P.; Mortensen, S. Gonad development and spawning in one and two year old mussels (Mytilus edulis) from Western Norway. J. Mar. Biol. Assoc. UK 2008, 88, 1465–1473. [Google Scholar] [CrossRef]
  11. Murray, H.M.; Gallardi, D.; Mills, T. Effect of culture depth and season on the condition and reproductive indices of blue mussels (Mytilus edulis L.) cultured in a cold-water coastal environment. J. Shellfish. Res. 2019, 38, 351–362, 312. [Google Scholar]
  12. Seed, R.; Brown, R.A. A comparison of the reproductive cycles of Modiolus modiolus (L.), Cerastoderma (=Cardium) edule (L.), and Mytilus edulis L. in Strangford Lough, Northern Ireland. Oecologia 1977, 30, 173–188. [Google Scholar] [CrossRef] [PubMed]
  13. Villalba, A. Gametogenic cycle of cultured mussel, Mytilus galloprovincialis, in the bays of Galicia (NW Spain). Aquaculture 1995, 130, 269–277. [Google Scholar]
  14. Pieters, H.; Kluytmans, J.H.; Zandee, D.I. Tissue composition and reproduction of Mytilus edulis in relation to food availability. Neth. J. Sea Res. 1980, 14, 349–361. [Google Scholar]
  15. Pipe, R.K. Ultrastructural and cytochemical study on interactions between nutrient storage cells and gametogenesis in the mussel Mytilus Edulis. Mar. Biol. 1987, 96, 519–528. [Google Scholar]
  16. Pipe, R.K. oogenesis in the marine mussel Mytilus edulis: An ultrastructural study. Mar. Biol. 1987, 95, 405–414. [Google Scholar]
  17. Zandee, D.I.; Kluytmans, J.H.; Zurburg, W.; Pieters, H. Seasonal variations in biochemical composition of Mytilus edulis with reference to energy metabolism and gametogenesis. Neth. J. Sea Res. 1980, 14, 1–29. [Google Scholar]
  18. Martínez-Pita, I.; Sánchez-Lazo, C.; Ruíz-Jarabo, I.; Herrera, M.; Mancera, J.M. Biochemical composition, lipid classes, fatty acids and sexual hormones in the mussel Mytilus galloprovincialis from cultivated populations in south Spain. Aquaculture 2012, 358–359, 274–283. [Google Scholar] [CrossRef]
  19. Dagorn, F.; Couzinet-Mossion, A.; Kendel, M.; Beninger, P.G.; Rabesaotra, V.; Barnathan, G.; Wielgosz-Collin, G. Exploitable Lipids and Fatty Acids in the Invasive Oyster Crassostrea gigas on the French Atlantic Coast. Mar. Drugs 2016, 14, 104. [Google Scholar] [CrossRef]
  20. Pazos, A.J.; Ruiz, C.; Garcia-Martin, O.; Abad, M.; Sanchez, J.L. Seasonal variations of the lipid content and fatty acid composition of Crassostrea gigas cultured in E1 Grove, Galicia, n.W. Spain. Comp. Biochem. Physiol. 1996, 114, 171–179. [Google Scholar]
  21. Soudant, P.; Van Ryckeghem, K.; Marty, Y.; Moal, J.; Samain, J.F.; Sorgeloos, P. Comparison of the lipid class and fatty acid composition between a reproductive cycle in nature and a standard hatchery conditioning of the Pacific Oyster Crassostrea Gigas. Comp. Biochem. Physiol. Part. B Biochem. Mol. Biol. 1999, 123, 209–222. [Google Scholar] [CrossRef]
  22. Abad, M.; Ruiz, C.; Martinez, D.; Mosquera, G.; Sanchez, J.L. Seasonal variations of lipid classes and fatty acids in flat oyster, Ostrea edulis, from San Cibran (Galicia, Spain). Camp. Biochem. Physiol. 1995, 110, 109–118. [Google Scholar] [CrossRef]
  23. Alkanani, T.; Parrish, C.C.; Thompson, R.J.; McKenzie, C.H. Role of fatty acids in cultured mussels, Mytilus edulis, grown in Notre Dame Bay, Newfoundland. J. Exp. Mar. Biol. Ecol. 2007, 348, 33–45. [Google Scholar] [CrossRef]
  24. Orban, E.; Di Lena, G.; Nevigato, T.; Casini, I.; Marzetti, A.; Caproni, R. Seasonal changes in meat content, condition index and chemical composition of mussels (Mytilus galloprovincialis) cultured in two different Italian sites. Food Chem. 2002, 77, 57–65. [Google Scholar] [CrossRef]
  25. Mladimeo, I.; Peharda, M.; Orhanović, S.; Bolotin, J.; Pavela-Vrančić, M.; Treursić, B. The reproductive cycle, condition index and biochemical composition of the horse-bearded mussel Modiolus barbatus. Helgol. Mar. Res. 2007, 61, 183–192. [Google Scholar] [CrossRef]
  26. Zheng, H.; Zhang, Q.; Liu, H.; Liu, W.; Sun, Z.; Li, S.; Zhang, T. Cloning and expression of vitellogenin (Vg) gene and its correlations with total carotenoids content and total antioxidant capacity in noble scallop Chlamys nobilis (Bivalve: Pectinidae). Aquaculture 2012, 366–367, 46–53. [Google Scholar] [CrossRef]
  27. Beninger, P.G. Seasonal variations of the major lipid classes in relation to the reproductive activity of two species of clams raised in a common habitat: Tapes decussatus L. (Jeffreys, 1863) and T. philippinarum (Adams & Reeve, 1850). J. Exp. Mar. Biol. Ecol. 1984, 79, 79–90. [Google Scholar]
  28. Beninger, P.G. Caveat observator: The many faces of pre-spawning atresia in marine bivalve reproductive cycles. Mar. Biol. 2017, 164, 163. [Google Scholar] [CrossRef]
  29. Fearman, J.; Moltschaniwskyj, N.A. Warmer temperatures reduce rates of gametogenesis in temperate mussels, Mytilus galloprovincialis. Aquaculture 2010, 305, 20–25. [Google Scholar] [CrossRef]
  30. Helm, M.M.; Bourne, N. Hatchery Culture of Bivalves: A Practical Manual; FAO Fisheries technical paper 471; Lovatelli, A., Ed.; FAO, Food and Agriculture Organization: Rome, Otaly, 2004. [Google Scholar]
  31. Lucas, A.; Beninger, P.G. The use of physiological condition indices in marine bivalve aquaculture. Aquaculture 1985, 44, 187–200. [Google Scholar] [CrossRef]
  32. Toro, J.E.; Thompson, R.J.; Innes, D.J. Reproductive isolation and reproductive output in two sympatric mussel species (Mytilus edulis, M. trossulus) and their hybrids from Newfoundland. Mar. Biol. 2002, 141, 897–909. [Google Scholar] [CrossRef]
  33. Chipperfield, P.N.J. Observations on the breeding and settlement of Mytilus edulis (L.) in British waters. J. Mar. Biol. Assoc. UK 1953, 32, 449–476. [Google Scholar] [CrossRef]
  34. Seed, R. Ecology. In Marine Mussels: Their Ecology and Physiology; Bayne, B.L., Ed.; Cambridge University Press: Cambridge, UK, 1976. [Google Scholar]
  35. Gosling, E. Bivalve Mollusc, Biology, Ecology and Culture; Fishing New Books, Blakwell Science: Hoboken, NJ, USA, 2004. [Google Scholar]
  36. Beninger, P.G. A qualitative and quantitative study of the reproductive cycle of the giant scallop, Placopecten magellanicus, in the Bay of Fundy (New Brunswick, Canada). Can. J. Zool. 1987, 65, 495–498. [Google Scholar] [CrossRef]
  37. Fraser, M.; Fortier, M.; Roumier, P.-H.; Parent, L.; Brousseau, P.; Fournier, M.; Surette, C.; Vaillancourt, C. Sex determination in blue mussels: Which method to choose? Mar. Environ. Res. 2016, 120, 78–85. [Google Scholar] [CrossRef] [PubMed]
  38. Bayne, B.L.; Holland, D.L.; Moore, M.N.; Lowe, D.M.; Widdows, J. Further studies on the effects of stress in the adult on the eggs of Mytilus edulis. J. Mar. Biol. Assoc. UK 1978, 58, 825–841. [Google Scholar] [CrossRef]
  39. Newell, R.; Bayne, B. Seasonal changes in the physiology, reproductive condition and carbohydrate content of the cockle Cardium (= Cerastoderma) edule (Bivalvia: Cardiidae). Mar. Biol. 1980, 56, 11–19. [Google Scholar] [CrossRef]
  40. Pipe, R. Seasonal cycles in and effects of starvation on egg development in Mytilus edulis. Mar. Ecol. Prog. Ser. 1985, 24, 121–128. [Google Scholar] [CrossRef]
  41. Murray, H.M.; Ollerhead, L.M.N. A comparison of ArcGIS and Image J software tools for calculation of gonad volume fraction (GVF) froom histological sections of blue mussel cultured in deep and shallow sites on the Nortehrn coast of Newfoundland. Can. Tech. Rep. Fish. Aquat. Sci. 2018, 3275, vi-17. [Google Scholar]
  42. Hines, A.; Yeung, W.H.; Craft, J.; Brown, M.; Kennedy, J.; Bignell, J.; Stentiford, G.D.; Viant, M.R. Comparison of histological, genetic, metabolomics, and lipid-based methods for sex determination in marine mussels. Anal. Biochem. 2007, 369, 175–186. [Google Scholar] [CrossRef]
  43. Duinker, A.; Torstersen, B.E.; Lie, Ø. Lipid classes and fatty acid composition of female gonads of great scallops: A selective field study. J. Shellfish. Res. 2004, 23, 507–514. [Google Scholar]
  44. Martínez-Pita, I.; Hachero-Cruzado, I.; Sánchez-Lazo, C.; Moreno, O. Effect of diet on the lipid composition of the commercial clam Donax trunculus (Mollusca: Bivalvia): Sex-related differences. Aquac. Res. 2012, 43, 1134–1144. [Google Scholar] [CrossRef]
  45. Soudant, P.; Moal, J.; Marty, Y.; Samain, J.F. Impact of the quality of dietary fatty acids on metabolism and the composition of polar lipid classes in female gonads of Pecten maximus (L.). J. Exp. Mar. Biol. Ecol. 1996, 205, 149–163. [Google Scholar] [CrossRef]
  46. Chérel, D.; Beninger, P.G. Oocyte atresia characteristics and effect on reproductive effort of Manila clam Tapes philippinarum (Adams and Reeve, 1850). J. Shellfish. Res. 2017, 36, 549–557. [Google Scholar] [CrossRef]
  47. Laudicella, V.A.; Whitfield, P.D.; Carboni, S.; Doherty, M.K.; Hughes, A.D. Application of lipidomics in bivalve aquaculture, a review. Rev. Aquac. 2020, 12, 678–702. [Google Scholar] [CrossRef]
  48. Chansela, P.; Goto-Inoue, N.; Zaima, N.; Hayasaka, T.; Sroyraya, M.; Kornthong, N.; Engsusophon, A.; Tamtin, M.; Chaisri, C.; Sobhon, P.; et al. Composition and localization of lipids in Penaeus merguiensis ovaries during the ovarian maturation cycle as revealed by imaging mass spectrometry. PLoS ONE 2012, 7, e33154. [Google Scholar] [CrossRef]
  49. Laudicella, V.A.; Carboni, S.; Whitfield, P.D.; Doherty, M.K.; Hughes, A.D. Sexual dimorphism in the gonad lipidome of blue mussels (Mytilus sp.): New insights from a global lipidomics approach. Comp. Biochem. Physiol. Part. D Genom. Proteom. 2023, 48, 101150. [Google Scholar] [CrossRef] [PubMed]
  50. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar]
  51. Bayne, B.L. Marine Mussels: Their Ecology and Physiology; Cambridge University Press: Cambridge, UK, 1976; Volume 10. [Google Scholar]
  52. Alfaro, A.C.; Jeffs, A.G.; Hooker, S.H. Reproductive behavior of the green-lipped mussel, Perna canaliculus, in northern New Zealand. Bull. Mar. Sci. 2001, 69, 1095–1108. [Google Scholar]
  53. Folch, J.; Lees, M.; Sloane Stanley, G.H. A simple method for the isolation and purification of total lipids from animal tissues. J. biol Chem. 1957, 226, 497–509. [Google Scholar]
  54. Facchini, L.; Losito, I.; Cataldi, T.R.; Palmisano, F. Ceramide lipids in alive and thermally stressed mussels: An investigation by hydrophilic interaction liquid chromatography-electrospray ionization Fourier transform mass spectrometry. J. Mass. Spectrom. 2016, 51, 768–781. [Google Scholar] [CrossRef]
  55. Fahy, E.; Subramaniam, S.; Brown, H.A.; Glass, C.K.; Merrill, A.H., Jr.; Murphy, R.C.; Raetz, C.R.; Russell, D.W.; Seyama, Y.; Shaw, W.; et al. A comprehensive classification system for lipids. J. Lipid Res. 2005, 46, 839–861. [Google Scholar] [CrossRef]
  56. Fahy, E.; Subramaniam, S.; Murphy, R.C.; Nishijima, M.; Raetz, C.R.; Shimizu, T.; Spener, F.; van Meer, G.; Wakelam, M.J.; Dennis, E.A. Update of the LIPID MAPS comprehensive classification system for lipids. J. Lipid Res. 2009, 50, S9–S14. [Google Scholar] [CrossRef] [PubMed]
  57. Boselli, E.; Pacetti, D.; Lucci, P.; Frega, N.G. Characterization of phospholipid molecular species in the edible parts of bony fish and shellfish. J. Agric. Food Chem. 2012, 60, 3234–3245. [Google Scholar] [CrossRef]
  58. Chen, S.; Belikova, N.A.; Subbaiah, P.V. Structural elucidation of molecular species of pacific oyster ether amino phospholipids by normal-phase liquid chromatography/negative-ion electrospray ionization and quadrupole/multiple-stage linear ion-trap mass spectrometry. Anal. Chim. Acta 2012, 735, 76–89. [Google Scholar] [CrossRef] [PubMed]
  59. Liu, Z.Y.; Zhou, D.Y.; Zhao, Q.; Yin, F.W.; Hu, X.P.; Song, L.; Qin, L.; Zhang, J.R.; Zhu, B.W.; Shahidi, F. Characterization of glycerophospholipid molecular species in six species of edible clams by high-performance liquid chromatography-electrospray ionization-tandem mass spectrometry. Food Chem. 2017, 219, 419–427. [Google Scholar] [CrossRef] [PubMed]
  60. Yin, F.W.; Zhou, D.Y.; Zhao, Q.; Liu, Z.Y.; Hu, X.P.; Liu, Y.F.; Song, L.; Zhou, X.; Qin, L.; Zhu, B.W.; et al. Identification of glycerophospholipid molecular species of mussel (Mytilus edulis) lipids by high-performance liquid chromatography-electrospray ionization-tandem mass spectrometry. Food Chem. 2016, 213, 344–351. [Google Scholar] [CrossRef]
  61. Facchini, L.; Losito, I.; Cataldi, T.R.I.; Palmisano, F. Seasonal variations in the profile of main phospholipids in Mytilus galloprovincialis mussels: A study by hydrophilic interaction liquid chromatography-electrospray ionization Fourier transform mass spectrometry. J. Mass. Spectrom. 2018, 53, 1–20. [Google Scholar] [CrossRef]
  62. Facchini, L.; Losito, I.; Cianci, C.; Cataldi, T.R.; Palmisano, F. Structural characterization and profiling of lyso-phospholipids in fresh and in thermally stressed mussels by hydrophilic interaction liquid chromatography-electrospray ionization-Fourier transform mass spectrometry. Electrophoresis 2016, 37, 1823–1838. [Google Scholar] [CrossRef]
  63. Losito, I.; Facchini, L.; Valentini, A.; Cataldi, T.R.I.; Palmisano, F. Fatty acidomics: Evaluation of the effects of thermal treatments on commercial mussels through an extended characterization of their free fatty acids by liquid chromatography—Fourier transform mass spectrometry. Food Chem. 2018, 255, 309–322. [Google Scholar] [CrossRef]
  64. Guercia, C.; Cianciullo, P.; Porte, C. Analysis of testosterone fatty acid esters in the digestive gland of mussels by liquid chromatography-high resolution mass spectrometry. Steroids 2017, 123, 67–72. [Google Scholar] [CrossRef]
  65. Donato, P.; Micalizzi, G.; Oteri, M.; Rigano, F.; Sciarrone, D.; Dugo, P.; Mondello, L. Comprehensive lipid profiling in the Mediterranean mussel (Mytilus galloprovincialis) using hyphenated and multidimensional chromatography techniques coupled to mass spectrometry detection. Anal. Bioanal. Chem. 2018, 410, 3297–3313. [Google Scholar] [CrossRef] [PubMed]
  66. Yurekten, O.; Payne, T.; Tejera, N.; Amaladoss, F.X.; Martin, C.; Williams, M.; O’Donovan, C. MetaboLights: Open data repository for metabolomics. Nucleic Acids Res. 2024, 52, D640–D646. [Google Scholar] [CrossRef] [PubMed]
  67. Xia, J.; Chong, J. MetaboanlystR: An R Package for Comprehensive Analysis of Metabolomics Data., 0.0.0.9000. 2018. Available online: https://www.metaboanalyst.ca/docs/RTutorial.xhtml (accessed on 20 September 2019).
  68. Kassambara, A.; Mudt, F. Package ’factoextra’, 1.0.5. 2017. Available online: https://cran.r-project.org/web/packages/factoextra/index.html (accessed on 10 November 2024).
  69. Wold, S.; Esbensen, K.; Geladi, P. Principal Component Analysis. Chemom. Intell. Lab. Syst. 1987, 2, 37–52. [Google Scholar] [CrossRef]
  70. van den Berg, R.A.; Hoefsloot, H.C.; Westerhuis, J.A.; Smilde, A.K.; van der Werf, M.J. Centering, scaling, and transformations: Improving the biological information content of metabolomics data. BMC Genom. 2006, 7, 142. [Google Scholar] [CrossRef]
  71. Blighe, K.; Rana, S.; Lewis, M. EnhancedVolcano: Publication-ready volcano plots with enhanced colouring and labeling., R package version 1.4.0. 2019. Available online: https://github.com/kevinblighe/EnhancedVolcano (accessed on 12 November 2024).
  72. Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
  73. Bylesjö, M.; Rantalainen, M.; Cloarec, O.; Nicholson, J.K.; Holmes, E.; Trygg, J. OPLS discriminant analysis: Combining the strengths of PLS-DA and SIMCA classification. J. Chemom. 2006, 20, 341–351. [Google Scholar] [CrossRef]
  74. Gromski, P.S.; Muhamadali, H.; Ellis, D.I.; Xu, Y.; Correa, E.; Turner, M.L.; Goodacre, R. A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding. Anal. Chim. Acta 2015, 879, 10–23. [Google Scholar] [CrossRef]
  75. Brieuc, M.S.O.; Waters, C.D.; Drinan, D.P.; Naish, K.A. A practical introduction to Random Forest for genetic association studies in ecology and evolution. Mol. Ecol. Resour. 2018, 18, 755–766. [Google Scholar] [CrossRef]
  76. Liaw, A.; Wieiner, M. Package ’randomForest’. 2018. Available online: https://cran.r-project.org/web/packages/randomForest/index.html (accessed on 12 March 2020).
  77. Molenaar, M.R.; Jeucken, A.; Wassenaar, T.A.; van de Lest, C.H.A.; Brouwers, J.F.; Helms, J.B. LION/web: A web-based ontology enrichment tool for lipidomic data analysis. Gigascience 2019, 8, giz061. [Google Scholar] [CrossRef]
  78. Wagner, F. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge. PLoS ONE 2015, 10, e0143196. [Google Scholar] [CrossRef]
  79. Newell, R.I.; Hilbish, T.J.; Koehn, R.K.; Newell, C.J. Temporal variation in the reproductive cycle of Mytilus edulis L.(Bivalvia, Mytilidae) from localities on the east coast of the United States. Biol. Bull. 1982, 162, 299–310. [Google Scholar] [CrossRef]
  80. Pernet, F.; Gauthier-Clerc, S.; Mayrand, E. Change in lipid composition in eastern oyster (Crassostrea virginica Gmelin) exposed to constant or fluctuating temperature regimes. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 2007, 147, 557–565. [Google Scholar] [CrossRef] [PubMed]
  81. Ren, S.; Hinzman, A.A.; Kang, E.L.; Szczesniak, R.D.; Lu, L.J. Computational and statistical analysis of metabolomics data. Metabolomics 2015, 11, 1492–1513. [Google Scholar] [CrossRef]
  82. Checa, A.; Bedia, C.; Jaumot, J. Lipidomic data analysis: Tutorial, practical guidelines and applications. Anal. Chim. Acta 2015, 885, 1–16. [Google Scholar] [CrossRef] [PubMed]
  83. Franceschi, P.; Giordan, M.; Wehrens, R. Multiple comparisons in mass-spectrometry-based -omics technologies. TrAC Trends Anal. Chem. 2013, 50, 11–21. [Google Scholar] [CrossRef]
  84. Pazos, A.J.; Román, G.; Acosta, C.P.; Sánchez, J.L.; Abad, M. Lipid Classes and Fatty Acid Composition in the Female Gonad of Pecten maximus in Relation to Reproductive Cycle and Environmental Variables. Comp. Biochem. Physiol. Part. B Biochem. Mol. Biol. 1997, 117, 393–402. [Google Scholar] [CrossRef]
  85. Zeng, X.; Li, Z.; Zhang, Z.; Shi, X.; Wang, Y. Variations in lipid composition of ovaries and hepatopancreas during vitellogenesis in the mud crab Scylla paramamosain: Implications of lipid transfer from hepatopancreas to ovaries. Aquac. Rep. 2024, 35, 102008. [Google Scholar] [CrossRef]
  86. Koutsouveli, V.; Balgoma, D.; Checa, A.; Hedeland, M.; Riesgo, A.; Cardenas, P. Oogenesis and lipid metabolism in the deep-sea sponge Phakellia ventilabrum (Linnaeus, 1767). Sci. Rep. 2022, 12, 6317. [Google Scholar] [CrossRef] [PubMed]
  87. Merrill, A.H.; Schmelz, E.-M.; Dillehay, D.L.; Spiegel, S.; Shayman, J.A.; Schroeder, J.J.; Riley, Ø.R.T.; Voss, K.A.; Wang, E. Sphingolipids—The enigmatic lipid class: Biochemistry, physiology, and pathophysiology. Toxicol. Appl. Pharmacol. 1997, 142, 208–225. [Google Scholar] [CrossRef]
  88. Alexaki, A.; Clarke, B.A.; Gavrilova, O.; Ma, Y.; Zhu, H.; Ma, X.; Xu, L.; Tuymetova, G.; Larman, B.C.; Allende, M.L.; et al. De Novo Sphingolipid Biosynthesis Is Required for Adipocyte Survival and Metabolic Homeostasis. J. Biol. Chem. 2017, 292, 3929–3939. [Google Scholar] [CrossRef]
  89. Dudek, J. Role of Cardiolipin in Mitochondrial Signaling Pathways. Front. Cell Dev. Biol. 2017, 5, 90. [Google Scholar] [CrossRef] [PubMed]
  90. Kraffe, E.; Grall, J.; Le Duff, M.; Soudant, P.; Marty, Y. A striking parallel between cardiolipin fatty acid composition and phylogenetic belonging in marine bivalves: A possible adaptative evolution? Lipids 2008, 43, 961. [Google Scholar] [CrossRef]
  91. Fernández-Reiriz, M.J.; Garrido, J.L.; Irisarri, J. Fatty acid composition in Mytilus galloprovincialis organs: Trophic interactions, sexual differences and differential anatomical distribution. Mar. Ecol. Prog. Ser. 2015, 528, 221–234. [Google Scholar] [CrossRef]
  92. Kraffe, E.; Tremblay, R.; Belvin, S.; LeCoz, J.-R.; Marty, Y.; Guderley, H. Effect of reproduction on escape responses, metabolic rates and muscle mitochondrial properties in the scallop Placopecten magellanicus. Mar. Biol. 2008, 156, 25–38. [Google Scholar] [CrossRef]
  93. Fan, J.; Upadhye, S.; Worster, A. Understanding receiver operating characteristic (ROC) curves. CJEM 2006, 8, 19–20. [Google Scholar] [CrossRef]
  94. Youden, W.J. Index for rating diagnostic tests. Cancer 1950, 3, 32–35. [Google Scholar] [CrossRef] [PubMed]
  95. Ren, C.; Liu, J.; Zhou, J.; Liang, H.; Wang, Y.; Sun, Y.; Ma, B.; Yin, Y. Lipidomic analysis of serum samples from migraine patients. Lipids Health Dis. 2018, 17, 22. [Google Scholar] [CrossRef] [PubMed]
  96. Liu, C.; Zong, W.-J.; Zhang, A.-H.; Zhang, H.-M.; Luan, Y.-H.; Sun, H.; Cao, H.-X.; Wang, X.-J. Lipidomic characterisation discovery for coronary heart disease diagnosis based on high-throughput ultra-performance liquid chromatography and mass spectrometry. RSC Adv. 2018, 8, 647–654. [Google Scholar] [CrossRef]
  97. Kariotoglou, D.M.; Mastronicolis, S.K. Phosphonolipids in the mussel Mytilus galloprovincialis. Z. Naturforsch. 1998, 53, 888–896. [Google Scholar] [CrossRef]
  98. Kariotoglou, D.M.; Mastronicolis, S.K. Sphingophosphonolipid molecular species from edible mollusks and a jellyfish. Comp. Biochem. Physiol. Part. B Biochem. Mol. Biol. 2003, 136, 27–44. [Google Scholar] [CrossRef]
  99. Losito, I.; Facchini, L.; Catucci, R.; Calvano, C.D.; Cataldi, T.R.I.; Palmisano, F. Tracing the Thermal History of Seafood Products through Lysophospholipid Analysis by Hydrophilic Interaction Liquid Chromatography(-)Electrospray Ionization Fourier Transform Mass Spectrometry. Molecules 2018, 23, 2212. [Google Scholar] [CrossRef] [PubMed]
  100. Le Grand, F.; Kraffe, E.; Marty, Y.; Donaghy, L.; Soudant, P. Membrane phospholipid composition of hemocytes in the Pacific oyster Crassostrea gigas and the Manila clam Ruditapes philippinarum. Comp. Biochem. Physiol. A Mol. Integr. Physiol. 2011, 159, 383–391. [Google Scholar] [CrossRef]
  101. Vacaru, A.M.; van den Dikkenberg, J.; Ternes, P.; Holthuis, J.C. Ceramide phosphoethanolamine biosynthesis in Drosophila is mediated by a unique ethanolamine phosphotransferase in the Golgi lumen. J. Biol. Chem. 2013, 288, 11520–11530. [Google Scholar] [CrossRef] [PubMed]
  102. Pauletto, M.; Milan, M.; de Sousa, J.T.; Huvet, A.; Joaquim, S.; Matias, D.; Leitao, A.; Patarnello, T.; Bargelloni, L. Insights into molecular features of Venerupis decussata oocytes: A microarray-based study. PLoS ONE 2014, 9, e113925. [Google Scholar] [CrossRef]
  103. Tilly, J.L.; Kolesnick, R.N. Sphingolipid signaling in gonadal development and function. Chem. Phys. Lipids 1999, 102, 149–155. [Google Scholar] [CrossRef] [PubMed]
  104. Vance, D.E. Glycerolipids biosynthesis in eukaryotes. In Biochemistry of Lipids, Lipoproteins and Membranes; Vance, D.E., Vance, J.E., Eds.; Elsevier: Amsterdam, The Netherlands, 1996; pp. 153–180. [Google Scholar]
  105. Zhu, J.; Hu, N.; Xiao, Y.; Lai, X.; Wang, L.; Song, Y. Characterization of Ovarian Lipid Composition in the Largemouth Bronze Gudgeon (Coreius guichenoti) at Different Development Stages. Fishes 2024, 9, 291. [Google Scholar] [CrossRef]
  106. Fadok, V.A.; Bratton, D.L.; Rose, D.M.; Pearson, A.; Ezekewitz, R.A.B.; Henson, P.M. A receptor for phosphatidylserine-specific clearance of apoptotic cells. Nature 2000, 405, 85–90. [Google Scholar] [CrossRef]
  107. Segawa, K.; Nagata, S. An Apoptotic ‘Eat Me’ Signal: Phosphatidylserine Exposure. Trends Cell Biol. 2015, 25, 639–650. [Google Scholar] [CrossRef]
  108. Gervais, O.; Renault, T.; Arzul, I. Induction of apoptosis by UV in the flat oyster, Ostrea edulis. Fish. Shellfish. Immunol. 2015, 46, 232–242. [Google Scholar] [CrossRef]
  109. Meehan, T.L.; Serizier, S.B.; Kleinsorge, S.E.; McCall, K. Analysis of phagocytosis in the Drosophila ovary. In Oogenesis: Methods and Protocols; Nezis, I.P., Ed.; Springer: New York, NY, USA, 2016; pp. 79–95. [Google Scholar] [CrossRef]
  110. Serizier, S.B.; McCall, K. Scrambled eggs: Apoptotic cell clearance by non-professional phagocytes in the Drosophila ovary. Front. Immunol. 2017, 8, 1642. [Google Scholar] [CrossRef]
  111. Mostafa, S.; Nader, N.; Machaca, K. Lipid Signaling During Gamete Maturation. Front. Cell Dev. Biol. 2022, 10, 814876. [Google Scholar] [CrossRef]
  112. Soudant, P.; Marty, Y.; Moal, J.; Samain, J.F. Separation of major polar lipids in Pecten maximus by highperformance liquid chromatography and subsequent determination of their fatty acids using gas chromatography. J. Chromatogr. B 1995, 673, 15–26. [Google Scholar] [CrossRef] [PubMed]
  113. Laudicella, V.A.; Beveridge, C.; Carboni, S.; Franco, S.C.; Doherty, M.K.; Long, N.; Mitchell, E.; Stanley, M.S.; Whitfield, P.D.; Hughes, A.D. Lipidomics analysis of juveniles’ blue mussels (Mytilus edulis L. 1758), a key economic and ecological species. PLoS ONE 2020, 15, e0223031. [Google Scholar] [CrossRef]
  114. Rey, F.; Alves, E.; Melo, T.; Domingues, P.; Queiroga, H.; Rosa, R.; Domingues, M.R.; Calado, R. Unravelling polar lipids dynamics during embryonic development of two sympatric brachyuran crabs (Carcinus maenas and Necora puber) using lipidomics. Sci. Rep. 2015, 5, 14549. [Google Scholar] [CrossRef] [PubMed]
  115. Whyte, J.N.C.; Boume, N.; Hodgson, C.A. Assessment of biochemical composition and energy reserves in larvae of the scallop Patinopecten yessoensis. J. Exp. Mar. Biol. Ecol. 1987, 113, 113–124. [Google Scholar] [CrossRef]
  116. Yamashita, A.; Hayashi, Y.; Nemoto-Sasaki, Y.; Ito, M.; Oka, S.; Tanikawa, T.; Waku, K.; Sugiura, T. Acyltransferases and transacylases that determine the fatty acid composition of glycerolipids and the metabolism of bioactive lipid mediators in mammalian cells and model organisms. Prog. Lipid Res. 2014, 53, 18–81. [Google Scholar] [CrossRef] [PubMed]
  117. Johnstone, J. The spawning of the mussel (Mytilus edulis). Proceeding Trans. Livrpool Biol. Soc. 1898, 13, 104–121. [Google Scholar]
  118. Cuevas, N.; Zorita, I.; Costa, P.M.; Franco, J.; Larreta, J. Development of histopathological indices in the digestive gland and gonad of mussels: Integration with contamination levels and effects of confounding factors. Aquat. Toxicol. 2015, 162, 152–164. [Google Scholar] [CrossRef]
Figure 1. Volcano and orthogonal partial least squares discriminant analysis (OPLS-DA) for discrimination between ripe and non-ripe blue mussel ovaries (BMOs). (A) Volcano plot; NS—not significant; FC—fold change; FDR p < 0.05—false discovery rate (FDR) p > 0.05. (B) OPLS-DA score plot; yellow circles: not ripe BMOs, red triangle ripe BMOs; (C) OPLS-DA Sigma plot. (A) Computed via ‘MetaboAnalystR’ [67] and plotted via ‘EnhancedVolcano’ package [71]. (B,C) Computed via ‘MetaboAnalystR’ and plotted via ‘ggplot2’.
Figure 1. Volcano and orthogonal partial least squares discriminant analysis (OPLS-DA) for discrimination between ripe and non-ripe blue mussel ovaries (BMOs). (A) Volcano plot; NS—not significant; FC—fold change; FDR p < 0.05—false discovery rate (FDR) p > 0.05. (B) OPLS-DA score plot; yellow circles: not ripe BMOs, red triangle ripe BMOs; (C) OPLS-DA Sigma plot. (A) Computed via ‘MetaboAnalystR’ [67] and plotted via ‘EnhancedVolcano’ package [71]. (B,C) Computed via ‘MetaboAnalystR’ and plotted via ‘ggplot2’.
Lipidology 02 00005 g001
Figure 2. Backward purging process to select the most important features from the top 5% of RF-ranked molecules. (A) RFclass; (B) RFreg. Computed via ‘RandomForest’ package [76] and plotted via ‘ggplot2’ package.
Figure 2. Backward purging process to select the most important features from the top 5% of RF-ranked molecules. (A) RFclass; (B) RFreg. Computed via ‘RandomForest’ package [76] and plotted via ‘ggplot2’ package.
Lipidology 02 00005 g002
Figure 3. Venn diagram showing the comparison between important features evidenced by the four different supervised statistical models. Obtained from https://bioinfogp.cnb.csic.es/tools/venny/ (accessed on 12 November 2024).
Figure 3. Venn diagram showing the comparison between important features evidenced by the four different supervised statistical models. Obtained from https://bioinfogp.cnb.csic.es/tools/venny/ (accessed on 12 November 2024).
Lipidology 02 00005 g003
Figure 4. Lipid ontology (LiOn) term enrichment analysis for important lipids in distinction between ripe and non-ripe blue mussel ovaries (BMOs). (A) Heatmap plot of principal component analysis of LiOn terms. Complete was used as the cluster algorithm, and Euclidean distance as the distance measure. (B) LiOn terms ranked in the comparison between non-ripe and ripe females. LiOn is an ontology database containing information related to lipid metabolism linked with a subset of lipid species. LiOn terms are ordered according to molecules’ p-values (t-test), and reported terms significantly different for FDR adjusted [72] p-value (q values) are in red. (A) Computed and plotted via LiOn-PCA heatmap module [78] and (B) via LiOn enrichment analysis module [77].
Figure 4. Lipid ontology (LiOn) term enrichment analysis for important lipids in distinction between ripe and non-ripe blue mussel ovaries (BMOs). (A) Heatmap plot of principal component analysis of LiOn terms. Complete was used as the cluster algorithm, and Euclidean distance as the distance measure. (B) LiOn terms ranked in the comparison between non-ripe and ripe females. LiOn is an ontology database containing information related to lipid metabolism linked with a subset of lipid species. LiOn terms are ordered according to molecules’ p-values (t-test), and reported terms significantly different for FDR adjusted [72] p-value (q values) are in red. (A) Computed and plotted via LiOn-PCA heatmap module [78] and (B) via LiOn enrichment analysis module [77].
Lipidology 02 00005 g004
Figure 5. Receiving operating characteristic (ROC) curves and area under the curve (AUC) for CerPE(40:2). (A) ROC curve reporting the AUC, the 95% CI for AUC, and the cut-off threshold with values of sensitivity and specificity; the diagonal line marks a 0.5 AUC (no effect). (B) Boxplot reporting the Pareto-scaled variable distribution, with the blue horizontal intercept indicating the cut-off point; the box includes the observation between the 1st (25th percentile) and 3rd (75th percentile) quartile, the whiskers represent values between ±1.5 × interquartile range (IQR), observations over ±1.5 × IQR are reported as outliers (black dots), and violin shapes represent the variable distribution. (A) Computed and plotted via ‘MetaboAnalystR’ [67]; (B) plotted via ‘ggplot2’.
Figure 5. Receiving operating characteristic (ROC) curves and area under the curve (AUC) for CerPE(40:2). (A) ROC curve reporting the AUC, the 95% CI for AUC, and the cut-off threshold with values of sensitivity and specificity; the diagonal line marks a 0.5 AUC (no effect). (B) Boxplot reporting the Pareto-scaled variable distribution, with the blue horizontal intercept indicating the cut-off point; the box includes the observation between the 1st (25th percentile) and 3rd (75th percentile) quartile, the whiskers represent values between ±1.5 × interquartile range (IQR), observations over ±1.5 × IQR are reported as outliers (black dots), and violin shapes represent the variable distribution. (A) Computed and plotted via ‘MetaboAnalystR’ [67]; (B) plotted via ‘ggplot2’.
Lipidology 02 00005 g005
Table 1. Lipid species (AUC > 0.8) markers of gonad ripeness identified by the supervised statistical approaches in blue mussel ovary (BMO) and ranked according to receiving operating curves (ROCs) and area under the curve (AUC). Isomeric lipid molecules resolved by reverse phase separation (same exact mass of 5 Δppm, but different retention time) are distinguished with a lowercase letter after the lipid ID. Cut-off: value of the variable that maximises sensitivity and specificity; Sens.: sensitivity of a variable, true positive ratio at the cut-off value; Spec.: specificity of the variable (true negative ratio) at the cut-off value; p-value: Wilcoxon Rank test with p-value adjusted for false discovery rate [72]; FC: fold change. The complete set of variables highlighted by statistical methods is available in Table A1. Computed via R package ‘MetaboanalysR’ [67].
Table 1. Lipid species (AUC > 0.8) markers of gonad ripeness identified by the supervised statistical approaches in blue mussel ovary (BMO) and ranked according to receiving operating curves (ROCs) and area under the curve (AUC). Isomeric lipid molecules resolved by reverse phase separation (same exact mass of 5 Δppm, but different retention time) are distinguished with a lowercase letter after the lipid ID. Cut-off: value of the variable that maximises sensitivity and specificity; Sens.: sensitivity of a variable, true positive ratio at the cut-off value; Spec.: specificity of the variable (true negative ratio) at the cut-off value; p-value: Wilcoxon Rank test with p-value adjusted for false discovery rate [72]; FC: fold change. The complete set of variables highlighted by statistical methods is available in Table A1. Computed via R package ‘MetaboanalysR’ [67].
Lipid ID 1AUC95% CIp-Value 2FCVolcanoOPLS-DARFClassRFregNot Ripe
Average ± CI (95%)
Ripe
Average ± CI (95%)
CAEP(42:2)0.8080.671–0.9253.63 × 10−41.57 XXX47.59 ± 9.0620.67 ± 7.5
CAEP(42:3)0.8040.683–0.9169 × 10−33.13X XX36.95 ± 11.2511.19 ± 6.08
CerPE(34:3)a0.8450.729–0.9392.08 × 10−30.98 XXX95.62 ± 20.8448.43 ± 8.13
CerPE(35:3/2)0.8740.771–0.9542.08 × 10−31.08 XXX132.82 ± 29.8255.95 ± 7.87
CerPE(40:1)0.8060.668–0.933.62 × 10−31.17 XXX34.15 ± 7.0213.71 ± 5.25
CerPE(40:2)0.9050.799–0.9813.04 × 10−41.48XXXX46.4 ± 7.8817.69 ± 4.48
LPC(22:2)0.8060.673–0.9227.42 × 10−3−1.62 X 115.47 ± 39.6308.4 ± 101.03
PA(38:6)0.8150.69–0.9170.0235.66X XX12.45 ± 6.381.46 ± 1.54
PA(40:6)0.8610.749–0.9587.42 × 10−32.85XXXX33.14 ± 12.058.31 ± 3.64
PA(44:4)0.8110.685–0.9250.014−1.75 XX102.52 ± 41.73226.25 ± 58.24
PA(O-38:6/P-38:5)0.8510.735–0.942.62 × 10−32.12XXXX46.5 ± 13.0915.64 ± 4.86
PC(28:0)0.8420.732–0.9393.62 × 10−31.35XXXX426.29 ± 119.56140.95 ± 36.64
PC(30:0)0.8480.717–0.9432.36 × 10−31.37XXXX1149.26 ± 291.11396.62 ± 108.82
PC(O-30:0)0.8030.672–0.9270.0121.48XXX 862.53 ± 256.36318.65 ± 112.49
PC(O-31:0)0.80.664–0.9020.0141.37 XXX68.9 ± 21.2541.64 ± 9.75
PC(O-32:0)0.8020.668–0.9190.0161.35XXXX923.78 ± 291.49351.64 ± 111.32
PC(O-46:10/P-46:9)a0.8110.68–0.9152.25 × 10−3−1.42XXXX145.61 ± 41.99341.56 ± 83.65
PE(O-34:3/P-34:2)0.8280.717–0.9420.0590.98 XX53.51 ± 20.0124.5 ± 4.84
PE(O-36:1/P-36:0)b0.8680.748–0.9622.66 × 10−31.63XXXX65.23 ± 19.6521.09 ± 4.88
PE(O-36:2/P-36:1)b0.8480.719–0.9493.62 × 10−30.96 XXX94.97 ± 22.7647.13 ± 7.8
PE(O-36:2/P-36:1)c0.8750.772–0.9482.08 × 10−30.97 XX251.77 ± 50.85121.82 ± 15.8
PE(O-36:3/P-36:2)0.8510.737–0.957.42 × 10−30.61 XX308.04 ± 55.87193.04 ± 19.3
PE(O-37:2/P-37:1)0.8580.749–0.9481.32 × 10−30.99 XXX231.1 ± 39.56121.75 ± 21.86
PE(O-37:3/P-37:2)0.8450.719–0.9382.08 × 10−30.51 XXX428.34 ± 49.7300.34 ± 26.36
PE(O-37:4/P-37:3)0.8680.752–0.9643.26 × 10−30.74 XXX84.17 ± 14.0550.25 ± 6.82
PE(O-39:3/P-39:2)0.80.692–0.9053.62 × 10−30.45 XX360.03 ± 39.86264.36 ± 23.89
PG(34:1)0.860.735–0.967.29 × 10−32.19XXXX25.61 ± 8.87.32 ± 2.83
PS(36:2)0.810.65–0.9133.0 × 10−37.06X XX16.21 ± 10.110.14 ± 0.22
PS(38:5)0.8940.79–0.9621.32 × 10−31.61XXXX215.46 ± 51.566.38 ± 17.26
PS(38:6)a0.840.728–0.9353.62 × 10−31.23 XXX164.04 ± 40.3966.94 ± 16.21
PS(39:6)0.8230.695–0.9260.0331.21 XX96.45 ± 35.1839.88 ± 10.52
PS(40:2)0.8280.72–0.9267.94 × 10−30.75 XX157.85 ± 32.7791.97 ± 13.65
PS(40:5)0.820.691–0.925.49 × 10−31.52 X41.86 ± 11.0518.17 ± 5.55
PS(40:6)0.8890.785–0.9812.08 × 10−31.06 XXX329.08 ± 66.25155.36 ± 27.89
PS(O-38:5/P-38:4)0.8750.777–0.9571.29 × 10−30.82 XXX163.08 ± 24.4494.22 ± 13.23
PS(O-38:6/P-38:5)0.8550.732–0.9562.36 × 10−30.99 XXX361.75 ± 74.27168.69 ± 30.8
1 Isomeric lipid molecules resolved by reverse-phase separation (same exact mass of 5 Δppm, but different retention time) are differentiated with a lowercase letter after the lipid ID. 2 FDR-adj p-value [72].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Laudicella, V.A.; Carboni, S.; De Vittor, C.; Whitfield, P.D.; Doherty, M.K.; Hughes, A.D. Integration of Global Lipidomics and Gonad Histological Analysis via Multivariate Chemometrics and Machine Learning: Identification of Potential Lipid Markers of Ovarian Development in the Blue Mussel (Mytilus edulis). Lipidology 2025, 2, 5. https://doi.org/10.3390/lipidology2010005

AMA Style

Laudicella VA, Carboni S, De Vittor C, Whitfield PD, Doherty MK, Hughes AD. Integration of Global Lipidomics and Gonad Histological Analysis via Multivariate Chemometrics and Machine Learning: Identification of Potential Lipid Markers of Ovarian Development in the Blue Mussel (Mytilus edulis). Lipidology. 2025; 2(1):5. https://doi.org/10.3390/lipidology2010005

Chicago/Turabian Style

Laudicella, Vincenzo Alessandro, Stefano Carboni, Cinzia De Vittor, Phillip D. Whitfield, Mary K. Doherty, and Adam D. Hughes. 2025. "Integration of Global Lipidomics and Gonad Histological Analysis via Multivariate Chemometrics and Machine Learning: Identification of Potential Lipid Markers of Ovarian Development in the Blue Mussel (Mytilus edulis)" Lipidology 2, no. 1: 5. https://doi.org/10.3390/lipidology2010005

APA Style

Laudicella, V. A., Carboni, S., De Vittor, C., Whitfield, P. D., Doherty, M. K., & Hughes, A. D. (2025). Integration of Global Lipidomics and Gonad Histological Analysis via Multivariate Chemometrics and Machine Learning: Identification of Potential Lipid Markers of Ovarian Development in the Blue Mussel (Mytilus edulis). Lipidology, 2(1), 5. https://doi.org/10.3390/lipidology2010005

Article Metrics

Back to TopTop