Next Article in Journal
Frontiers in Antimicrobial Materials
Next Article in Special Issue
O-Glycosylation Changes in Serum Immunoglobulin G Are Associated with Inflammation Development in Advanced Endometriosis
Previous Article in Journal
Genome-Wide Identification of Eucalyptus Heat Shock Transcription Factor Family and Their Transcriptional Analysis under Salt and Temperature Stresses
Previous Article in Special Issue
miR 31-3p Has the Highest Expression in Cesarean Scar Endometriosis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Bioinformatics Approach to MicroRNA-Sequencing Analysis Based on Human Saliva Samples of Patients with Endometriosis

1
Department of Obstetrics and Reproductive Medicine, Hospital Tenon, Sorbonne University, 4 rue de la Chine, 75020 Paris, France
2
Clinical Research Group (GRC) Paris 6: Endometriosis Expert Center (C3E), Sorbonne University (GRC6 C3E SU), 75020 Paris, France
3
Cancer Biology and Therapeutics, Centre de Recherche Saint-Antoine (CRSA), Sorbonne University, INSERM UMR_S_938, 75020 Paris, France
4
Ziwig Health, 19 rue Reboud, 69003 Lyon, France
5
Paris Brain Institute-Institut du Cerveau-ICM, Sorbonne University, Inserm U1127, CNRS UMR 7225, AP-HP-Hôpital Pitié-Salpêtrière, 75013 Paris, France
6
Gentoyping and Sequencing Core Facility, iGenSeq, Institut du Cerveau et de la Moelle Épinière, ICM, Hôpital Pitié-Salpêtrière, 47-83 Boulevard de l’Hôpital, 75013 Paris, France
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(14), 8045; https://doi.org/10.3390/ijms23148045
Submission received: 3 June 2022 / Revised: 16 July 2022 / Accepted: 16 July 2022 / Published: 21 July 2022
(This article belongs to the Special Issue Molecular Research on Endometriosis)

Abstract

:
Endometriosis, defined by the presence of endometrium-like tissue outside the uterus, affects 2–10% of the female population, i.e., around 190 million women, worldwide. The aim of the prospective ENDO-miRNA study was to develop a bioinformatics approach for microRNA-sequencing analysis of 200 saliva samples for miRNAome expression and to test its diagnostic accuracy for endometriosis. Among the 200 patients, 76.5% (n = 153) had confirmed endometriosis and 23.5% (n = 47) had no endometriosis (controls). Small RNA-seq of 200 saliva samples yielded ~4642 M raw sequencing reads (from ~13.7 M to ~39.3 M reads/sample). The number of expressed miRNAs ranged from 1250 (outlier) to 2561 per sample. Some 2561 miRNAs were found to be differentially expressed in the saliva samples of patients with endometriosis compared with the control patients. Among these, 1.17% (n = 30) were up- or downregulated. Among these, the F1-score, sensitivity, specificity, and AUC ranged from 11–86.8%, 5.8–97.4%, 10.6–100%, and 39.3–69.2%, respectively. Here, we report a bioinformatic approach to saliva miRNA sequencing and analysis. We underline the advantages of using saliva over blood in terms of ease of collection, reproducibility, stability, safety, non-invasiveness. This report describes the whole saliva transcriptome to make miRNA quantification a validated, standardized, and reliable technique for routine use. The methodology could be applied to build a saliva signature of endometriosis.

1. Introduction

MicroRNAs (miRNAs) are small, highly conserved non-coding RNAs with a length of about 22 nucleotides which bind to the 3′-untranslated region (3′-UTR) of target messenger RNAs (mRNAs), thus regulating gene expression post-transcriptionally through RNA degradation and/or translational inhibition [1,2]. Schematically, miRNA biosynthesis involves several steps: (i) they are first transcribed from genes in intronic regions of coding or non-coding transcripts, or coded from exons under the action of the RNA polymerase II, generating hundreds of duplex nucleotide-long primary miRNAs (pri-miRNA); (ii) the pri-miRNA is subsequently cleaved by a complex formed by an RNase III enzyme, Drosha, RNA binding cofactor and Pasha to generate precursor miRNA (pre-miRNA); and (iii) then, the pre-miRNAs are transported from the nucleus to the cytoplasm using exportin 5 where the duplex is cleaved by Dicer and helicase to form mature miRNAs [2,3]. The miRNAs are subsequently incorporated into an RNA silencing complex (RISC) that regulates post-translational modifications through binding to the 3′ untranslated region (3′ UTR) of the target messenger-RNA (mRNA). Finally, the miRNAs are released from the cells into circulation using various carriers such as Argonaute, nucleophosmin 1, high-density lipoproteins or extracellular vesicles (exosomes) with a distribution in human fluids where they can be detected [4,5].
Numerous studies have demonstrated the relevance of evaluating miRNA expression in cancers and benign pathologies to provide insights into the molecular mechanisms of disease onset and progression [3,6,7,8,9,10]. For these reasons, sequencing has been applied to biomarker discovery for a variety of diseases, such as endometriosis, but with some limitations especially for saliva samples. Indeed, several sources of errors can be introduced during a sequencing study such as (i) an underpowered cohort [3,11,12], (ii) sample extraction, (iii) library preparation (12, 15–18) [7,13,14,15], and (iv) sequencing technique. Overall, these errors can lead to over- or underestimation of the molecule expression or a one-size-fits-all approach with inadequate analysis [8,9,12,14,15,16].
Therefore, the goal of the prospective ENDO-miRNA study was to develop a bioinformatics approach for microRNA-sequencing analysis of 200 saliva samples for miRNAome expression and to test its diagnostic accuracy for endometriosis.

2. Material and Methods

2.1. Study Population

We used data from the prospective “ENDOmiARN” study (ClinicalTrials.gov Identifier: NCT04728152). Data collection and analysis were carried out under Research Protocol n° ID RCB: 2020-A03297-32. We obtained signed informed consent from all participants in the study. The experimental protocol was approved by Ethics committee le comité de protection des personnes (C.P.P.) Sud-Ouest et Outre-Mer 1 (CPP 1-20-095 ID 10476).
The ENDOmiARN study included 200 saliva samples obtained from patients with chronic pelvic pain suggestive of endometriosis. All the samples were collected between January 2021 and June 2021. Analysis was performed blinded to the surgical and imaging findings. The patients with endometriosis were stratified according to the revised American Society of Reproductive Medicine (rASRM) classification [17]. The main characteristics of the patients included in the ENDOmiARN study are displayed in Table 1.

2.2. Saliva Sample Collection

The saliva samples (2 mL) were collected in an all-in-one system including a nucleic acid stabilizing solution for collection, stabilization and transportation (OME 505, DNA Genotek Inc., 2 Beaverbrook Road Ottawa, ON, Canada K2K 1L1) using an at-home kit (https://www.dnagenotek.com/row/products/collection-microbiome/omnigene-oral/OME-505.html, accessed on 1 January 2021). Subjects were asked to refrain from eating, drinking, smoking, or chewing gum for 30 min before the saliva sample was taken. All the samples were stored at room temperature prior to shipping.

2.3. RNA Sample Extraction, Preparation and Quality Control

RNA was isolated from each saliva sample using the miRNeasy Kit (Qiagen, Inc., Germantown, MD, USA) according to the manufacturer’s instructions [6,8,9]. In accordance with DNA Genotek process of extraction, a systematic centrifugation was performed at 13,300× g for 3 min. RNA quality was assessed using the Agilent Technologies TapeStation 2200. RNA-sequencing libraries were prepared using the QIAseq miRNA Library Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. Samples were indexed in batches of 96, with a targeted sequencing depth of 17 million reads per sample. Sequencing was performed using 100 base single-end reads, using an Novaseq6000 sequencer (Illumina, San Diego, CA, USA) [14,18]. The process used is the one summarized in the previously published work by Potla et al. [13].

3. Bioinformatics

3.1. Raw Data Preprocessing (Raw, Filtered, Aligned Reads) and Quality Control

Sequencing reads were processed using the data processing pipeline. FastQ files were trimmed to remove adapter sequences using Cutadapt version v.1.18 and were aligned using Bowtie version 1.1.1 to the following transcriptome databases: the human reference genome available from NCBI (https://www.ncbi.nlm.nih.gov/genome/guide/human/, accessed on 1 January 2021), and miRBase21 (miRNAs) using the MirDeep2 v0.1.0 package. The raw sequencing data quality was assessed using FastQC software v0.11.7 [14,15,19,20,21].

3.2. Differential Expression Analysis of miRNA

miRNA expression was quantified by miRDeep2 v0.1.0 [22]. Differential expression tests were then conducted in DESeq2 for miRNAs with read counts in ≥1 of the samples. DESeq2 V1.20 integrates methodological advances with several novel features to facilitate a more quantitative analysis of comparative RNA-seq data using shrinkage estimators for dispersion and fold change [23]. The resulting matrix was filtered for expressed miRNAs and normalized using Z-score normalization [24]. miRNAs were considered as differentially expressed if the absolute value of log2-fold change was >1.5 (upregulated) and <0.5 (downregulated). The p value adjusted for multiple testing was <0.05 [23].

3.3. miRNA Diagnostic Accuracy

To evaluate the diagnostic accuracy of each miRNA biomarker, sensitivity, specificity, an ROC analysis was performed, and the ROC AUC was calculated [25,26].
Additional statistical analysis was based on the Chi2 test as appropriate for categorical variables. Values of p < 0.05 were considered to denote significant differences. Data were managed with an Excel database (Microsoft, Redmond, WA, USA) and analyzed using R 2.15 software, available online (http://cran.r-project.org/, accessed on 1 January 2021).

4. Results

4.1. Description of the ENDOmiARN Cohort

Among the 200 patients, 76.5% (n = 153) had confirmed endometriosis and 23.5% (n = 47) had no endometriosis (controls). In the endometriosis group, 52% (80) of the patients had rASRM stages I–II and 48% (73) had stage III–IV. The control group consisted of various benign pathologies with 51% (24) of the women having no abnormality. These were defined as “discordant” (or complex) patients corresponding to women with symptoms suggestive of endometriosis without clinical or MRI features of endometriosis and no endometriosis lesions discovered during laparoscopic inspection (Table 1).

4.2. Global Overview of the Saliva miRNAome

Small RNA-seq of 200 saliva samples yielded ~4 642 M raw sequencing reads (from ~13.7 M to ~39.3 M reads/sample). Pre-filtering and filtering steps retained 70% (~3205 M) of initial raw reads. The majority of filtered reads were of short read length. Quantification of filtered reads and identification of known miRNAs yielded ~190 M sequences to be mapped to 2561 known miRNAs from miRBase v21. The number of expressed miRNAs ranged from 1250 (outlier) to 2561 per sample. The distribution of expressed miRNAs in the 200 saliva samples and the overall composition of processed reads are shown in Figure 1A,B and Figure 2.

4.3. miRNA Expression in Patients with and without Endometriosis

Of the miRNAs, 2561 were found to be differentially expressed in the saliva samples of patients with endometriosis, compared with the control patients. Among these, 1.17% (n = 30) were up- or downregulated. Figure 3 shows a volcano plot of the miRNAs expressed in endometriosis. Among the 30 regulated miRNAs, only three (hsa-miR-34c-5p, hsa-miR-4677-3p, hsa-miR-655-5p) had an AUC > 0.6. The top 10 differentially expressed miRNA patterns in the endometriosis and control are reported in Figure 4.

4.4. Diagnostic Accuracy of Regulated miRNAs

The diagnostic metrics for endometriosis in the differentially expressed miRNAs in the saliva samples (n = 30) are reported in Table 2. Among these, the F1-score, sensitivity, specificity, and AUC ranged from 11–86.8%, 5.8–97.4%, 10.6–100%, and 39.3–69.2%, respectively.
For AUC criteria, 90% (n = 27), and 10% (n = 3) had a value ranging between 36.3–59% and ≥60%, respectively.
For the F1-score, 80% (n = 24) and 20% (n = 6) had a value ranging between 0–79%, and ≥80%, respectively
For sensitivity, 80% (n = 24) and 20% (n = 6) had a value ranging between 0–79%, and ≥80%, respectively
For specificity, 70% (n = 21) and 10% (n = 9) had a value ranging between 0–79%, and ≥80%, respectively. The clustering of the accuracy values is reported in Figure 5.

5. Discussion

To the best of our knowledge, this is the first report detailing the miRNAome of 200 saliva samples from patients with and without endometriosis included in a prospective study: the ENDOmiARN study [3,11,16,27]. In addition, we report a bioinformatics approach to saliva miRNA sequencing and analysis and underline the advantages of using saliva over blood in terms of ease of collection, reproducibility, stability, safety, non-invasiveness, and cost-effectiveness [6,8,10,22,28,29,30,31].
Preliminary results about the use of saliva RNAs as diagnostic biomarkers have previously been reported, mainly for cancer [6,8,32], systemic disease, and forensic casework [6,9,28,33]. However, the quality of the methodology and yield issues of these studies overall are debatable [34]. In a recent literature review of miRNAs for the non-invasive diagnosis of endometriosis, Monnaka et al. underlined that none of the 449 reports investigated miRNAs in saliva [11]. Therefore, since (i) no scientifically proven salivary biomarkers for endometriosis have been reported, and (ii) the applicability of such biomarkers has been poorly explored, the concept of extracting and identifying miRNAs from saliva samples for the reliable identification of endometriosis is challenging [35,36]. The main obstacle to using miRNAs is their stability and susceptibility to degradation. This has always been an issue for mRNA-based gene expression analysis and a potential source of bias for reproducibility [28,34,37]. This point was highlighted, for example, for forensic routine applications using miRNA, because biological stains from forensic casework are often altered by ambient moisture and temperature, UV light, suboptimal environmental pH, which all have the potential to degrade the miRNA beyond usability [28]. In this setting, Patel et al. demonstrated that Oragene•RNA solution could preserve and stabilize RNA collected from saliva to produce high yields of good quality RNA for subsequent downstream applications and/or analyses [34]. The authors reported that the RNA yield remained fairly constant between matched samples from each donor when stored for 48 h at room temperature [34]. In addition, they explored the differences in the total RNA yield from donors over a 3-day period, but also sought to examine the potential differences in expression between commonly used mRNA and miRNA endogenous controls [34]. Although the total RNA from each donor varied over the days, probably due to bacterial RNA, the abundance of the mammalian RNA normalizers (snU6 small RNA, 18S rRNA, GAPDH mRNA and let-7b miRNA) remained stable [34]. In the current study, 200 saliva samples were collected according to the manufacturer’s guidelines (Oragene) and stored at room temperature prior to shipping and analysis. Interestingly, we found that the quantification of filtered reads and identification of miRNAs yielded ~190 M sequences to be mapped to 2561 known miRNAs. The total reads ranged from 13 to 39 million with a mean of 23 million. Among these, the miRNA reads ranged from 272,322 to 6 million with a mean of 949,893 (Figure 2A,B). These results are in concordance with previous reports demonstrating that the salivary transcriptome is abundant and stable, consisting of thousands of mRNAs and miRNAs [6,9,10,28,34,38]. In this setting, Courts et al. also confirmed that miRNAs are especially relevant because they are stable and easy to collect and analyze, and validated their use in standard forensic medicine [28]. Using the Oragene•RNA kit, we demonstrated (i) the stability and consistency of miRNA reads for the 200 samples whatever the conditions of sampling and transport, (ii) the reproducibility and efficiency of such techniques since all the 200 samples were usable for sequencing, and (iii) a routine bioinformatics approach. In the present study, diagnostic accuracies according to the F1-score, sensitivity, specificity and AUC ranged from 11–86.8%, 5.8–97.4%, 10.6–100%, and 39.3–69.2%, respectively. In addition, we identified 30 miRNAs up- and downregulated with a high heterogeneity in terms of accuracy.
Although the use of saliva for miRNA identification could be a potential non-invasive solution to overcome current barriers to the diagnosis of endometriosis, the critical step is the transition from expression data to candidate selection, which is always somewhat arbitrary. In this setting, salivary miRNAs have been reported to be of great interest as diagnostic biomarkers especially in cancer [6,10,32]. However, as there are no fixed rules about which criteria to apply to select a miRNA candidate, we developed a bioinformatics approach for miRNA accuracy: among the 2561 miRNAs identified, 30 were up- or downregulated, underpinning the use of new mathematical methods and artificial intelligence to overcome the limits of classic logistic regression. Indeed, in agreement with Lopez-Rincon et al., it is illusory to imagine that a few mi-RNAs could reflect the heterogeneity of a multifactorial disorder such as endometriosis, characterized by various phenotypes and for which the various pathways implicated in its genesis are poorly understood [7,15,39]. We thus used a new statistical tool, machine learning, to overcome the accuracy limitations and design a potential diagnostic signature [7,9,10,15,30].
In the present study, we analyzed 200 plasma samples for miRNA expression and diagnostic accuracy. However, there are several unsolved issues that might hinder the broad acceptance of a miRNA-based signature. The miRNAome, perhaps even more than the transcriptome, is highly context dependent, and it is conceivable that certain non- physiologic or pathologic conditions might alter the expression levels of miRNAs for body-fluid identification. It will therefore be necessary to test whether the expression of candidate miRNAs for body-fluid identification are influenced by biologic processes or conditions such as the menstrual cycle phase or previous hormonal treatment [12,40]. In this setting, Vanhie et al. and Moustafa al. reported no impact on miRNA expression according either to hormonal treatment or menstrual cycle phases in contrast to data obtained from endometrial biopsies [12,40]. This apparent discrepancy could be linked to the modalities of miRNA release into bodily fluids that could vary depending on the organ and the tumor. In the ENDOmiARN study, two different body fluids were assessed: serum and saliva. This choice was mostly driven by the need for stability in the miRNAs detected to provide a reliable diagnostic tool. Indeed, while several studies have observed differences in miRNA expression in tissues according to the menstrual phase, mainly at endometrial level [41,42], no such cyclic differences were observed in the plasma of healthy women [43]. One hypothesis is that changes in miRNA expression at the endometrium level regulate gene expression locally but are insufficient to cause detectable systemic changes [3]. The other reason to opt for saliva was its easy availability, including in a home self-sampling setting and including virgin patients not examined during gynecological appointments. Another issue is the variations of miRNA expression analysis according to the next-generation sequencing (NGS) technique used. Indeed, several different methods and devices for miRNA extraction, reverse transcription and quantification from NGS to microarray analysis have been advocated leading to differences in results [13,31,34]. However, as underlined by Agrawal et al., we believe that the standardized NGS procedure we describe here is optimal for endometriosis since it is currently the gold standard approach for profiling nucleic acid, including miRNAs [3]. In addition, miRNAs are just one of several classes of small, ncRNAs with regulatory functions, and there is no reason to exclude these RNAs from endometriosis analyses. Therefore, miRNA analysis may represent an interim strategy until more is known about other small RNAs, and once a comprehensive small-RNA analysis is available, it is likely to replace miRNA only analysis [44]. Eventually, our results require external validation supporting temporal and geographic validation of for mi RNA quantification and sequencing reproducibility; that is the goal of an ongoing study [45].

6. Conclusions

Endometriosis affects about 190 million women worldwide, representing a healthcare burden equivalent to diabetes [46]. Endometriosis is a representative example of a multifactorial and not completely understood, chronic disease. To understand the various signaling pathways involved in this complex disease, analysis of the entire miRNome currently available is mandatory. This report describes the whole saliva transcriptome to make miRNA quantification a validated, standardized, and reliable technique for routine use. The methodology could be applied to build a saliva signature of endometriosis and to solve other issues of this debilitating disorder—various clinical phenotypes, infertility-associated endometriosis—as well as to evaluate the potential theragnostic value of miRNA expression. Finally, beyond endometriosis, our methodology can be applied to other chronic diseases with the goal of developing a noninvasive, quick and reliable tool to improve diagnosis, management and to select patients according to therapeutic medical and/or surgical response.

Author Contributions

Methodology and design: S.B. and E.D.; data collection: S.B., Y.D., A.P., C.T. and E.D.; biologic data collection: Y.D., L.J. and D.B.; analysis: S.S., S.B. and E.D.; data interpretation: S.B., S.S., Y.D., A.P., C.T. and E.D.; manuscript writing: S.B., Y.D. and E.D. All authors have read and agreed to the published version of the manuscript.

Funding

Part of this work was funded by a grant from the Conseil Régional d’Ile de France (grant number EX024087) and from Ziwig, Inc.

Institutional Review Board Statement

The experimental protocol was approved by le comité de protection des personnes (C.P.P.) Sud-Ouest et Outre-Mer 1 (CPP 1-20-095 ID 10476).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The authors state that the data used are from the prospective ENDOmiARN study (ClinicalTrials.gov Identifier: NCT04728152). Data collection and analysis were carried out under Research Protocol n° ID RCB: 2020-A03297-32.

Acknowledgments

All authors would like to sincerely thank F. Neilson (matrixconsultants.fr) for her English revision of the manuscript.

Conflicts of Interest

S. Suisse is a former employee of Ziwig, Inc. The remaining authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript.

References

  1. Bartel, D.P. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 2004, 116, 281–297. [Google Scholar] [CrossRef] [Green Version]
  2. Bartel, D.P. MicroRNAs: Target recognition and regulatory functions. Cell 2009, 136, 215–233. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Agrawal, S.; Tapmeier, T.; Rahmioglu, N.; Kirtley, S.; Zondervan, K.; Becker, C. The miRNA Mirage: How Close Are We to Finding a Non-Invasive Diagnostic Biomarker in Endometriosis? A Systematic Review. Int. J. Mol. Sci. 2018, 19, 599. [Google Scholar] [CrossRef] [Green Version]
  4. Vickers, K.C.; Palmisano, B.T.; Shoucri, B.M.; Shamburek, R.D.; Remaley, A.T. MicroRNAs are transported in plasma and delivered to recipient cells by high-density lipoproteins. Nat. Cell Biol. 2011, 13, 423–433. [Google Scholar] [CrossRef] [Green Version]
  5. Turchinovich, A.; Weiz, L.; Langheinz, A.; Burwinkel, B. Characterization of extracellular circulating microRNA. Nucleic Acids Res. 2011, 39, 7223–7233. [Google Scholar] [CrossRef] [PubMed]
  6. Setti, G.; Pezzi, M.E.; Viani, M.V.; Pertinhez, T.A.; Cassi, D.; Magnoni, C.; Bellini, P.; Musolino, A.; Vescovi, P.; Meleti, M. Salivary MicroRNA for Diagnosis of Cancer and Systemic Diseases: A Systematic Review. Int. J. Mol. Sci. 2020, 21, E907. [Google Scholar] [CrossRef] [Green Version]
  7. Lopez-Rincon, A.; Martinez-Archundia, M.; Martinez-Ruiz, G.U.; Schoenhuth, A.; Tonda, A. Automatic discovery of 100-miRNA signature for cancer classification using ensemble feature selection. BMC Bioinform. 2019, 20, 480. [Google Scholar] [CrossRef] [Green Version]
  8. Kang, J.-W.; Eun, Y.-G.; Lee, Y.-C. Diagnostic Value of Salivary miRNA in Head and Neck Squamous Cell Cancer: Systematic Review and Meta-Analysis. Int. J. Mol. Sci. 2021, 22, 7026. [Google Scholar] [CrossRef]
  9. LaRocca, D.; Barns, S.; Hicks, S.D.; Brindle, A.; Williams, J.; Uhlig, R.; Johnson, P.; Neville, C.; Middleton, F.A. Comparison of serum and saliva miRNAs for identification and characterization of mTBI in adult mixed martial arts fighters. PLoS ONE 2019, 14, e0207785. [Google Scholar] [CrossRef] [Green Version]
  10. Rapado-González, Ó.; Majem, B.; Álvarez-Castro, A.; Díaz-Peña, R.; Abalo, A.; Suárez-Cabrera, L.; Gil-Moreno, A.; Santamaría, A.; López-López, R.; Muinelo-Romay, L.; et al. A Novel Saliva-Based miRNA Signature for Colorectal Cancer Diagnosis. J. Clin. Med. 2019, 8, E2029. [Google Scholar] [CrossRef] [Green Version]
  11. Monnaka, V.U.; Hernandes, C.; Heller, D.; Podgaec, S. Overview of miRNAs for the non-invasive diagnosis of endometriosis: Evidence, challenges and strategies. A systematic review. Einstein 2021, 19, eRW5704. [Google Scholar] [CrossRef] [PubMed]
  12. Moustafa, S.; Burn, M.; Mamillapalli, R.; Nematian, S.; Flores, V.; Taylor, H.S. Accurate diagnosis of endometriosis using serum microRNAs. Am. J. Obs. Gynecol. 2020, 223, 557.e1–557.e11. [Google Scholar] [CrossRef] [PubMed]
  13. Potla, P.; Ali, S.A.; Kapoor, M. A bioinformatics approach to microRNA-sequencing analysis. Osteoarthr. Cartil. Open 2021, 3, 100131. [Google Scholar] [CrossRef]
  14. Gyvyte, U.; Juzenas, S.; Salteniene, V.; Kupcinskas, J.; Poskiene, L.; Kucinskas, L.; Jarmalaite, S.; Stuopelyte, K.; Steponaitiene, R.; Hemmrich-Stanisak, G.; et al. MiRNA profiling of gastrointestinal stromal tumors by next-generation sequencing. Oncotarget 2017, 8, 37225–37238. [Google Scholar] [CrossRef] [PubMed]
  15. Lopez-Rincon, A.; Mendoza-Maldonado, L.; Martinez-Archundia, M.; Schönhuth, A.; Kraneveld, A.D.; Garssen, J.; Tonda, A. Machine Learning-Based Ensemble Recursive Feature Selection of Circulating miRNAs for Cancer Tumor Classification. Cancers 2020, 12, E1785. [Google Scholar] [CrossRef]
  16. Panir, K.; Schjenken, J.E.; Robertson, S.A.; Hull, M.L. Non-coding RNAs in endometriosis: A narrative review. Hum. Reprod Update 2018, 24, 497–515. [Google Scholar] [CrossRef] [Green Version]
  17. Canis, M.; Donnez, J.G.; Guzick, D.S.; Halme, J.K.; Rock, J.A.; Schenken, R.S.; Vernon, M.W. Revised American Society for Reproductive Medicine classification of endometriosis: 1996. Fertil Steril 1997, 67, 817–821. [Google Scholar] [CrossRef]
  18. Gyvyte, U.; Kupcinskas, J.; Juzenas, S.; Inciuraite, R.; Poskiene, L.; Salteniene, V.; Link, A.; Fassan, M.; Franke, A.; Kupcinskas, L.; et al. Identification of long intergenic non-coding RNAs (lincRNAs) deregulated in gastrointestinal stromal tumors (GISTs). PLoS ONE 2018, 13, e0209342. [Google Scholar] [CrossRef]
  19. Langmead, B.; Trapnell, C.; Pop, M.; Salzberg, S.L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10, R25. [Google Scholar] [CrossRef] [Green Version]
  20. Griffiths-Jones, S.; Saini, H.K.; van Dongen, S.; Enright, A.J. mirbase: Tools for microRNA genomics. Nucleic Acids Res. 2008, 36, D154–D158. [Google Scholar] [CrossRef] [Green Version]
  21. Gao, L.; Zhang, L. Construction and comprehensive analysis of a ceRNA network to reveal potential prognostic biomarkers for lung adenocarcinoma. BMC Cancer 2021, 21, 849. [Google Scholar] [CrossRef]
  22. Li, Q.; Liu, G.; Bao, Y.; Wu, Y.; You, Q. Evaluation and application of tools for the identification of known microRNAs in plants. Appl. Plant Sci. 2021, 9, e11414. [Google Scholar] [CrossRef]
  23. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef] [Green Version]
  24. Bargaje, R.; Hariharan, M.; Scaria, V.; Pillai, B. Consensus miRNA expression profiles derived from interplatform normalization of microarray data. RNA 2010, 16, 16–25. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Harrell, F.E.; Lee, K.L.; Mark, D.B. Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat. Med. 1996, 15, 361–387. [Google Scholar] [CrossRef]
  26. Steyerberg, E.W.; Eijkemans, M.J.; Harrell, F.E.; Habbema, J.D. Prognostic modelling with logistic regression analysis: A comparison of selection and estimation methods in small data sets. Stat. Med. 2000, 19, 1059–1079. [Google Scholar] [CrossRef]
  27. Nisenblat, V.; Sharkey, D.J.; Wang, Z.; Evans, S.F.; Healey, M.; Ohlsson Teague, E.M.C.; Print, C.G.; Robertson, S.A.; Hull, M.L. Plasma miRNAs Display Limited Potential as Diagnostic Tools for Endometriosis. J. Clin. Endocrinol. Metab. 2019, 104, 1999–2022. [Google Scholar] [CrossRef]
  28. Courts, C.; Madea, B. Specific Micro-RNA Signatures for the Detection of Saliva and Blood in Forensic Body-fluid Identification. J. Forensic Sci. 2011, 56, 1464–1470. [Google Scholar] [CrossRef]
  29. El-Mogy, M.; Lam, B.; Haj-Ahmad, T.A.; McGowan, S.; Yu, D.; Nosal, L.; Rghei, N.; Roberts, P.; Haj-Ahmad, Y. Diversity and signature of small RNA in different bodily fluids using next generation sequencing. BMC Genom. 2018, 19, 408. [Google Scholar] [CrossRef]
  30. Ghassemi, M.; Naumann, T.; Schulam, P.; Beam, A.L.; Chen, I.Y.; Ranganath, R. A Review of Challenges and Opportunities in Machine Learning for Health. AMIA Jt Summits Transl. Sci. Proc. 2020, 2020, 191–200. [Google Scholar]
  31. Alexiou, A.; Zisis, D.; Kavakiotis, I.; Miliotis, M.; Koussounadis, A.; Karagkouni, D.; Hatzigeorgiou, A.G. DIANA-mAP: Analyzing miRNA from Raw NGS Data to Quantification. Genes 2020, 12, 46. [Google Scholar] [CrossRef] [PubMed]
  32. Park, N.J.; Zhou, H.; Elashoff, D.; Henson, B.S.; Kastratovic, D.A.; Abemayor, E.; Wong, D.T. Salivary microRNA: Discovery, characterization, and clinical utility for oral cancer detection. Clin. Cancer Res. 2009, 15, 5473–5477. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Hanson, E.K.; Lubenow, H.; Ballantyne, J. Identification of forensically relevant body fluids using a panel of differentially expressed microRNAs. Anal. Biochem. 2009, 387, 303–314. [Google Scholar] [CrossRef] [PubMed]
  34. Patel, R.S.; Jakymiw, A.; Yao, B.; Pauley, B.A.; Carcamo, W.C.; Katz, J.; Cheng, J.Q.; Chan, E.K.L. High resolution of microRNA signatures in human whole saliva. Arch. Oral Biol. 2011, 56, 1506–1513. [Google Scholar] [CrossRef] [Green Version]
  35. Wingfield, M.; O’Herlihy, C.; Finn, M.M.; Tallon, D.F.; Fottrell, P.F. Follicular and luteal phase salivary progesterone profiles in women with endometriosis and infertility. Gynecol. Endocrinol. 1994, 8, 21–25. [Google Scholar] [CrossRef]
  36. Petrelluzzi, K.F.S.; Garcia, M.C.; Petta, C.A.; Grassi-Kassisse, D.M.; Spadari-Bratfisch, R.C. Salivary cortisol concentrations, stress and quality of life in women with endometriosis and chronic pelvic pain. Stress 2008, 11, 390–397. [Google Scholar] [CrossRef]
  37. Andreasen, D.; Fog, J.U.; Biggs, W.; Salomon, J.; Dahslveen, I.K.; Baker, A.; Mouritzen, P. Improved microRNA quantification in total RNA from clinical samples. Methods 2010, 50, S6–S9. [Google Scholar] [CrossRef]
  38. Chen, Y.; Wu, Y.; Yao, H.; Luo, H.; Lin, B.; Zhang, X.; Liang, X.; Sun, R.; Zhao, S.; Li, Y.; et al. miRNA Expression Profile of Saliva in Subjects of Yang Deficiency Constitution and Yin Deficiency Constitution. Cell Physiol Biochem. 2018, 49, 2088–2098. [Google Scholar] [CrossRef]
  39. Metselaar, P.I.; Mendoza-Maldonado, L.; Li Yim, A.Y.F.; Abarkan, I.; Henneman, P.; Te Velde, A.A.; Schönhuth, A.; Bosch, J.A.; Kraneveld, A.D.; Lopez-Rincon, A. Recursive ensemble feature selection provides a robust mRNA expression signature for myalgic encephalomyelitis/chronic fatigue syndrome. Sci. Rep. 2021, 11, 4541. [Google Scholar] [CrossRef]
  40. Vanhie, A.; D, O.; Peterse, D.; Beckers, A.; Cuéllar, A.; Fassbender, A.; Meuleman, C.; Mestdagh, P.; D’Hooghe, T. Plasma miRNAs as biomarkers for endometriosis. Hum. Reprod. 2019, 34, 1650–1660. [Google Scholar] [CrossRef]
  41. Kuokkanen, S.; Chen, B.; Ojalvo, L.; Benard, L.; Santoro, N.; Pollard, J.W. Genomic profiling of microRNAs and messenger RNAs reveals hormonal regulation in microRNA expression in human endometrium. Biol. Reprod. 2010, 82, 791–801. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Altmäe, S.; Martinez-Conejero, J.A.; Esteban, F.J.; Ruiz-Alonso, M.; Stavreus-Evers, A.; Horcajadas, J.A.; Salumets, A. MicroRNAs miR-30b, miR-30d, and miR-494 regulate human endometrial receptivity. Reprod. Sci. 2013, 20, 308–317. [Google Scholar] [CrossRef] [Green Version]
  43. Rekker, K.; Saare, M.; Roost, A.M.; Kaart, T.; Sõritsa, D.; Karro, H.; Sõritsa, A.; Simón, C.; Salumets, A.; Peters, M. Circulating miR-200-family micro-RNAs have altered plasma levels in patients with endometriosis and vary with blood collection time. Fertil. Steril. 2015, 104, 938–946. [Google Scholar] [CrossRef] [PubMed]
  44. Coenen-Stass, A.M.L.; Magen, I.; Brooks, T.; Ben-Dov, I.Z.; Greensmith, L.; Hornstein, E.; Fratta, P. Evaluation of methodologies for microRNA biomarker detection by next generation sequencing. RNA Biol. 2018, 15, 1133–1145. [Google Scholar] [CrossRef] [PubMed]
  45. ZIWIG Multicenter Validation of the Salivary miRNA Signature of Endometriosis—The ENDOmiARN Salive Test. Study. 2022. Available online: http://clinicaltrials.gov (accessed on 1 April 2022).
  46. Zondervan, K.T.; Becker, C.M.; Missmer, S.A. Endometriosis. N. Engl. J. Med. 2020, 382, 1244–1256. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (A) Distribution of expressed miRs in the 200 saliva samples; (B) distribution of expressed miRNAs in the samples by diagnosis.
Figure 1. (A) Distribution of expressed miRs in the 200 saliva samples; (B) distribution of expressed miRNAs in the samples by diagnosis.
Ijms 23 08045 g001
Figure 2. Overall composition of processed reads for saliva sample RNA reads = miRNAs + piRNAs + rRNAs + tRNAs + mRNAs + other; filtered reads = reads with no adapters + reads with low quality bases + reads too short; not characterized/mappable reads = mapped reads to GRCh38 that could not be characterized as a particular type; not characterized/not mappable reads = reads that could not be mapped.
Figure 2. Overall composition of processed reads for saliva sample RNA reads = miRNAs + piRNAs + rRNAs + tRNAs + mRNAs + other; filtered reads = reads with no adapters + reads with low quality bases + reads too short; not characterized/mappable reads = mapped reads to GRCh38 that could not be characterized as a particular type; not characterized/not mappable reads = reads that could not be mapped.
Ijms 23 08045 g002
Figure 3. Volcano plot of expressed miRNAs in saliva for endometriosis.
Figure 3. Volcano plot of expressed miRNAs in saliva for endometriosis.
Ijms 23 08045 g003
Figure 4. Small RNA-seq defines differentially expressed miRNAs in the saliva of endometriosis patients.
Figure 4. Small RNA-seq defines differentially expressed miRNAs in the saliva of endometriosis patients.
Ijms 23 08045 g004
Figure 5. Clustering of accuracy values. In blue: F1-Score; In orange: Sensitivity; In grey: Specificity.
Figure 5. Clustering of accuracy values. In blue: F1-Score; In orange: Sensitivity; In grey: Specificity.
Ijms 23 08045 g005
Table 1. Demographic characteristics of the patients included in the ENDOmiARN cohort. BMI: body mass index; rASRM: revised American Society for Reproductive Medicine.
Table 1. Demographic characteristics of the patients included in the ENDOmiARN cohort. BMI: body mass index; rASRM: revised American Society for Reproductive Medicine.
Controls
N = 47
Endometriosis
N = 153
Age years (mean ± SD)30.92 ± 13.7931.17 ± 10.780.19
BMI (body mass index) (mean ± SD)24.84 ± 11.1024.36 ± 8.380.52
rASRM classification -
- I–II-80 (52%)
- III–IV -73 (48%)
Control diagnoses
- No abnormality24 (51%)--
- Leiomyoma1 (2%)
- Cystadenoma5 (11%)
- Teratoma11 (23%)
- Other gynecological disorders6 (13%)
Dysmenorrhea100%100%
Abdominal pain outside menstruation
- Yes21 (66%)89 (71%)0.69
Patients with pain suggesting sciatica10 (31%)70 (56%)0.02
Dyspareunia intensity at VAS (mean ± SD)4.95 ± 3.525.28 ± 3.95<0.001
Patients with lower back pain outside menstruation20 (62%)101 (81%)0.049
Intensity of pain during defecation at VAS (mean ± SD)2.84 ± 2.764.35 ± 3.47<0.001
Patient with right shoulder pain during menstruation3 (9%)26 (21%)0.21
Intensity of urinary pain during menstruation at VAS (mean ± SD)2.84 ± 2.764.35 ± 3.36<0.001
Patient with blood in the stools during menstruation4 (12%)30 (24%)0.24
Patient with blood in urine during menstruation8 (25%)21 (17%)0.41
Table 2. Diagnostic metrics for endometriosis for differentially expressed miRNAs in the saliva samples (n = 30).
Table 2. Diagnostic metrics for endometriosis for differentially expressed miRNAs in the saliva samples (n = 30).
miRNARegulationAUCF1-ScoreSensitivitySpecificity
hsa-let-7a-5pUP0.4730.7210.690.255
hsa-let-7i-5pUP0.4510.7260.710.191
hsa-miR-101-3pUP0.5540.2270.1290.979
hsa-miR-103a-3pUP0.5820.3390.2060.957
hsa-miR-142-3pUP0.5290.110.0581
hsa-miR-146a-5pUP0.5380.8570.9480.128
hsa-miR-15a-5pUP0.5710.5580.4190.723
hsa-miR-16-5pUP0.4270.7070.6840.17
hsa-miR-199a-3pUP0.4670.2710.1680.766
hsa-miR-203b-5pUP0.4740.4360.310.638
hsa-miR-205-5pUP0.5440.1970.110.979
hsa-miR-223-3pUP0.4550.5170.40.511
hsa-miR-23a-3pUP0.540.2160.1230.957
hsa-miR-23b-3pUP0.5640.420.2770.851
hsa-miR-24-3pUP0.5420.6220.510.574
hsa-miR-26a-5pUP0.540.8680.9740.106
hsa-miR-29a-3pUP0.5520.6380.5290.574
hsa-miR-29c-3pUP0.4410.4490.3290.553
hsa-miR-3191-3pUP0.550.8520.9290.17
hsa-miR-34c-5pUP0.6420.8440.8580.426
hsa-miR-378a-3pUP0.5540.4840.3420.766
hsa-miR-4330UP0.3930.4090.2970.489
hsa-miR-4516UP0.5810.6840.5870.574
hsa-miR-4677-3pUP0.6920.7810.7030.681
hsa-miR-4754UP0.5530.820.8520.255
hsa-miR-4778-5pUP0.5830.5980.4650.702
hsa-miR-523-5pDOWN0.5760.40.2580.894
hsa-miR-655-5pUP0.6160.5340.3810.851
hsa-miR-6726-5pUP0.5570.8510.9230.191
hsa-miR-6738-3pDOWN0.5910.4590.310.872
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bendifallah, S.; Dabi, Y.; Suisse, S.; Jornea, L.; Bouteiller, D.; Touboul, C.; Puchar, A.; Daraï, E. A Bioinformatics Approach to MicroRNA-Sequencing Analysis Based on Human Saliva Samples of Patients with Endometriosis. Int. J. Mol. Sci. 2022, 23, 8045. https://doi.org/10.3390/ijms23148045

AMA Style

Bendifallah S, Dabi Y, Suisse S, Jornea L, Bouteiller D, Touboul C, Puchar A, Daraï E. A Bioinformatics Approach to MicroRNA-Sequencing Analysis Based on Human Saliva Samples of Patients with Endometriosis. International Journal of Molecular Sciences. 2022; 23(14):8045. https://doi.org/10.3390/ijms23148045

Chicago/Turabian Style

Bendifallah, Sofiane, Yohann Dabi, Stéphane Suisse, Ludmila Jornea, Delphine Bouteiller, Cyril Touboul, Anne Puchar, and Emile Daraï. 2022. "A Bioinformatics Approach to MicroRNA-Sequencing Analysis Based on Human Saliva Samples of Patients with Endometriosis" International Journal of Molecular Sciences 23, no. 14: 8045. https://doi.org/10.3390/ijms23148045

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop