Article

Full Length Transcriptome Highlights the Coordination of Plastid Transcript Processing

1 Institute of Plant Sciences Paris-Saclay (IPS2), Université Paris-Saclay, CNRS, INRAE, Université Evry, 91405 Orsay, France
2 Institute of Plant Sciences Paris-Saclay (IPS2), Université de Paris, CNRS, INRAE, 91405 Orsay, France
3 Laboratoire de Mathématiques et de Modélisation d’Evry (LaMME), Université d’Evry-Val-d’Essonne, UMR CNRS 8071, ENSIIE, USC INRAE, 91000 Evry, France
* Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2021, 22(20), 11297; https://doi.org/10.3390/ijms222011297
Submission received: 29 August 2021 / Revised: 8 October 2021 / Accepted: 11 October 2021 / Published: 19 October 2021
(This article belongs to the Special Issue Post-transcriptional Regulation in Plant Organelles)

Abstract

Plastid gene expression involves many post-transcriptional maturation steps resulting in a complex transcriptome composed of multiple isoforms. Although short-read RNA-Seq has considerably improved our understanding of the molecular mechanisms controlling these processes, it is unable to sequence full-length transcripts. This information is crucial, however, when it comes to understanding the interplay between the various steps of plastid gene expression. Here, we describe a protocol to study the plastid transcriptome using nanopore sequencing. In the leaf of Arabidopsis thaliana, with about 1.5 million strand-specific reads mapped to the chloroplast genome, we could recapitulate most of the complexity of the plastid transcriptome (polygenic transcripts, multiple isoforms associated with post-transcriptional processing) using virtual Northern blots. Even if the transcripts longer than about 2500 nucleotides were missing, the study of the co-occurrence of editing and splicing events identified 42 pairs of events that were not occurring independently. This study also highlighted a preferential chronology of maturation events with splicing happening after most sites were edited.

1. Introduction

Plastids are derived from the endosymbiosis between photosynthetic organisms and an ancestral Eukaryote. Although most of the initial symbiont genes have been transferred to the nucleus during the course of evolution, plastids of land plants and other photosynthetic Eukaryotes still maintain a small but essential genome. It mainly encodes subunits of each of the photosynthetic complexes (Photosystem I and II, cytochrome b6/f, ATP synthase and Rubisco) and some of the plastid gene expression (PGE) machinery [1]. Most of the proteins involved in PGE are, however, encoded in the nucleus and need to be targeted back to plastids. As a consequence, PGE retains characteristics of both eukaryotic and bacterial systems, resulting in a sophisticated interplay between nucleus- and plastid-encoded factors [2,3,4].
A striking feature of PGE is the importance and complexity of the post-transcriptional maturation steps. In addition to the intron removal by RNA splicing [5] and the specific conversion of cytosines into uridines by RNA editing [6], complete maturation also requires intergenic cleavage of the multigenic transcripts and the generation of 5′ and 3′ ends through RNA processing [7,8]. Most of the RNA binding proteins (RBP) or ribonucleases known to be involved in PGE are localized in a membraneless structure surrounding the plastome—the nucleoid [9]. This close association between RNA maturation factors might be an explanation for the multiple pleiotropic effects observed in chloroplast mutants [7].
Various investigations, both in vitro and in organellar gene expression mutant plants, have indeed revealed situations where the different maturation events can influence each other. For example, intron removal is a prerequisite for editing in the ndhA second exon [10] and atpF splicing is severely reduced in the aef1 mutant in which the editing of atpF_12707 is abolished [11]. Arabidopsis thaliana chloroplast RNA editing is affected in a mutant deficient for the exoribonuclease PNPase [12] while correct processing of the potato mitochondrial tRNA Phe requires RNA editing [13]. Editing sites can even influence each other. For example, in A. thaliana, editing of mitochondrial ccmB_17869 by MEF19 depends on the editing of ccmB_17884 by MEF37 [14]. Similarly, in Physcomitrium patens, editing of the mitochondrial ccmFc-C103 by PpPPR_65 controls editing of ccmFc-C122 by PpPPR_71 [15,16].
These dependencies are usually explained according to two models. First, one maturation event can modify the RNA secondary structure necessary for the second maturation. Second, the proteins responsible for the maturation can interact with each other or, more directly, target several maturation events. Most studies, however, only focused on a limited set of transcripts or RNA maturation events precluding any general conclusions. This illustrates the urgent need for the development of global approaches capable of simultaneously studying all the RNA maturation processes, at the transcriptomic level. This issue has recently been tackled by the increasing use of Illumina-based RNA-Seq strategies to study PGE from transcription to translation [17,18,19,20,21,22,23].
Although this has considerably increased the power and sensitivity of PGE analyses, it is ill-suited to study the potential coordination between maturation steps. The short reads used by Illumina technology (the maximum insert size of Illumina TruSeq RNA libraries reaches around 350 base-pairs) make it impossible to monitor the co-occurrence of these events on single RNA transcripts that can be several kilobases long. An alternative would be to take advantage of other sequencing technologies such as PacBio or Oxford Nanopore. They theoretically allow the sequencing of full-length cDNAs or RNA and should therefore overcome the current technical limitations [24]. A major issue, however, is that most of the available library preparation protocols only capture polyadenylated RNA transcripts, therefore excluding plastid transcripts. A recent protocol analyzing chromatin-bound transcripts also captures non-polyadenylated transcripts but was not applied to the analysis of plastid transcripts [25,26].
In this work, we describe the analysis of the A. thaliana plastid transcriptome by sequencing full-length non-polyadenylated and polyadenylated cDNAs using the Oxford Nanopore technology (ONT). This analysis identified all known post-transcriptional maturation events and provided an overview of their coordination in normal growth conditions.

2. Results

2.1. A Protocol to Sequence the Full Length Plastid Transcriptome

The library synthesis protocol is derived from the Switching Mechanism at the 5′ end of RNA Transcript (SMART) technology developed to synthesize full-length cDNAs [27]. Because polyadenylation of chloroplastic RNAs acts as a degradation signal [28], we first had to ligate an RNA adapter (modified from Hotto et al. [29]) to the 3′ end of the RNAs to allow priming of the reverse transcription, and then to deplete rRNA before completing the cDNA synthesis. The cDNAs are then incorporated into an ONT sequencing library and sequenced. Sampling RNA from leaves of 5-week-old Col-0 A. thaliana plants grown in long-day conditions at 20 °C, we mapped between 1.55 million and 2.69 million stranded reads (mapping rate between 98.5% and 99.8%) to the A. thaliana genome, including between 10% and 40% to the plastid genome and between 0.3% and 0.8% to the mitochondrial genome. The median error rate was between 4% and 4.4%. The rRNA depletion was very efficient, with less than 0.1% of reads mapping to rRNA loci. More than 99.5% of the reads mapped to annotated nuclear genes were in the sense orientation, a proportion similar to Illumina stranded RNA-Seq. Most of the reads (99%) were between 195 and 2141 nucleotides (nt) long, with a median size of 852 nt and a maximum size of 4805 nt. In A. thaliana, 7261 genes produce transcripts longer than 2141 nt and more than 390 genes (including the plastid ycf2 gene) produce transcripts longer than 4800 nt. Based on the whole transcriptome, the 3′ to 5′ transcript coverage was better with our protocol than for similar samples analyzed using Illumina sequencing for transcripts below 1500 nt (22,853 genes; Figure S1). For transcripts above 1500 nt (17,985 genes), Illumina sequencing performed better and a moderate 3′-5′ bias can be observed.
These results confirm that our nanopore reads were mostly full-length and stranded but that the longer transcripts are missing from the sequencing libraries.

2.2. A Representative Picture of the Plastid Transcriptome

With at least 275,000 reads mapped on the plastid genome for each biological replicate, the coverage is deep enough to have a good representation of the plastid transcriptome. To verify that the sequencing data are correctly capturing the plastid transcriptome, we looked at the complex transcriptional profile of the psbB to petD genomic region (Figure 1).
Following transcription, transcripts from this multigenic locus are processed into multiple poly- or monocistronic isoforms on both genomic strands [30,31]. A rapid overview of the reads showed the transcription of psbN on the Crick strand while psbB, psbT, psbH, petB and petD were transcribed from the Watson strand as expected. The spliced petD and petB transcripts were also found. Taking advantage of long-read sequencing, it is possible to emulate Northern blots by selecting reads which map on specific positions and plotting the distribution of the read lengths. Felder et al. [30] studied the involvement of HCF107 in the processing of the psbB to petD locus with an extensive use of Northern blots, allowing a comparison of the two methods. We therefore generated virtual Northern blots for psbN, psbH, petB and petD (Figure 2) using virtual probes equivalent to the probes used for Figure 4C,E,H,I of Felder et al. [30].
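The idea of a virtual Northern blot can be sketched in a few lines of Python. This is only an illustration, not the pipeline used for Figure 2: the read tuples, the coordinates, the `virtual_northern` helper and the 50 nt bin size are all hypothetical, and requiring a read to span the whole probe is an assumption (the text only states that reads mapping on specific positions are selected).

```python
from collections import Counter

def virtual_northern(reads, probe_start, probe_end, strand, bin_size=50):
    """Bin the lengths of reads that span a probe interval on one strand.

    reads: iterable of (start, end, strand, read_length) tuples (hypothetical format).
    Returns {bin_lower_bound_in_nt: read_count}, sorted by size.
    """
    counts = Counter()
    for read_start, read_end, read_strand, read_length in reads:
        spans_probe = read_start <= probe_start and read_end >= probe_end
        if read_strand == strand and spans_probe:
            counts[(read_length // bin_size) * bin_size] += 1
    return dict(sorted(counts.items()))

# Toy reads with made-up coordinates: (start, end, strand, read length).
toy_reads = [
    (100, 1200, "+", 1100),
    (110, 1215, "+", 1105),
    (100, 1900, "+", 1800),
    (900, 1100, "-", 200),   # wrong strand for the probe below
    (1500, 1900, "+", 400),  # does not span the probe
]

bands = virtual_northern(toy_reads, probe_start=300, probe_end=400, strand="+")
for size, n_reads in bands.items():
    print(f"{size:>5} nt  {'#' * n_reads}")
```

Each non-empty bin plays the role of a band; plotting the counts against read length gives the kind of profile that can be compared with a membrane-based Northern blot.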
Reads mapping to psbN were almost exclusively 200 nt long which is compatible with the signal detected by a classic Northern blot. Reads mapping to psbH showed two major isoforms around 1100 nucleotides (nt) and 1800 nt but also two minor isoforms around 370 nt and 2600 nt. This profile is also compatible with the regular Northern blot. However, Felder et al. [30] also detected larger isoforms at 3300, 4100, 4900, and 5600 nt that were not captured in our sequencing libraries. The virtual Northern blot for petB showed four major isoforms at 750, 1100, 1450, and 1800 nt. A faint isoform may be present at 2250 nt. These isoforms were also detected by Felder et al. [30] who found additional isoforms at 2600, 3300, 4100, 4900, and 5600 nt. Finally, for petD, we found two major isoforms around 1450 and 1800 nt and minor isoforms around 990 and 2225 nt. We missed the larger isoforms detected by Felder et al. [30] but also a 1200 nt isoform described as an unspliced petD transcript which seemed to be replaced by our 990 nt isoform. The detection of sharp “bands” of the expected size in our virtual Northern blots confirms that the majority of the nanopore reads correspond to full-length cDNAs but this result also confirms that transcripts longer than 2–2.5 kb are under-represented in our sequencing libraries.
In these complex loci, it is sometimes difficult to identify all the bands on a regular Northern blot. For example, Felder et al. did not associate the 2200 nt transcript of their petB and petD Northern blots to a particular isoform.
Our sequencing showed that this transcript is most likely a polycistronic intermediate containing an unspliced petB with a spliced petD (Figure 3A). For petD, we detected a minor isoform around 990 nt. The associated transcripts corresponded to two distinct isoforms (Figure 3B). The first one corresponded to spliced petD transcripts but with 5′ ends within the second petB exon. The second one had a 5′ end in the petD intron at position 76,780 and included the second petD exon. Position 76,780 was identified as a transcription start site and multiple 5′ ends were mapped in this area [20]. Similarly, because of their poor resolution, regular Northern blots can miss isoforms of similar sizes. Our virtual Northern blot for psbH showed that the four peaks are double peaks: the main isoforms are each associated with isoforms which are 50 nt longer. When mapping these isoforms, we could show that the short and long isoforms are associated with different 5′ ends, the long one around the genomic position 74,393 and the short one around 74,441 (Figure 3C). According to Castandet et al. [20], position 74,441 corresponds to the major processed extremity of psbH while position 74,393 is a transcription start site.
Even if our nanopore reads showed no or only moderate 3′ to 5′ bias in general (Figure S1), some plastid transcripts showed 3′ to 5′ but also 5′ to 3′ biases (Figure 4).
Figure 3. Identification of transcript isoforms. Screenshots of IGV displaying the reads corresponding to various virtual Northern blot isoforms. Matching bases are shown in red. Split reads are joined by blue lines. Other colors indicate mismatches and indels. (A) Reads corresponding to the 2200 nt isoform of the petB and petD virtual Northern blots. (B) Reads corresponding to the 990 nt isoform of the petD virtual Northern blot. (C) Reads corresponding to the 1100–1150 nt isoforms of the psbH virtual Northern blot. The two 5′ ends are shown by black arrows.
Figure 4. Examples of coverage biases in plastid transcripts. Each panel shows the single-nucleotide coverage by strand-specific reads that overlap, at least partially, the genomic region marked in red. Genomic positions are shown at the top and the coding sequences associated with these regions are shown below as gray arrows or boxes. Introns are represented as black lines. (A): psaA transcripts. (B): rbcL transcripts. (C): ndhB transcripts. (D): psbE-psbJ transcripts.
In ndhB, the coverage at the 3′ end was only about 20% of the coverage at the 5′ end. This bias could be technical, but it was not observed for other transcripts (for example rbcL or the polycistronic psbE-F-L-J transcript). For psaA, we observed a strong 5′ to 3′ bias. It may reflect the degradation pattern of the 5300 nt long psaA-psaB-rps14 transcript [32], which is absent from our sequencing libraries.
Finally, post-transcriptional maturation events can be quantitatively analyzed. Known editing sites could be detected with rates comparable to leaf datasets (Table 1, Table S1) previously published by Guillaumot et al. [33] (Pearson correlation = 0.97; p-value < 2.2 × 10⁻¹⁶) and Ruwe et al. [12] (Pearson correlation = 0.94; p-value < 2.2 × 10⁻¹⁶).
It should be noted that the analysis of poorly edited sites by nanopore sequencing must be done carefully because of the relatively high error rate of this technology. For example, using the same pipeline as Guillaumot et al., we detected 123 plastid C to U transitions with a rate higher than 10% but only 44 of them were also detected by Guillaumot et al. [33] using Illumina sequencing. Similarly, intron splicing efficiency could be measured (Table 1 and Table S1) and it varied from 4% to 97% depending on the intron. Most values are higher (by 22 points on average) than the efficiencies measured by Guillaumot et al. [33] using Illumina. This bias could be explained by an under-representation of long (unspliced) transcripts compared to short (spliced) ones. However, this bias is not linked to the unspliced transcript length, the intron length or the unspliced/spliced size ratio (Figure S2). An alternative, but not exclusive, explanation is that the abundance of unspliced transcripts is difficult to estimate with Illumina sequencing.

2.3. Some Post-Transcriptional Events Are Coordinated and Ordered

Because editing and splicing events are well defined (a single genomic position, either processed or not), it is easy to statistically analyze the possible coordination between these events.
Although 1596 pairs of co-occurring events could theoretically be expected with the 14 splicing and 43 editing events analyzed, only 138 co-occurrences were detected at least once. This is, however, expected as not all events are found on the same transcript (Table S2). Out of these 138 pairs of maturation events, 42 were found not to occur independently (Figure 5). Conversely, we did not detect any complete dependency (when one maturation event is absolutely required for another maturation event to occur). We observed partial dependencies between splicing events (clpP introns, petD and petB introns), between editing and splicing events (in the atpF, clpP and ndhB transcripts) and between editing events (in the rps14, ndhD and ndhB transcripts). This partial dependency also occurred between different genes (petD and petB; psbE and psbF) belonging to polycistronic transcripts. Some sites of coordinated events, like ndhD_116290 and ndhD_116281, ndhB_95650 and ndhB_95644, or ndhB_95419 and the ndhB intron, were very close to each other, but the others were separated by more than 100 nt.
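The figure of 1596 is simply the number of unordered pairs that can be formed from 14 + 43 = 57 events. A minimal Python check, using placeholder event names that are not the identifiers of Table S1:

```python
from itertools import combinations
from math import comb

n_splicing, n_editing = 14, 43

# Placeholder labels; the real event identifiers are listed in Table S1.
events = [f"splicing_{i}" for i in range(n_splicing)]
events += [f"editing_{i}" for i in range(n_editing)]

pairs = list(combinations(events, 2))
n_pairs = len(pairs)

print(n_pairs, comb(n_splicing + n_editing, 2))
```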
A more detailed analysis shows that, for the 42 pairs of dependent events, maturation intermediates (TF (True-False) and FT (False-True) columns of Table S2) were always less frequent than expected for independent events. This means that when one site was processed, the second one was processed more often than expected by chance. In other words, there was co-maturation but no incompatibility: the maturation of one site increased the rate/speed of maturation of the other one. Furthermore, comparing the abundance of the maturation intermediates offers the opportunity to find a preferred order of the maturation events (Figure 6).
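As an illustration of this reasoning, the sketch below computes the counts expected under independence from the margins of a 2 × 2 table and compares the two intermediates. The counts are invented, the observed/expected ratio is an illustrative measure that is not necessarily how the columns of Table S2 were computed, and reading the order from the more abundant intermediate is an assumption about how Figure 6 can be interpreted.

```python
def expected_counts(tt, tf, ft, ff):
    """Expected TT/TF/FT/FF counts if events A and B matured independently."""
    n = tt + tf + ft + ff
    a_mature = tt + tf  # reads where A is mature
    b_mature = tt + ft  # reads where B is mature
    return {
        "TT": a_mature * b_mature / n,
        "TF": a_mature * (n - b_mature) / n,
        "FT": (n - a_mature) * b_mature / n,
        "FF": (n - a_mature) * (n - b_mature) / n,
    }

# Invented counts for a pair of events (A, B) observed on the same reads.
observed = {"TT": 700, "TF": 60, "FT": 20, "FF": 220}
expected = expected_counts(observed["TT"], observed["TF"], observed["FT"], observed["FF"])

# Ratios below 1 mean the intermediate is rarer than expected under independence.
ratio_tf = observed["TF"] / expected["TF"]
ratio_ft = observed["FT"] / expected["FT"]

# Assumption: the more abundant intermediate points to the event that tends to occur first.
first_event = "A" if observed["TF"] > observed["FT"] else "B"

print(f"TF observed/expected = {ratio_tf:.2f}")
print(f"FT observed/expected = {ratio_ft:.2f}")
print(f"event {first_event} tends to be processed first")
```

In this toy table both intermediates are depleted, and "A mature, B immature" reads outnumber the reverse configuration, which is the pattern that would place A before B.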
This analysis suggests that RNA editing at psbE_64109 generally occurred before RNA editing at psbF_63985, and that the splicing of petD preferentially occurred before the splicing of petB. The maturation of clpP generally started with RNA editing at clpP_69942 followed by the splicing of the second intron and finished by the splicing of the first intron. For ndhD, the maturation preferentially started with RNA editing at ndhD_116785 followed by ndhD_116494 then both ndhD_116290 and ndhD_116281 to finish with ndhD_117166, the editing site creating the start codon of ndhD. For the ndhB transcript, the chronology of the maturation seemed more convoluted as three sites (96,457, 96,439 and 95,225) were edited independently of the other maturation events. RNA editing at ndhB_97016 seemed to occur first followed by editing at the four sites 96,579, 95,650, 95,608 and 94,999. The maturation of ndhB ended with RNA editing at sites 96,698, 96,419, 95,644 and, probably slightly later, its splicing. To confirm the order deduced from the co-occurrence analysis for transcripts requiring more than three maturation events (i.e., ndhD and ndhB), we identified the reads covering all the maturation events and counted the frequency of the various intermediates (Table S3). Out of 413 intermediate reads, 311 (75%) were compatible with the proposed chronology of ndhD maturation. For ndhB, only 63 intermediate reads were identified. This number is too small to estimate the frequency of the 4096 possible intermediates (12 maturation events) but 35 (57%) were compatible with the proposed chronology. The reads corresponding to alternative maturation chronologies are probably the result of sequencing errors. Given a nanopore sequencing error rate of around 4% and the fact that this analysis considered five positions in ndhD and 12 in ndhB, only 81.6% (0.96⁵) of the ndhD reads and 62% (0.96¹²) of the ndhB reads are expected to be error-free at these positions.
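The two calculations in this paragraph can be sketched as follows. The compatibility rule used here (a site may only be mature once every site of the earlier steps is mature) is an assumption about what "compatible with the proposed chronology" means, the `is_compatible` helper is hypothetical, and the error-free fraction assumes independent errors at a flat 4% rate, so it only lands close to the rounded percentages quoted above.

```python
# Proposed ndhD chronology from the text; sites edited "together" share a step.
NDHD_STEPS = [
    {"ndhD_116785"},
    {"ndhD_116494"},
    {"ndhD_116290", "ndhD_116281"},
    {"ndhD_117166"},
]

def is_compatible(mature_sites, steps):
    """True if no site is mature while a site of an earlier step is still immature."""
    earlier_incomplete = False
    for step in steps:
        done = step & mature_sites
        if done and earlier_incomplete:
            return False
        if done != step:
            earlier_incomplete = True
    return True

def error_free_fraction(error_rate, n_positions):
    """Fraction of reads expected to carry no error at any of n monitored positions."""
    return (1 - error_rate) ** n_positions

in_order = is_compatible({"ndhD_116785", "ndhD_116494"}, NDHD_STEPS)
out_of_order = is_compatible({"ndhD_117166"}, NDHD_STEPS)

ndhd_error_free = error_free_fraction(0.04, 5)
ndhb_error_free = error_free_fraction(0.04, 12)
n_ndhb_states = 2 ** 12  # each of the 12 ndhB events is either mature or not

print(in_order, out_of_order)
print(f"{ndhd_error_free:.3f} {ndhb_error_free:.3f} {n_ndhb_states}")
```

Under this rule a read with only the start-codon site edited counts as an alternative chronology, which is the kind of read that a single sequencing error at one monitored position can create.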
This preferred chronology could theoretically be the result of kinetic differences between the different maturation events. For example, looking at two independent events, the one happening at a higher rate will likely occur first. This simple explanation is, however, incompatible with the observations. In particular, the decrease of the observed vs. expected TF and FT counts (columns delta_TF and delta_FT in Table S2) is not homogenous between TF and FT for most pairs of events. This shows that the positive effect (e.g., enhancement) provided by one maturation event to the other is not symmetrical. This asymmetry is involved in the chronology and could reinforce (at least in this case) any putative effects caused by a difference in processing rates. Finally, because of the number of maturation events jointly monitored for the ndhB (12 events) and ndhD transcripts (5 events, Table S3), the observation of a preferred chronology of maturation is extremely unlikely to be explained only by differences in maturation speed. We conclude that the observed preferred chronology of maturation is due, at least partly, to interactions between the processing events.

3. Discussion

Our protocol generates mostly full-length and stranded reads but transcripts longer than 2000–2500 nt are clearly under-represented. This bias is common to nuclear and plastid transcripts and several pieces of evidence (data not shown) strongly suggest that it is associated with the initial RNA-RNA ligation at the 3′ end of transcripts. This ligation step has indeed been described as sensitive to secondary structures at the 3′ end [35]. The denaturation step preceding the ligation may not have been sufficient for long transcripts.
Following transcription, plastid transcripts undergo a complex array of modifications and maturation and the recent massive use of RNA-Seq based strategies has led to an unprecedented knowledge about its different steps. What is sorely lacking, however, is a global understanding of the interplay between RNA editing, splicing and processing.
Initially thought to be mainly independent [36,37], there are now more and more pieces of evidence for crosstalk between the different maturation steps [10,38,39,40,41]. Most of these results, however, have been obtained from experiments based on Sanger sequencing of a cDNA of interest, therefore limiting any potential generalization. Taking advantage of the development of nanopore sequencing, we systematically studied the link between individual RNA splicing and RNA editing events, at the plastome level.
Our results show that co-maturation of several sites tends to occur even when located far apart on their cognate transcript. This implies that all of the actors of these different processing events are grouped or co-localized, likely in the nucleoid [9].
Looking at specific links, splicing of the atpF intron and RNA editing at the atpF_12707 site are clearly not independent (Figure 5). This was expected as AEF1, the PPR protein responsible for atpF_12707 editing in A. thaliana, also facilitates atpF splicing [11]. Similarly, the splicing of clpP intron 2 and of the ndhB intron is enhanced by RNA editing in the cognate transcripts (Figure 6). Earlier studies have shown that some unspliced or unprocessed transcripts can already be fully edited [36,37] and this was interpreted as evidence that RNA editing is an early process, mainly occurring before splicing. Although RNA editing can be a prerequisite for splicing when it restores sequences or structures within the intron [42,43], this is an unlikely explanation here as the sites are located far from the identified splicing key elements [44]. A possibility put forward by Yap et al. [11] is that the binding of the RNA editing factor itself could have an indirect effect on splicing through the modification of RNA secondary structure or accessibility.
In agreement with the idea that RNA editing is an early maturation step, we only found marginal evidence that specific RNA editing sites could be influenced by splicing (Figure 5). This result is, however, probably dependent on our experimental model, A. thaliana. In various plants, ndhA intron removal was shown to be necessary for a ndhA editing site located close to the 3′ splice site. In this case, splicing is thought to create the RNA sequence necessary for the recognition of the RNA editing site [10], a site that is absent in A. thaliana. A similar situation has been described in P. patens mitochondria where atp9 splicing is necessary to one editing site on the same transcript [15]. As shown for clpP, the splicing of one intron can also influence the splicing of another intron located on the same transcript (Figure 5 and Figure 6). Experiments with intron deletions in tobacco have previously shown that the second intron in the ycf3 transcript needs to be spliced before the first intron. In this case, splicing of the first intron was hypothesized to create a sequence masking essential structural elements of the second intron [45]. Although A. thaliana ycf3 structure is similar to tobacco, our analysis did not confirm such dependence in this transcript.
The dependence between RNA editing sites themselves has long been debated. For example, in vitro results on short fragments of the mitochondrial atp4 RNA suggested that editing of individual sites did not influence others while in organello experiments with longer cox2 transcripts showed a pattern of dependencies [46,47]. The identification of distal elements able to enhance RNA editing was also a strong argument against complete stochastic independence of the editing site recognition [41,48]. Our results show that both cases exist in the chloroplast. Editing site ndhD_117166 generally requires earlier editing of the four other ndhD sites and ndhB_97016 editing strongly influences editing at ndhB_96698 and ndhB_96579 sites. On the other hand editing at ndhB_95225 seems autonomous and barely influences any other editing site (Figure 6).
Editing and splicing of organellar transcripts are required to get mRNA translated into functional proteins as editing often restores conserved amino acids [49] and splicing preserves the translation frame. However, the study of the translational landscape of A. thaliana mitochondria [50] or maize chloroplasts [21] showed that ribosomes were associated with partially edited transcripts and a small fraction of ribosomes were even associated with intronic sequences. Earlier chloroplast polysome purification experiments also showed that transcripts of the psbB gene cluster containing the petB or petD intron could still be translated for other genes [51]. This suggests that partially mature (especially partially edited) transcripts can access the organelle translation machinery. In addition to the dependence of some maturation events, our results showed that they could be ordered (Figure 6). In this chronology, splicing events seemed to occur later than editing events: the splicing of ndhB occurred after editing at most sites and splicing of clpP occurred after its editing. Even if the chronology was not clear from our results, Yap et al. also showed that atpF editing probably occurs before its splicing [11]. In addition, events located at the 5′ end of the transcripts tended to occur later than the others. That is clearly the case for clpP and ndhD associated transcripts. In ndhD, RNA editing at ndhD_117166 was generally the last maturation event and is required to create the start codon and thus to allow the translation of the transcript. This succession of the maturation events where splicing and 5′ end events tend to be last could be a way to ensure the complete (or at least a better) maturation of the transcripts before initiating their translation.
Although there is currently no known underlying mechanism to support this hypothesis, it could at least explain why partially edited RNA editing sites are generally more edited in ribosome-associated RNAs than on the steady-state pool of transcripts [21,50]. In addition, it could also explain why sites restoring cryptic start codons have variable but often lower editing rates [49,52].
Despite the modest size of the dataset and its rather simple analysis, the results presented in this study highlight the potential of long-read RNA-Seq for the analysis of plastid and mitochondrial transcriptomes. Even if the molecular protocol still needs improvements to capture the longest transcripts, it provides access to the full complexity of this transcriptome and already showed numerous links between splicing and editing. For analytical reasons, we did not include the analysis of processing in this study but nanopore RNA-Seq is suited for this type of analysis (Figure 3) and we are developing the required bioinformatical and statistical tools. A potential improvement of our strategy would be to directly sequence the chloroplastic RNAs, without performing any cDNA synthesis. This would give access to the various epitranscriptomics marks [53] that are now known to be pervasive in chloroplastic RNAs [54] and whose interactions have, for example, been shown to be important in human diseases [55]. With this complete toolbox, we anticipate it will be possible to explore the impact of growth conditions and/or mutants or compare the nucleoid- or polysome-associated transcriptome to further decipher the molecular mechanisms controlling plastid but also mitochondrial gene expression.

4. Materials and Methods

4.1. Plant Growth and RNA Extraction

Col-0 plants were grown in soil in growth chambers with 16 h of light per day at 20 °C for 5 weeks. Fifteen minutes before the onset of light, two adult leaves were flash-frozen in liquid nitrogen. Total RNA was extracted using Nucleozol (Macherey-Nagel, Hoerdt, France) followed by a purification with AMPure RNA XP beads (Beckman Coulter, Villepinte, France). Three independent experiments were performed to get three biological replicates.

4.2. Nanopore Sequencing

The step-by-step protocol for the construction of the sequencing library is available online at https://forgemia.inra.fr/guillem.rigaill/nanopore_chloro (accessed on 18 October 2021). Briefly, 10 fmoles of the RNA oligo /5Phos/rNrNrNrNrUrGrArArUrGrCrArArCrArCrUrUrCrUrGrUrArC/3InvdT/ (IDT Technologies, Leuven, Belgium) were ligated to the 3′ end of 100 ng of total RNA using 10 U of T4 RNA ligase 1 (NEB, Evry, France). Ligated RNA was depleted of rRNA using the QIAseq FastSelect -rRNA Plant Kit (QIAGEN, Les Ulis, France) before a full-length cDNA synthesis using the SMARTScribe™ Reverse Transcriptase (Takara, Saint Germain en Laye, France) and the oligos AAGCAGTGGTATCAACGCAGAGTACrGrG + G and AAGCAGTGGTATCAACGCAGAGTACGTACAGAAGTGTTGCATTC (IDT Technologies, Leuven, Belgium). Full-length cDNAs were amplified with the SeqAmp DNA Polymerase (Takara, Saint Germain en Laye, France) using the AAGCAGTGGTATCAACGCAGAGTAC primer and purified with AMPure XP beads (Beckman Coulter, Villepinte, France). 35 fmoles of amplified cDNAs were converted to a nanopore sequencing library with the PCR barcoding kit (Oxford Nanopore Technologies, Oxford, UK) and then sequenced on an R10.3 MinION flow-cell (Oxford Nanopore Technologies, Oxford, UK).

4.3. Bioinformatics and Statistical Analyses

The raw data were base-called and demultiplexed with Guppy v5.0.7 (Oxford Nanopore Technologies) using the dna_r10.3_450bps_hac model. Reads were then oriented using the in-house script “fastq_processing.sh”, which uses LAST v1179 [56] and CUTADAPT v2.10 [57] and is available online at https://forgemia.inra.fr/guillem.rigaill/nanopore_chloro (accessed on 18 October 2021). They were mapped to the Col-0 genomic sequence with Minimap2 v2.1 [58]. Transcript body coverage and strandedness were measured with the RSeQC v3.0 package [59]. The Illumina samples used for comparison were the dyw2_HE replicates 1 to 3 (NCBI GEO accession numbers GSM2677518, GSM2677519 and GSM2677520) from Guillaumot et al. [33]. The plants used for these samples were grown in the same growth chambers and the sequencing libraries were constructed with the Illumina TruSeq stranded total RNA with Ribozero plant kit.
The maturation events analyzed in this study are listed in Table S1. They include the editing sites detected by Ruwe et al. [12] and the introns of protein-coding genes. The tRNA introns were omitted because the mature tRNAs are excluded from the sequencing library during sizing. This information was used to annotate each read for every maturation event according to three modalities: mature site, not mature site, and not read site. The last modality accounts for insertions/deletions, which are frequent in nanopore datasets. For each pair of events observed on the same reads, the following configurations were counted in a contingency table: mature/mature, mature/immature, immature/mature, and immature/immature. The dependency between two events was tested on this contingency table using a Fisher exact test, and the p-values were adjusted for multiple testing by controlling the FDR [60]. Only pairs of events with an adjusted p-value < 0.1 in at least 2 of the 3 replicates and an adjusted p-value < 0.005 on the pool of the 3 replicates were considered significant. Commented R scripts to annotate reads, create contingency tables, perform Fisher’s exact tests and generate the result table are available online at https://forgemia.inra.fr/guillem.rigaill/nanopore_chloro (accessed on 18 October 2021).
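As an illustration only, the pairwise test described above can be sketched in Python. This is not the authors' R code (which is available at the repository cited above). The read representation (one dictionary per read mapping an event name to "mature", "immature" or "not_read") and all function names are assumptions made for this sketch. The Fisher exact test is written out with the standard library so the example is self-contained, and the FDR adjustment follows the Benjamini–Hochberg procedure of reference [60].

```python
from collections import Counter
from math import comb


def contingency_table(reads, event_a, event_b):
    """Count the four joint configurations over reads covering both events.

    `reads` is a hypothetical structure: a list of dicts mapping an event
    name to "mature", "immature" or "not_read". Reads in which either
    event is missing or "not_read" are ignored.
    """
    counts = Counter()
    for read in reads:
        a = read.get(event_a, "not_read")
        b = read.get(event_b, "not_read")
        if "not_read" not in (a, b):
            counts[(a, b)] += 1
    return [
        [counts[("mature", "mature")], counts[("mature", "immature")]],
        [counts[("immature", "mature")], counts[("immature", "immature")]],
    ]


def fisher_exact_two_sided(table):
    """Two-sided Fisher exact p-value for a 2x2 table (hypergeometric sum)."""
    (a, b), (c, d) = table
    row1, col1, n = a + b, a + c, a + b + c + d
    if n == 0:
        return 1.0

    def prob(x):
        # Probability of observing x in the top-left cell with fixed margins.
        return comb(col1, x) * comb(n - col1, row1 - x) / comb(n, row1)

    p_observed = prob(a)
    lowest = max(0, row1 - (n - col1))
    highest = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    total = sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-7)
    )
    return min(1.0, total)


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def is_significant(adj_p_replicates, adj_p_pool):
    """Thresholds from the text: < 0.1 in at least 2 replicates, < 0.005 pooled."""
    return sum(p < 0.1 for p in adj_p_replicates) >= 2 and adj_p_pool < 0.005
```

In this sketch, a pair of events would be tested once per replicate and once on the pooled reads, with the adjustment applied across all tested pairs within each set before the thresholds are checked. The exact sum over tables is slow for very deep coverage, so a dedicated statistics library would be preferable in practice.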
The splicing and editing rates were measured from the pooled reads of the 3 replicates. Virtual Northern blots were generated by extracting the length of the reads mapping from position 75700 to position 76000 on the Watson strand (petB), from position 77200 to position 77500 on the Watson strand (petD), from position 74487 to position 74706 on the Watson strand (psbH) or from position 74254 to position 74378 on the Crick strand (psbN) using samtools [61] and bedtools [62]. The size distributions were normalized by setting the value of the most abundant read length to 100. These distributions were converted into virtual Northern blots with the “vNB.py” Python script available online at https://forgemia.inra.fr/guillem.rigaill/nanopore_chloro (accessed on 18 October 2021).
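The normalization step for the virtual Northern blots can be sketched as follows. This is a minimal illustration, not the “vNB.py” script itself. The read tuples (start, end, strand, length), the function names, and the choice to keep any read overlapping the probe window are assumptions made for the example, since the text does not state whether reads must overlap or fully span the window.

```python
from collections import Counter


def window_read_lengths(reads, window_start, window_end, strand):
    """Lengths of the reads on `strand` that overlap the probe window.

    `reads` is a hypothetical list of (start, end, strand, length) tuples
    with 1-based inclusive coordinates. Overlap (rather than full
    containment) is an assumption of this sketch.
    """
    return [
        length
        for start, end, read_strand, length in reads
        if read_strand == strand and start <= window_end and end >= window_start
    ]


def normalized_length_distribution(read_lengths):
    """Scale the length counts so that the most abundant length equals 100."""
    counts = Counter(read_lengths)
    if not counts:
        return {}
    peak = max(counts.values())
    return {length: 100 * n / peak for length, n in sorted(counts.items())}
```

For example, with the petB window quoted above, `normalized_length_distribution(window_read_lengths(reads, 75700, 76000, "+"))` would return one intensity per read length, which a plotting script could then render as bands.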

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijms222011297/s1.

Author Contributions

Conceptualization, M.G., B.C. and E.D.; Data curation, A.L., G.R. and E.D.; Formal analysis, A.L., C.S. and G.R.; Funding acquisition, G.R., B.C. and E.D.; Investigation, M.G. and E.D.; Methodology, G.R., B.C. and E.D.; Project administration, E.D.; Resources, E.D.; Software, M.G., A.L., C.S., T.B. and G.R.; Supervision, E.D.; Writing—original draft, M.G., B.C. and E.D.; Writing—review and editing, A.L., G.R., B.C. and E.D. All authors have read and agreed to the published version of the manuscript.

Funding

The IPS2 benefits from the support of Saclay Plant Sciences-SPS (ANR-17-EUR-0007). This work was supported by a grant from the Université Evry-Val d’Essonne to E.D., by the ANR-20-CE20-0004 JOAQUIN to B.C. and by the Evry Genopole to G.R.

Data Availability Statement

The fastq files are available from the NCBI SRA database under the accession number PRJNA748959.

Acknowledgments

We thank Etienne Sandré-Chardonnal for the python script generating the picture of the virtual Northern blot and Amber M Hotto for her comments and proofreading of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Green, B.R. Chloroplast genomes of photosynthetic eukaryotes. Plant J. 2011, 66, 34–44.
  2. Maier, U.G.; Bozarth, A.; Funk, H.T.; Zauner, S.; Rensing, S.A.; Schmitz-Linneweber, C.; Börner, T.; Tillich, M. Complex chloroplast RNA metabolism: Just debugging the genetic programme? BMC Biol. 2008, 6, 36.
  3. Stern, D.B.; Goldschmidt-Clermont, M.; Hanson, M.R. Chloroplast RNA Metabolism. Annu. Rev. Plant Biol. 2010, 61, 125–155.
  4. Barkan, A. Expression of Plastid Genes: Organelle-Specific Elaborations on a Prokaryotic Scaffold. Plant Physiol. 2011, 155, 1520–1532.
  5. de Longevialle, A.F.; Small, I.D.; Lurin, C. Nuclearly-encoded splicing factors implicated in RNA splicing in higher plant organelles. Mol. Plant 2010, 3, 691–705.
  6. Sun, T.; Bentolila, S.; Hanson, M.R. The Unexpected Diversity of Plant Organelle RNA Editosomes. Trends Plant Sci. 2016, 21, 962–973.
  7. Germain, A.; Hotto, A.M.; Barkan, A.; Stern, D.B. RNA processing and decay in plastids. Wiley Interdiscip. Rev. RNA 2013, 4, 295–316.
  8. MacIntosh, G.C.; Castandet, B. Organellar and Secretory Ribonucleases: Major Players in Plant RNA Homeostasis. Plant Physiol. 2020, 183, 1438.
  9. Majeran, W.; Friso, G.; Asakura, Y.; Qu, X.; Huang, M.; Ponnala, L.; Watkins, K.P.; Barkan, A.; van Wijk, K.J. Nucleoid-enriched proteomes in developing plastids and chloroplasts from maize leaves: A new conceptual framework for nucleoid functions. Plant Physiol. 2012, 158, 156–189.
  10. Schmitz-Linneweber, C.; Tillich, M.; Herrmann, R.G.; Maier, R.M. Heterologous, splicing-dependent RNA editing in chloroplasts: Allotetraploidy provides trans-factors. EMBO J. 2001, 20, 4874–4883.
  11. Yap, A.; Kindgren, P.; Colas Des Francs-Small, C.; Kazama, T.; Tanz, S.K.; Toriyama, K.; Small, I. AEF1/MPR25 is implicated in RNA editing of plastid atpF and mitochondrial nad5, and also promotes atpF splicing in Arabidopsis and rice. Plant J. 2015, 81, 661–669.
  12. Ruwe, H.; Castandet, B.; Schmitz-Linneweber, C.; Stern, D.B. Arabidopsis chloroplast quantitative editotype. FEBS Lett. 2013, 587, 1429–1433.
  13. Marechal-Drouard, L.; Cosset, A.; Remacle, C.; Ramamonjisoa, D.; Dietrich, A. A single editing event is a prerequisite for efficient processing of potato mitochondrial phenylalanine tRNA. Mol. Cell. Biol. 1996, 16, 3504–3510.
  14. Malbert, B.; Burger, M.; Lopez-Obando, M.; Baudry, K.; Launay-Avon, A.; Härtel, B.; Verbitskiy, D.; Jörg, A.; Berthomé, R.; Lurin, C.; et al. The Analysis of the Editing Defects in the dyw2 Mutant Provides New Clues for the Prediction of RNA Targets of Arabidopsis E+-Class PPR Proteins. Plants 2020, 9, 280.
  15. Ichinose, M.; Sugita, C.; Yagi, Y.; Nakamura, T.; Sugita, M. Two DYW subclass PPR proteins are involved in RNA editing of ccmFc and atp9 transcripts in the moss Physcomitrella patens: First complete set of PPR editing factors in plant mitochondria. Plant Cell Physiol. 2013, 54, 1907–1916.
  16. Schallenberg-Rüdinger, M.; Kindgren, P.; Zehrmann, A.; Small, I.; Knoop, V. A DYW-protein knockout in Physcomitrella affects two closely spaced mitochondrial editing sites and causes a severe developmental phenotype. Plant J. 2013, 76, 420–432.
  17. Castandet, B.; Hotto, A.M.; Strickler, S.R.; Stern, D.B. ChloroSeq, an Optimized Chloroplast RNA-Seq Bioinformatic Pipeline, Reveals Remodeling of the Organellar Transcriptome Under Heat Stress. G3 Genes Genomes Genet. 2016, 6, 2817–2827.
  18. Michel, E.J.S.S.; Hotto, A.M.; Strickler, S.R.; Stern, D.B.; Castandet, B. A Guide to the Chloroplast Transcriptome Analysis Using RNA-Seq. Methods Mol. Biol. 2018, 1829, 295–313.
  19. Malbert, B.; Rigaill, G.; Brunaud, V.; Lurin, C.; Delannoy, E. Bioinformatic Analysis of Chloroplast Gene Expression and RNA Posttranscriptional Maturations Using RNA Sequencing. In Methods in Molecular Biology; Humana Press: New York, NY, USA, 2018; Volume 1829, pp. 279–294.
  20. Castandet, B.; Germain, A.; Hotto, A.M.; Stern, D.B. Systematic sequencing of chloroplast transcript termini from Arabidopsis thaliana reveals >200 transcription initiation sites and the extensive imprints of RNA-binding proteins and secondary structures. Nucleic Acids Res. 2019, 47, 11889–11905.
  21. Chotewutmontri, P.; Barkan, A. Dynamics of Chloroplast Translation during Chloroplast Differentiation in Maize. PLoS Genet. 2016, 12, e1006106.
  22. Ruwe, H.; Wang, G.; Gusewski, S.; Schmitz-Linneweber, C. Systematic analysis of plant mitochondrial and chloroplast small RNAs suggests organelle-specific mRNA stabilization mechanisms. Nucleic Acids Res. 2016, 44, 7406–7417.
  23. Zhelyazkova, P.; Sharma, C.M.; Forstner, K.U.; Liere, K.; Vogel, J.; Borner, T. The Primary Transcriptome of Barley Chloroplasts: Numerous Noncoding RNAs and the Dominating Role of the Plastid-Encoded RNA Polymerase. Plant Cell 2012, 24, 123–136.
  24. Cui, J.; Shen, N.; Lu, Z.; Xu, G.; Wang, Y.; Jin, B. Analysis and comprehensive comparison of PacBio and nanopore-based RNA sequencing of the Arabidopsis transcriptome. Plant Methods 2020, 16, 85.
  25. Long, Y.; Jia, J.; Mo, W.; Jin, X.; Zhai, J. FLEP-seq: Simultaneous detection of RNA polymerase II position, splicing status, polyadenylation site and poly(A) tail length at genome-wide scale by single-molecule nascent RNA sequencing. Nat. Protoc. 2021, 16, 4355–4381.
  26. Jia, J.; Long, Y.; Zhang, H.; Li, Z.; Liu, Z.; Zhao, Y.; Lu, D.; Jin, X.; Deng, X.; Xia, R.; et al. Post-transcriptional splicing of nascent RNA contributes to widespread intron retention in plants. Nat. Plants 2020, 6, 780–788.
  27. Zhu, Y.; Machleder, E.; Chenchik, A.; Li, R.; Siebert, P. Reverse transcriptase template switching: A SMART approach for full-length cDNA library construction. Biotechniques 2001, 30, 892–897.
  28. Schuster, G.; Stern, D. RNA Polyadenylation and Decay in Mitochondria and Chloroplasts. Prog. Mol. Biol. Transl. Sci. 2009, 85, 393–422.
  29. Hotto, A.M.; Castandet, B.; Gilet, L.; Higdon, A.; Condon, C.; Stern, D.B. Arabidopsis Chloroplast Mini-Ribonuclease III Participates in rRNA Maturation and Intron Recycling. Plant Cell 2015, 27, 724–740.
  30. Felder, S.; Meierhoff, K.; Sane, A.P.; Meurer, J.; Driemel, C.; Plücken, H.; Klaff, P.; Stein, B.; Bechtold, N.; Westhoff, P. The Nucleus-Encoded HCF107 Gene of Arabidopsis Provides a Link between Intercistronic RNA Processing and the Accumulation of Translation-Competent psbH Transcripts in Chloroplasts. Plant Cell 2001, 13, 2127.
  31. Stoppel, R.; Meurer, J. Complex RNA metabolism in the chloroplast: An update on the psbB operon. Planta 2013, 237, 441–449.
  32. Lezhneva, L.; Meurer, J. The nuclear factor HCF145 affects chloroplast psaA-psaB-rps14 transcript abundance in Arabidopsis thaliana. Plant J. 2004, 38, 740–753.
  33. Guillaumot, D.; Lopez-Obando, M.; Baudry, K.; Avon, A.; Rigaill, G.; Falcon De Longevialle, A.; Broche, B.; Takenaka, M.; Berthomé, R.; De Jaeger, G.; et al. Two interacting PPR proteins are major Arabidopsis editing factors in plastid and mitochondria. Proc. Natl. Acad. Sci. USA 2017, 114, 8877–8882.
  34. Rüdinger, M.; Funk, H.T.; Rensing, S.A.; Maier, U.G.; Knoop, V. RNA editing: Only eleven sites are present in the Physcomitrella patens mitochondrial transcriptome and a universal nomenclature proposal. Mol. Genet. Genom. 2009, 281, 473–481.
  35. Zhuang, F.; Fuchs, R.T.; Sun, Z.; Zheng, Y.; Robb, G.B. Structural bias in T4 RNA ligase-mediated 3′-adapter ligation. Nucleic Acids Res. 2012, 40, e54.
  36. Freyer, R.; Hoch, B.; Neckermann, K.; Maier, R.M.; Kössel, H. RNA editing in maize chloroplasts is a processing step independent of splicing and cleavage to monocistronic mRNAs. Plant J. 1993, 4, 621–629.
  37. Ruf, S.; Zeltz, P.; Kössel, H. Complete RNA editing of unspliced and dicistronic transcripts of the intron-containing reading frame IRF170 from maize chloroplasts. Proc. Natl. Acad. Sci. USA 1994, 91, 2295–2299.
  38. Maréchal-Drouard, L.; Kumar, R.; Remacle, C.; Small, I. RNA editing of larch mitochondrial tRNA His precursors is a prerequisite for processing. Nucleic Acids Res. 1996, 24, 3229–3234.
  39. Tillich, M.; Hardel, S.L.; Kupsch, C.; Armbruster, U.; Delannoy, E.; Gualberto, J.M.; Lehwark, P.; Leister, D.; Small, I.D.; Schmitz-Linneweber, C. Chloroplast ribonucleoprotein CP31A is required for editing and stability of specific chloroplast mRNAs. Proc. Natl. Acad. Sci. USA 2009, 106, 6002–6007.
  40. Karcher, D.; Bock, R. Site-selective inhibition of plastid RNA editing by heat shock and antibiotics: A role for plastid translation in RNA editing. Nucleic Acids Res. 1998, 26, 1185–1190.
  41. Takenaka, M.; Neuwirt, J.; Brennicke, A. Complex cis-elements determine an RNA editing site in pea mitochondria. Nucleic Acids Res. 2004, 32, 4137–4144.
  42. Castandet, B.; Choury, D.; Bégu, D.; Jordana, X.; Araya, A. Intron RNA editing is essential for splicing in plant mitochondria. Nucleic Acids Res. 2010, 38, 7112–7121.
  43. Farré, J.-C.; Aknin, C.; Araya, A.; Castandet, B. RNA Editing in Mitochondrial Trans-Introns Is Required for Splicing. PLoS ONE 2012, 7, e52644.
  44. Vogel, J.; Börner, T. Lariat formation and a hydrolytic pathway in plant chloroplast group II intron splicing. EMBO J. 2002, 21, 3794–3803.
  45. Petersen, K.; Schöttler, M.A.; Karcher, D.; Thiele, W.; Bock, R. Elimination of a group II intron from a plastid gene causes a mutant phenotype. Nucleic Acids Res. 2011, 39, 5181–5192.
  46. Verbitskiy, D.; Takenaka, M.; Neuwirt, J.; Van Der Merwe, J.A.; Brennicke, A. Partially edited RNAs are intermediates of RNA editing in plant mitochondria. Plant J. 2006, 47.
  47. Castandet, B.; Araya, A. The RNA editing pattern of cox2 mRNA is affected by point mutations in plant mitochondria. PLoS ONE 2011, 6, e20867.
  48. Staudinger, M.; Bolle, N.; Kempken, F. Mitochondrial electroporation and in organello RNA editing of chimeric atp6 transcripts. Mol. Genet. Genom. 2005, 273, 130–136.
  49. Small, I.D.; Schallenberg-Rüdinger, M.; Takenaka, M.; Mireau, H.; Ostersetzer-Biran, O. Plant organellar RNA editing: What 30 years of research has revealed. Plant J. 2019, 101, 1040–1056.
  50. Planchard, N.; Bertin, P.; Quadrado, M.; Dargel-Graffin, C.; Hatin, I.; Namy, O.; Mireau, H. The translational landscape of Arabidopsis mitochondria. Nucleic Acids Res. 2018, 46, 6218.
  51. Barkan, A. Proteins encoded by a complex chloroplast transcription unit are each translated from both monocistronic and polycistronic mRNAs. EMBO J. 1988, 7, 2637–2644.
  52. Li, M.; Xia, L.; Zhang, Y.; Niu, G.; Li, M.; Wang, P.; Zhang, Y.; Sang, J.; Zou, D.; Hu, S.; et al. Plant editosome database: A curated database of RNA editosome in plants. Nucleic Acids Res. 2019, 47, D170.
  53. Anreiter, I.; Mir, Q.; Simpson, J.T.; Janga, S.C.; Soller, M. New Twists in Detecting mRNA Modification Dynamics. Trends Biotechnol. 2021, 39, 72–89.
  54. Manavski, N.; Vicente, A.; Chi, W.; Meurer, J. The chloroplast epitranscriptome: Factors, sites, regulation, and detection methods. Genes 2021, 39, 72–89.
  55. Kadumuri, R.V.; Janga, S.C. Epitranscriptomic Code and Its Alterations in Human Disease. Trends Mol. Med. 2018, 24, 886–903.
  56. Kiełbasa, S.M.; Wan, R.; Sato, K.; Horton, P.; Frith, M.C. Adaptive seeds tame genomic sequence comparison. Genome Res. 2011, 21, 487–493.
  57. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 2011, 17, 10–12.
  58. Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 2018, 34, 3094–3100.
  59. Wang, L.; Nie, J.; Sicotte, H.; Li, Y.; Eckel-Passow, J.E.; Dasari, S.; Vedell, P.T.; Barman, P.; Wang, L.; Weinshiboum, R.; et al. Measure transcript integrity using RNA-seq data. BMC Bioinform. 2016, 17, 1–16.
  60. Benjamini, Y.; Hochberg, Y. Controlling The False Discovery Rate—A Practical And Powerful Approach To Multiple Testing. J. R. Stat. Soc. B 1995, 57, 289–300.
  61. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078.
  62. Quinlan, A.R.; Hall, I.M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26, 841.
Figure 1. The complexity of the psbB-petD locus. Screenshots of Integrative Genomics Viewer (IGV) displaying nanopore reads mapping to the psbB-petD locus. (A) plastid genomic position. (B) coverage track displaying the number of reads at each nucleotide. (C) screenshot of reads mapping on the Watson strand. Matching bases are shown in red. Split reads are joined by blue lines. (D) Annotation of the locus. Introns are shown as thinner segments. (E) screenshot of reads mapping on the Crick strand. Matching bases are shown in purple. Split reads are joined by blue lines.
Figure 2. Virtual Northern blots derived from the nanopore sequencing. Northern blots were emulated from nanopore reads mapping to the sequences of psbN, psbH, the second exon of petB, or the second exon of petD shown in red on the genomic map displayed above. The size (in nt) is shown on the left.
Figure 5. Network of splicing and editing coordination. Splicing events are shown in green and editing events in red. Dependent events are joined by an edge. The darkness of the edge is proportional to the adjusted p-value of the Fisher exact test for the pool of the three replicates.
Figure 6. Proposed chronology of maturation events. Exons are shown as grey bars and introns as black lines. The editing sites are indicated by their genomic position. Grey editing sites are processed independently and thus are not included in the chronology. The preferred order of the maturation events is indicated by the numbers above the editing sites or introns.
Table 1. Quantification of known editing and splicing events.
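| Name | Type | Maturation Rate | Maturation Rate (Guillaumot et al., 2017) | Maturation Rate (Ruwe et al., 2013) |
|---|---|---|---|---|
| int_RPS16 | splicing | 4% | 4% | NA |
| int_ATPF | splicing | 89% | 82% | NA |
| int_RPOC1 | splicing | 64% | 19% | NA |
| int_YCF3_i2 | splicing | 79% | 42% | NA |
| int_YCF3_i1 | splicing | 63% | 45% | NA |
| int_CLP_i2 | splicing | 60% | 71% | NA |
| int_CLP_i1 | splicing | 69% | 62% | NA |
| int_PETB | splicing | 91% | 58% | NA |
| int_PETD | splicing | 97% | 62% | NA |
| int_RPL16 | splicing | 69% | 12% | NA |
| int_RPL2.1 | splicing | 66% | 52% | NA |
| int_NDHB.1 | splicing | 68% | 55% | NA |
| int_RPS12C | splicing | 92% | 81% | NA |
| int_NDHA | splicing | 68% | 27% | NA |
| matK_2931 | editing | 53% | 79% | 93% |
| atpF_12707 | editing | 89% | 91% | 95% |
| atpH_UTR_13210 | editing | 5% | 3% | 4% |
| rpoC1_21806 | editing | 33% | 21% | 15% |
| rpoB_23898 | editing | 87% | 82% | 85% |
| rpoB_25779 | editing | 64% | 83% | 86% |
| rpoB_25992 | editing | 69% | 76% | 94% |
| psbZ_35800 | editing | 93% | 90% | 95% |
| rps14_37092 | editing | 89% | 93% | 94% |
| rps14_37161 | editing | 92% | 97% | 96% |
| ycf3_i2_43350 | editing | 16% | 10% | 12% |
| rps4_UTR_45095 | editing | 6% | 3% | 10% |
| ndhK_ndhJ_49209 | editing | 4% | 4% | 6% |
| accD_57868 | editing | 90% | 95% | 99% |
| accD_58642 | editing | 76% | 75% | 83% |
| psbF_63985 | editing | 90% | 98% | 98% |
| psbE_64109 | editing | 95% | 100% | 100% |
| petL_65716 | editing | 79% | 91% | 86% |
| rps18_UTR_68453 | editing | 3% | 4% | NA |
| rps12_69553 | editing | 21% | 26% | 27% |
| clpP_69942 | editing | 82% | 72% | 81% |
| rpoA_78691 | editing | 78% | 76% | 91% |
| rpl23_86055 | editing | 34% | 74% | 75% |
| ycf2_as_91535 | editing | 3% | 4% | NA |
| ndhB_UTR_94622 | editing | 8% | 0% | NA |
| ndhB_94999 | editing | 88% | 93% | 94% |
| ndhB_95225 | editing | 95% | 98% | 99% |
| ndhB_95608 | editing | 87% | 84% | 80% |
| ndhB_95644 | editing | 78% | 87% | 81% |
| ndhB_95650 | editing | 88% | 91% | 84% |
| ndhB_96419 | editing | 75% | 94% | 92% |
| ndhB_96439 | editing | 6% | 4% | 6% |
| ndhB_96457 | editing | 6% | 3% | 5% |
| ndhB_96579 | editing | 90% | 89% | 90% |
| ndhB_96698 | editing | 81% | 88% | 82% |
| ndhB_97016 | editing | 94% | 94% | 95% |
| ndhF_112349 | editing | 85% | 93% | 96% |
| ndhD_116281 | editing | 76% | 83% | 92% |
| ndhD_116290 | editing | 77% | 84% | 90% |
| ndhD_116494 | editing | 88% | 90% | 93% |
| ndhD_116785 | editing | 94% | 97% | 98% |
| ndhD_117166 | editing | 35% | 33% | 45% |
| ndhG_118858 | editing | 69% | 78% | 85% |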
NA: Not Analyzed. The genomic position of each site and the corresponding nomenclature of Rüdinger et al. [34] are given in Table S1.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Guilcher, M.; Liehrmann, A.; Seyman, C.; Blein, T.; Rigaill, G.; Castandet, B.; Delannoy, E. Full Length Transcriptome Highlights the Coordination of Plastid Transcript Processing. Int. J. Mol. Sci. 2021, 22, 11297. https://doi.org/10.3390/ijms222011297
