Next Article in Journal
Clinical Outcomes and Prognostic Factors in Nonmetastatic Castration-Resistant Prostate Cancer Treated with Androgen Receptor Signaling Inhibitors Therapy
Previous Article in Journal
The Effect of Immune Checkpoint Inhibitor Therapy on Pre-Existing Gastroparesis and New Onset of Symptoms of Delayed Gastric Emptying
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy

Department of Pathology and Molecular Medicine, Queen’s University, Kingston, ON K7L 3N6, Canada
*
Author to whom correspondence should be addressed.
Cancers 2024, 16(15), 2660; https://doi.org/10.3390/cancers16152660
Submission received: 26 June 2024 / Revised: 21 July 2024 / Accepted: 24 July 2024 / Published: 26 July 2024
(This article belongs to the Section Cancer Causes, Screening and Diagnosis)

Abstract

:

Simple Summary

With advanced bioinformatic and sequencing technologies, many non-coding RNAs have been found to harbor non-canonical open reading frames (ncORFs). Some studies suggest that many ncORF-encoded microproteins or micropeptides possess important roles in cancer biology, yet the functional roles of these ncORF-encoded peptides are mostly unknown. The purpose of this review is to provide a comprehensive overview of existing knowledge on ncORF-encoded microproteins in cancer biology and highlight their roles as potential prognostic and therapeutic targets in cancer.

Abstract

Cancer develops from abnormal cell growth in the body, causing significant mortalities every year. To date, potent therapeutic approaches have been developed to eradicate tumor cells, but intolerable toxicity and drug resistance can occur in treated patients, limiting the efficiency of existing treatment strategies. Therefore, searching for novel genes critical for cancer progression and therapeutic response is urgently needed for successful cancer therapy. Recent advances in bioinformatics and proteomic techniques have allowed the identification of a novel category of peptides encoded by non-canonical open reading frames (ncORFs) from historically non-coding genomic regions. Surprisingly, many ncORFs express functional microproteins that play a vital role in human cancers. In this review, we provide a comprehensive description of different ncORF types with coding capacity and technological methods in discovering ncORFs among human genomes. We also summarize the carcinogenic role of ncORFs such as pTINCR and HOXB-AS3 in regulating hallmarks of cancer, as well as the roles of ncORFs such as HOXB-AS3 and CIP2A-BP in cancer diagnosis and prognosis. We also discuss how ncORFs such as AKT-174aa and DDUP are involved in anti-cancer drug response and the underestimated potential of ncORFs as therapeutic targets.

1. Introduction

1.1. Definition of ncORF

Non-canonical open reading frames (ncORFs) are regions of translation that occur separately from the canonical protein-coding sequences [1]. In 2004, the Human Genome Project annotated the entire human genome and identified the location and sequences of around 20,000 protein-coding genes [2]. Historically, stringent rules were placed on the prediction of protein-coding open reading frames (ORFs) to minimize false positives [3]. For example, coding sequences (CDS) must meet a size cut-off of 100 amino acids; initiate translation from the conventional AUG start codon; and have monocistronic transcription, meaning each mRNA molecule cannot encode more than one polypeptide [3]. However, these rules created a blind spot in the human proteome for ncORFs that did not fit these criteria. Over the past two decades, improved understanding of gene expression and modern detection methods has rendered these criteria obsolete. To date, numerous ncORFs have been identified and are believed to have the potential to regulate gene expression or produce proteins with unique biological roles [4].

1.2. Classification of ncORF

Technological advances have enabled the discovery of many neglected ncORFs, which are classified into different categories, such as long non-coding ORFs (lncORFs), circular ORFs (circORFs), intronic ORFs, primary microRNA-derived ORFs (pri-miORFs), and small ORFs (smORFs) (Figure 1) [1,3]. This section reviews ncORFs based on each category.
ncORFs can be found within the 5′ and 3′ untranslated regions (UTR) of the annotated CDS in an alternate reading frame from the CDS, including upstream (uORFs) and downstream ORFs (dORFs), respectively (Figure 1) [3]. uORFs regulate translation of the main CDS through interactions with the 43S ribosome pre-initiation complex in the “leaky scanning” mechanism. As the 43S ribosome pre-initiation complex scans the mRNA and encounters the uORF, which often has a near-cognate non-AUG start codon, it has three options: (1) initiate translation at the uORF and disassemble upon reaching the stop codon, resulting in no translation of the downstream CDS; (2) fail to recognize the uORF and continue scanning until it reaches the downstream CDS, resulting in CDS translation; or (3) initiate transition at the uORF, remain bound to the mRNA, and resume scanning, resulting in CDC translation [3,5]. uORFs can encode proteins that interact or functionally cooperate with the downstream CDS protein [6]. A ncORF positioned at the upstream of a protein kinase C (PKC) isoform encodes uORF2, which significantly impairs viability of breast cancer and leukemia cells through inhibition of the PKC family [7]. Compared to uORFs, the function of dORFs is less defined, but it has been hypothesized that translation of dORFs can enhance main CDS translation by recruiting ribosomes or translation initiation factors to the main CDS [8]. Unlike uORFs and dORFs, alternative ORFs (altORFs) overlap with the main CDS, but in a shifted or alternate ORF [3]. The altFUS protein is a prime example of an altORF, where it overlaps its main CDS, FUS, in an open reading frame that is shifted by a single nucleotide [9]. ncORFs can also be in-frame with the main CDS, expressing extended or truncated isoforms of the annotated proteins based on the location and presence of the ncORF start and stop codons [3,10]. For example, the MYC gene has an alternate CUG start codon upstream of the main AUG start codon, which can translate into N-terminally extended variants of the MYC protein [11].
lncORFs are encoded within long non-coding RNA (lncRNA), which are RNA transcripts longer than 200 nucleotides annotated as non-coding (Figure 1) [3,12]. lncRNAs are pervasively expressed and play important roles in gene regulation as untranslated RNA molecules [13]. Many RNA sequences that are previously considered as lncRNA have now been discovered to contain ORFs with protein-coding potential [14,15,16,17,18,19,20]. An example of a lncORF is the steroid receptor RNA activator (SRA) gene. In its RNA form, SRA functions as a nuclear receptor coactivator [21,22], while it also encodes a protein, SRAP, associated with breast cancer cell motility [21,22]. lncRNAs can originate from pseudogenes, referring to DNA regions that contain presumably non-functional, untranslatable copies annotated genes derived from the retrotransposition of processed mRNAs and segmental duplication [12,23]. Increasing evidence has revealed peptides translated from pseudogenic ncORFs [24]. In the case of lncRNAs located in between genes with no overlap with the CDS, they are recognized as intergenic, and ncORFs derived from these regions are called long intergenic non-coding ORFs (lincORFs) or intergenic ORFs [25]. Though rare, intronic ORFs with translational capabilities have also been discovered in the introns of pre-sliced mRNAs [26]. Another type of non-coding RNA (ncRNA) is micro-RNAs (miRNAs) with 18–24 nucleotides, which originate as primary transcripts of miRNAs (pri-miRNAs) and become mature miRNAs through specific cleavage and progressing [27,28]. ncORFs derived from these unprocessed transcripts of miRNAs are referred to as pri-miORFs (Figure 1).
Circular RNAs (circRNAs) are single-stranded RNA molecules covalently linked at the 5′ and 3′ ends as a product of back-splicing and display translation potential, resulting in the discovery of a new class of ncORFs, termed circORFs [3,29]. circORFs can regulate gene expression through their interactions with micro-RNA and circRNA binding proteins (cRBPs) [30]. Due to the lack of the 5′ end of linear messenger transcripts (5′ cap), circRNAs were historically considered to be a type of lncRNA [12]. However, through cap-independent initiation mechanisms, circRNAs have been demonstrated to encode functionally significant circORF proteins (Figure 1) [31]. An example of a circORF is circMAPK1, encoding MAPK1–109aa with tumor suppressive functions in gastric cancer cells through its interaction with MEK1 of the MAPK signaling pathway [32].
Finally, smORFs are sequences that encode micropeptides or microproteins with 100 amino acids or smaller in size [3]. This arbitrary size limit is due to the historical assumption that proteins smaller than 100 amino acids are statistically unlikely to be functional and the difficulty of detecting DNA sequences less than 300 nucleotides long before the emergence of modern omics-based technology, such as ribosome sequencing [3]. Regardless, all other classifications of ncORFs are also classified as smORFs if they meet the size cut-off. Outside of the nuclear genome, eight instances of smORFs have been identified in mitochondrial DNA, which encodes mitochondrial-derived peptides (MDPs) [33].

1.3. Identification of ncORF

1.3.1. Computational Techniques

Computational analyses have been the most common method for annotating canonical genes, where they are defined as the longest evolutionarily conserved AUG-containing ORF in an mRNA [34]. However, ncORFs/smORFs are trickier to predict and produce more noise, and thus alternative methods are required for their annotation [35]. The detection of ncORFs in the early stages was challenging due to the conservativeness of existing gene identification algorithms, which were not designed for ncORFs [36]. sORF finder is one of the first successful ncORF prediction tools, identificating over 2,000 intergenic ORFs in a plant species called Arabidopsis thaliana [37]. Technological improvements led to the establishment of various ncORF/smORF prediction tools and databases, such as PhyloCSF, MiPepid, uPEPperoni, and PhyloCSF [36,38,39,40,41]. In addition, machine learning approaches can be used to predict ncORFs based on features derived from genomic and transcriptomic data. Techniques such as support vector machines (SVMs) and neural networks can classify regions of the genome as coding or non-coding based on training data [42]. Databases such as Ensembl and UCSC Genome Browser provide resources for exploring ncORFs. These databases integrate data from multiple sources, including RNA-Seq and Ribo-Seq, and offer tools for visualizing and analyzing ncORFs in the context of the entire genome [43]. While bioinformatic tools provide great value in reduced cost and convenience compared to experimental validations of ncORFs, they are inadequate in confirming the translation of novel ncORFs [36]. Investigation and characterization of these translatable ncORFs require further study.

1.3.2. Experimental Techniques

(a) Ribosome profiling: During translation, ribosomes enable protein synthesis by reading mRNA transcripts. As the ribosome reads the mRNA, it protects 30–31 nucleotides of the mRNA from nuclease degradation, creating a ribosome footprint. In 2009, Ingolia et al. exploited this phenomenon to develop the Ribo-Seq technique [44]. By converting these footprints to DNA sequences and utilizing deep sequencing, they were able to map the precise positions of ribosomes and quantify translational activity. Actively translated ORFs are characterized by continuous 3-nucleotide periodicity, which results from the 80S ribosomes reading the mRNA template one codon at a time [34]. Furthermore, ribosome footprint density can be used to deduce the rate of translation for a particular polypeptide, with higher ribosome density meaning slower elongation, and vice versa [45]. Another advantage of Ribo-Seq is its ability to identify ncORFs regardless of their start codon, which is invaluable since a large portion of ncORFs do not initiate translation with an AUG start codon [46,47]. Ribo-Seq data can be analyzed using computational methods, such as RibORF, which calculates the overall probability of translation [24,35]. However, validation of ncORFs at the protein level is required, as ribosome occupancy alone cannot distinguish between coding and non-coding RNA transcripts, limiting the reliability of Ribo-seq [36,48,49]. To reduce the number of false positives, a method called polysome profiling or Poly-Ribo-Seq was developed [50]. Scanning of the 40S ribosome subunit alone can result in ribosome footprints and the false detection of translation. Poly-Ribo-Seq takes advantage of the fact that polysomes, which consist of multiple ribosomes, bind to mRNAs collectively during active translation. By isolating polysomal fractions, active translation regions can be differentiated from false positives derived from single ribosomes or ribosome subunits. Yet, this is not a perfect solution as it leaves a blind spot for particularly short smORFs, which are not long enough to bind multiple ribosomes [50].
(b) Mass spectrometry (MS): MS-based proteomic techniques are often used to complement Ribo-Seq in the direct identification and quantification of ncORF-encoded proteins [36,51]. MS-based approaches can also aid in determining post-translational modifications, furthering our understanding of the protein’s functional mechanisms [52].
(c) CRISPR/Cas9 genome editing: the biological functions of ncORF-encoded proteins can be validated using CRISPR/Cas9-based approaches [36,53,54]. Knockout (loss-of-function) or overexpression (gain-of-function) assays can be performed to elucidate the functions of specific ncORF-encoded proteins [36,53,54]. The CRISPR/Cas9 system can also be used to observe the expression and localization of ncORF-encoded proteins by knocking-in epitope tags into DNA sequences of ncORFs, which can be detected by the corresponding antibodies [55]. Genome-wide CRISPR-based mutagenesis screens have successfully identified high-priority ncORFs that may encode functionally significant proteins such as GREP1, ASNSD1-uORF, or ASDURF [4,53,54]. CRISPR screening can also be combined with RNA sequencing (RNA-Seq) to confirm the biological function of ncORFs. By perturbing expression of candidate ncORFs using CRISPR/Cas9 and observing the changes in RNA-seq profiles within a single cell, the molecular mechanisms of the ncORF-encoded peptides can be readily elucidated [53].

2. ncORFs as Regulators of Cancer HallmarksIntroduction

Disruption of regulatory circuits that govern cell homeostasis enables the transformation of normal cells into malignant cancers, driven by genetic alternations [56]. Over two decades ago, Hanahan and Weinberg proposed six essential hallmarks acquired by the cancer cell genotypes, which were further extended to ten hallmarks including increased proliferation, reduced cell death, loss of tumor suppressor genes, increased angiogenesis, enabling replicative immortality, activating invasion and metastasis, immune escape, inflammation, genomic instability, and dysregulated metabolism [56,57]. Previous studies have shown that ncRNAs play important roles in regulation of various hallmarks of cancer [58]. In this review, we discuss the implication and molecular mechanism underlying the role of known ncORFs in different cancer hallmarks (Figure 2 and Figure 3).

2.1. Cell Proliferation and Death

The development of cancer results in continual unregulated cell proliferation and inhibition of cell death [56]. Unlike normal cells, cancer cells can grow uncontrollably, invade normal tissues, and spread throughout the body, leading to metastasis [56]. This loss of growth control is a net outcome of abnormal cell regulatory pathways [56].

2.1.1. Cell Proliferation

Normal cells are strictly regulated to inhibit uncontrolled proliferation [56]. However, cancer cells can grow uncontrollably by sustaining self-sufficient growth signals and evading growth suppressors [56]. Many proteins translated from ncORFs foster cell proliferation. For instance, a novel protein, Cancer-Associated Small Integral Membrane Open reading frame 1 (CASIMO1), is preferentially expressed in hormone-receptor-positive breast tumors and regulates proliferation and cell cycle [59]. CASIMO1 interacts with squalene epoxidase (SQLE) to induce ERK phosphorylation [59]. Loss of SQLE mimics the effect of CASIMO1 deficiency on breast cancer cell proliferation, indicating the role of CASIMO1 in modulating the MAPK pathway through its interaction with SQLE (Figure 2) [59]. These results suggest the oncogenic role of CASIMO1 in breast cancer cells.
Additionally, the proliferation of HCC cells is modulated by a lncRNA-encoded peptide, SMIM30, which functions as an adaptor protein for membrane anchoring of non-receptor tyrosine kinases, SRC/YES1, to promote downstream MAPK signaling activation [60]. Another example is HOXB-AS3, a lncRNA-encoded peptide that exhibits opposing effects in different cancer types. Overexpression of the HOXB-AS3 protein enhances the proliferation and viability of oral squamous cell carcinoma cells through c-Myc mRNA stabilization [61]. Conversely, tumor growth of CRC is suppressed by the HOXB-AS3 peptide, which blocks glucose metabolic reprogramming through inhibition of the splicing of PKM1 (Figure 2) [62]. A ncORF found in C20orf204 RNA encodes C20orf204-189AA, which enhances proliferation of HCC cells when overexpressed; the mechanistic details of this peptide in promoting HCC-specific proliferation remain to be investigated [63]. The proliferative capability of cancer cells can also be promoted by other ncORF-encoded peptides, including ACLY-BP and circPPP1R12A-73aa [64,65]. Knockout of ACLY-BP significantly suppresses the proliferation of clear cell renal cell lines, suggesting its important role in maintaining the stability of ATP citrate lyase (ACLY), leading lipid metabolic shift to promote cell proliferation in clear cell RCC (ccRCC) (Figure 2) [65]. Increased proliferation ability was observed in colon cancer (CC) cells with circPPP1R12A-73aa overexpression, which subsequently activated YAP activity [64]. In addition, CRISPR knockout of 57 of the 553 ncORFs (10%) demonstrated a growth-inhibitory effect [54].
Although many proteins translated from ncORFs often display oncogenic roles in cancers, some peptides can exhibit tumor-suppressive roles. FBXW7-185aa, a novel protein encoded by the circular form of the FBXW7 gene, de-stabilizes cMyc by interacting with a de-ubiquitinating enzyme, USP28, to inhibit the proliferation and cell cycle acceleration in glioma cells (Figure 2) [66]. Two peptides encoded by circRNAs, AKT3-174aa and PINT87aa, have been discovered to suppress cell proliferation of GBM through two independent studies [67,68]. Specifically, AKT3-174aa negatively regulates the RTK/PI3K pathway by blocking AKT phosphorylation at the Thr308 site, and PINT87aa suppresses oncogenic transcriptional elongation through PAF1C [67,68]. The protein-coding capacity of TINCR, a ncRNA involved in epidermal cell differentiation, has recently been identified [69]. Expression of the pTINCR protein significantly suppresses the tumor growth through interaction with the SUMO/CDC42 complex in squamous cell carcinoma (Figure 2) [69]. KRASIM, another microprotein translated from lncRNA NCBP2-AS2, appears to reduce KRAS levels, suppressing ERK/MAPK signaling to inhibit the growth and proliferation of HCC cells (Figure 2) [70].

2.1.2. Cell Death

The apoptotic program can be initiated by various physiological signals, triggering a series of choreographed molecular steps involving in disruption of cellular membranes, cytoskeletal collapsing, chromatin condensation, and nuclear fragmentation [56]. The main apoptotic pathways are extrinsic and intrinsic as well as an additional T cell mediated perforin/granzyme pathway [71]. Although each pathway depends on specific activating signals, they converge on the same execution pathway triggered by caspase-3 activation [71]. Tumor cells can gain apoptotic resistance by expressing anti-apoptotic proteins (e.g., Bcl-2) or inhibiting proapoptotic proteins (e.g., Bax) [71]. Many peptides translated from ncORFs have been implicated in cell death regulation. For example, humanin (HN) inhibits apoptosis by preventing Bax translocation from cytosol to mitochondria (Figure 2) [72]. Inhibition of HN expression resensitizes the cells to Bax [72]. One of lncRNAs, BVES-AS1, encodes a short peptide, named BVES-AS1-201-50aa, which promotes cell viability by activating Src/mTOR signaling in CRC cells [73]. ELABELA is a 54aa peptide hormone that activates PI3K/AKT signaling via the apelin receptor to stimulate hESC growth and maintain self-renewal through inhibition of apoptotic pathways (Figure 2) [19]. Another lncRNA-encoded microprotein regulating cell viability is YY1-blocking microprotein (YY1BM) [74]. Under nutrient deprivation, YY1BM induces apoptosis by inhibiting the interaction of YY1 and androgen receptor in male esophageal squamous cell carcinoma (Figure 2) [74]. As previously described, pri-miRNAs, precusors of miRNAs, have been found to harbor ncORFs that encode functional peptides [28]. Microproteins encoded by these ncORFs are termed miRNA-encoded peptides (miPEPs). A novel miPEP, miPEP133, has demonstrated its tumor suppressive ability [75]. Overexpression of miPEP133 induces cell cycle arrest and apoptosis by blocking the interaction between mitochondaril heat shock protein 70 and its binding partner (Figure 2) [75].

2.2. Metastasis

2.2.1. Cell Migration and Invasion

Metastases account for 90% of cancer-related mortality [76]. The ability of a tumor to colonize foreign environments enhances its survival, especially when the nutrients and space become limited at the primary site [56]. Cell migration and invasion are initial steps during metastatic dissemination involving the activation of cellular pathways that modulate the dynamics of cytoskeletons in tumor cells [77]. The epithelial-to-mesenchymal transition (EMT) is an important cellular program activated by cancer cells to acquire invasive phenotypes during a metastatic event [78]. Loss of adherent junctions, acquisition of fibroblastic morphology, and expression of matrix-degrading proteases are common traits established during EMT [78]. A reversal of EMT traits occurs to recapitulate epithelial characteristics for colonization at a distant secondary place [78]. This process is often termed as mesenchymal to epithelial transition (MET) [78]. Regulation of F-actin dynamic is essential for cell motility, and loss of CASMO1 impacts actin assembly, dramatically reducing the migration ability of BC cells [59]. Moreover, two lncRNA-encoded peptides, SMIM30 and ZFAS1, independently promote cell migration and invasion of HCC cells [60,79], though their involvement of HCC metastatic events in vivo needs further exploration. BVES-AS1-201-50aa is one of many lncRNA-encoded proteins involved in tumor progression [73]. This 50 aa microprotein enhances the migratory and invasive abilities of CRC cells in vitro [73]. Overexpression of BVES-AS1-201-50aa is thought to increase the production of metastasis-associated protein MM9 in CRC cells (Figure 3) [73]. Through ribosome profiling and RNA-seq, a conserved and secreted peptide, Toddler, also known as APELA, is found to be expressed by a noncoding RNA in zebrafish, modulating cell movements during gastrulation [80,81]. Later research reveals that APELA is upregulated in ovarian cancer, contributing to cell migration and cancer progression [81]. Other than lncRNA-encoded proteins, circRNA is another type of ncRNAs containing ncORFs. One small peptide, circPPP1R12A-73aa, is expressed by such ncORFs from circPPP1R12A and displays metastasis-promoting abilities in CC by activating YAP activity (Figure 3) [64].
A lncRNA-encoded microprotein, SMIM26, behaves as a tumor suppressor by inhibiting EMT in ccRCC [82]. Overexpression of SMIM26 leads to reduced metastatic phenotypes, while silencing SMIM26 greatly enhances the migration and invasion of RCC cells [82]. Meng et al. (2023) revealed that SMIM26 interacts with AGK, a mitochondrial acylglycerol kinase, as well as glutathione transport regulator SKC25A11 to interfere with AKT-signaling-mediated metastasis (Figure 3) [82]. The role of TGF-β signal transduction is well recognized in cancer metastasis, and induced EMT phenotypes and cancer stemness can be achieved by TGF-β treatment in breast cancer [18]. CIP2A-BP is a microprotein encoded by lncRNA and inhibits metastatic capabilities when overexpressed in TNBC cells [18]. Expression of CIP2A-BP is usually reduced in TNBC by TGF-β signaling to induce tumor invasion and metastasis (Figure 3) [18]. Cell migration and invasion can also be inhibited by a previously discussed microprotein, miPEP133, though its anti-metastatic role in vivo has not been fully illustrated [75].

2.2.2. Angiogenesis

Without blood supplies, nutrients and oxygen become depleted quickly, limiting the tumor from growing beyond a size of more than 1 mm in diameter and subsequently forcing it to enter a dormant state [83]. To sustain sufficient growth support, cancer cells promote vascularization by inducing angiogenesis, another hallmark for tumor development [83]. Normally, angiogenesis occurs during wound healing [83]. A fine-tuned balance of angiogenic activators and inhibitors controls the production of new blood vessels [83]. However, tumor cells can induce the angiogenic switch by tipping this balance to the pro-angiogenic state [83]. Compared to normal blood vessels, tumor vasculature is organized chaotically, resulting in abnormal blood flow and leaky vessels, which increase the likelihood of distant metastases [83]. Two lncRNA-encoded microproteins display opposing roles during angiogenic development in TNBC, including ASRPS and XBP1SBM [16,84]. ASRPS inhibits angiogenesis by regulating the VEGF pathway through its interaction with STAT3, whereas XBP1SBM can upregulate VEGF expression level, promoting angiogenesis in TNBC (Figure 3) [16,84].

2.3. Inflammation and Immune Responses

Tumor cells express specific antigens that immune cells recognize as non-self and targeted for destruction [85]. Antigen-presenting cells (APC) such as dendritic cells (DCs) recognize and display tumor-specific antigens, which elicit T-cell responses to destroy tumor cells [85]. However, tumor cells exploit various mechanisms to evade immune surveillance, such as downregulating antigens or inducing an immunosuppressive microenvironment [85]. A common strategy is to induce immune tolerance to cancer by expressing the ligands for the receptor of the Ig superfamily, PD-1 [85].
Advancements in proteomics technologies allow the identification of peptides or proteins derived from ncORFs. Several ncORF-encoded microproteins contribute to immune responses (Figure 3). For example, the identification of non-annotated ORFs led to the discovery of PC3-secreted microprotein (PSMP), a chemoattractant protein highly expressed in some prostate cancers. PSMP regulates inflammation during tumor development through binding to the CCR2B receptor and activates downstream ERK/MAPK signaling (Figure 3) [86]. Moreover, protein-coding ORFs within lncRNA Aw112010 express proteins essential for mucosal immunity [87]. Another microprotein encoded by a non-protein coding transcript is miPEP155, which suppresses DC-driven auto inflammation and T-cell priming by regulating antigen presentation through disrupting HSP70–HSP90 interaction (Figure 3) [88]. Deep mining of non-canonical proteins also results in the discovery of tumor-specific peptides, highlighting their potential for establishing cancer immunotherapies [89,90].

2.4. DNA Damage Response and Genetic Instability

Genomic integrity relies on DNA monitoring and repair mechanisms to ensure a pristine state of the whole genome [56]. Genomic DNA is sensitive to exogenous and endogenous damage, resulting in genetic instability [91]. Diverse DNA damage response mechanisms have evolved to sustain the genome integrity, including canonical homologous recombination repair (HRR), non-homologous end-joining (NHEJ) pathways, and the post-replication repair (PRR) pathway [91]. Yet, tumor cells accumulate substantial genomic alternations that allow them to acquire survival advantages during carcinogenesis [56]. Defects of DNA repair mechanisms explain the high mutational rate found in cancer cells [56]. Genomic instability enabled by malfunctions of DNA repair machinery then promotes clonal diversity and selection [56]. Known microproteins regulating genome stability include MRI-2, which interacts with DNA end-binding protein Ku in the nucleus to stimulate the NHEJ DNA repair pathway (Figure 3) [92]. PACMP and DDUP are reported lncRNA-encoded microproteins that also trigger DNA damage responses through different mechanisms [91,93]. PACMP modulates DDR by preventing degradation of CTBP-interacting protein and enhancing PARP1-dependent poly(ADP-ribosyl)ation (Figure 3) [93]. DNA damage induces upregulation of DDUP, which regulates PCNA ubiquitination and RAD18 retention at DNA damage sites, inducing HRR and PRR mechanisms [91]. The ubiquitin-like microprotein TINCR is reported to be a protective factor preventing UV exposure as accelerated skin lesions were observed in UV-treated mice with TINCR mutant [69].

2.5. Metabolism

Reprogramming of cellular metabolism is another core hallmark of cancer development, supporting the acquisition and maintenance of malignant phenotypes [94]. Anabolic growth in nutrient-depleted conditions is made possible by altering metabolic activity in cancer cells for survival [94]. The Warburg effect or aerobic glycolysis is a classic example of metabolic reprogramming found in cancer [94]. In the 1920s, Warburg discovered that cancer cells constitutively uptake glucose and convert it into lactate, regardless of the presence of oxygen [94]. Elevated glycolytic intermediates are utilized as macromolecule precursors to fulfill the demands of proliferating cells [94]. Additionally, intermediates from the tricarboxylic acid cycle can be supplied into subsidiary pathways for macromolecule synthesis [94]. Thus, the ability of cancer cells to reprogram metabolic pathways supports tumorigenic proliferation and progression. The selective expression of glycolytic enzyme pyruvate kinase M (PKM) isoforms is regulated by splicing factors, such as hnRNP A1 [62]. Re-expression of PKM2 in human cancers is thought to promote aerobic glycolysis, enabling proliferative advantages [62]. A small peptide encoded by the lncRNA HOXB-AS3 suppresses glucose reprogramming by antagonizing hnRNP A1-mediated PKM splicing in CRC (Figure 3) [62]. Lactate utilization is restricted by MP31, a microprotein encoded from 5′ UTR of PTEN in GBM [95]. Loss of MP31 increased global lactate metabolism, enhancing tumorigenicity [95].
Abnormal lipid oxidation is prevalent during tumor development [96,97]. In addition to glucose, tumor cells adapt to the nutrient-deprived microenvironment by increasing fatty acid metabolism [96]. A key enzyme of fatty acid oxidation (FAO), carnitine palmitoyl transferase 1A (CPT1A), is upregulated by an exosomal lncAKR1C2-encoded microprotein, pep-AKR1C2, inducing a metabolic switch towards FAO in gastric cancer (Figure 3) [96,98]. Lipid metabolism and homeostasis can be influenced by microproteins, including CASIMO1 and ACLY-BP, through distinct mechanisms, enhancing proliferation in BC and ccRCC, respectively (Figure 3) [59,65].

3. ncORFs in Cancer Diagnosis and Prognosis

Selective expression of certain ncORFs has been observed in specific cancers and is associated with worse disease outcomes, enabling early detection and progression prediction of certain diseases [99,100,101,102,103]. Here, we discuss known non-canonical proteins and their involvements as diagnostic and prognostic factors (Figure 4).

3.1. Colorectal Cancer

HOXB-AS3 is a lncRNA that plays an oncogenic role in human cancers and displays an elevated RNA expression in the cancer cells [104,105,106,107]. Interestingly, Huang et al. (2017) discovered that HOXB-AS3 encodes a small peptide, and they found that the expression of HOXB-AS3 RNA and the peptide was downregulated in colorectal cancer (CRC) tissues compared to paired adjacent normal tissues [62]. Furthermore, the tumor suppressive effects of the HOXB-AS3 peptide were revealed, and a low level of the HOXB-AS3 peptide was associated with worse prognosis of CRC patients [62]. Specifically, Kaplan–Meier survival analyses indicated that CRC patients with high HOXB-AS3 peptide expression had a 1.6-fold increase in mean overall survival time compared to those with low expression, suggesting that high expression of the HOXB-AS3 peptide correlates with reduced CRC-related mortalities [62]. The expression of SRSP, another lncORF-encoded peptide, was significantly elevated in CRC tissues compared to normal colorectal tissues [108]. Further investigation revealed that SRSP expression was positively associated with clinicopathological features of CRC patients, such as histological grade, presence of lymph node metastasis, clinical stage, and risk of death [108]. CRC patients with upregulated SRSP expression had 3.3-fold reduction in median survival time compared to those with low expression [108]. Other examples of potentially prognostic or diagnostic ncORF-encoded peptides in CRC include RBRP and ASAP, derived from lncRNAs, and PPP1R12A-C, derived from circRNA [15,64,109].

3.2. Breast Cancer

ASRPS, a lncORF-encoded peptide, is significantly downregulated in triple-negative breast cancer (TNBC) compared to other breast cancer subtypes, and low ASRPS expression is correlated with worse prognosis and overall survival in TNBC patients [16]. The low expression of another lncORF-encoded peptide, CIP2A-BP, showed a similar association with poorer prognosis in TNBC [18]. Boix and colleagues (2022) demonstrated that the lncORF-encoded peptide pTINCR was upregulated in epithelial tissue upon cellular stress and promoted epithelial differentiation in several cancers of epithelial origin, including luminal breast cancer [70], cutaneous squamous cell carcinoma, and lung adenocarcinoma. Although pTINCR peptide expression was not analyzed, correlation studies using public data revealed that high TINCR lncRNA expression was associated with more favorable prognoses in various epithelial cancers including breast cancer, suggesting the pTINCR peptide may be prognostically significant [70].

3.3. Glioblastoma

SHPRH-146aa is a circORF-encoded peptide that has the potential to serve as a prognostic or diagnostic biomarker for glioblastoma (GBM) [110]. The expression of SHPRH-146aa was downregulated in GBM tissue samples, and longer patient survival times were correlated with higher SHPRH-146aa expression [110]. PINT87aa, another circORF-encoded peptide, displayed a similar association with clinical prognosis of GBM [68]. When comparing GBM of varying WHO grades and normal brain tissues, PINT87aa expression was significantly reduced in GBM tissues and negatively correlated with tumor grade, with WHO grade IV glioblastomas displaying the lowest PINT87aa expression [68]. Another circORF-encoded peptide, FBXW7-185aa, also exhibited reduced expression and positive correlations with survival times of GBM patients [66].

3.4. Hepatocellular Carcinoma (HCC)

In HCC, the lncORF-encoded peptide SMIM30 has been associated with poor patient survival rates [60]. MPM, also a lncORF-encoded peptide, was significantly downregulated in HCC tissues, with low MPM expression associated with increased HCC metastasis, recurrence, and mortality [111].

3.5. Ovarian Cancer (OC)

DDUP is a peptide encoded within the lncRNA CTBP1-DT and is involved in DNA damage response (DDR) signaling [112]. Ren and colleagues (2023) demonstrated that high CTBP1-DT expression conferred markedly shorter overall and progression-free survival in ovarian, lung, and gastric cancer patients [112]. Notably, it has been shown that the DDUP peptide, not the CTBP1-DT lncRNA, promoted chemoresistance in patient-derived ovarian cancer cells [112].

3.6. Prostate Cancer

Although these results emphasize that ncORFs and their encoded peptides have potential clinical diagnostic and prognostic values in various human cancers, much of their detection occurs in the context of biopsied tissue samples. If these peptides can be detected non-invasively, such as in body fluids like blood, serum, or urine, it will immensely benefit their adoption as cancer biomarkers in clinical settings [99]. An example of such an ncORF is SHLP2, a mitochondrial-derived peptide that has demonstrated potential diagnostic significance in prostate cancer (PC) [113]. Xiao and colleagues (2017) analyzed serum SHLP2 concentrations in black and white PC patients and found SHLP2 expression to be significantly reduced in white confirmed PC cases compared to white healthy controls, while no significant difference in SHLP2 expression could be observed in black cases vs. controls. They have shown that downregulation of SHLP2 peptide expression was associated with greater PC risk in white men, and high serum SHLP2 concentration accurately ruled out PC in both racial groups, demonstrating potential as a novel diagnostic biomarker for PC [113]. Moreover, no association was found between SHLP2 expression and PC clinical grade, suggesting that SHLP2 did not confer any prognostic value [113]. However, the diagnostic value of SHLP2 requires further validation since a relatively small dataset was analyzed in the study.

3.7. Other Types of Cancer

miPEP133, a pri-miORF-encoded peptide, was demonstrated by Kan and colleagues (2020) to be a favorable prognostic biomarker for nasopharyngeal carcinoma (NPC) [75]. Low miPEP133 mRNA levels were associated with advanced metastatic progression in NPC and significantly worse overall survival in NPC patients [75].
Sun and colleagues demonstrated that lncRNA ASH1L-AS1 encoded a peptide called APPLE, and both the lncRNA and the peptide were significantly upregulated in preliminary acute myeloid leukemia (AML) patients when compared to healthy controls or AML patients in complete remission [17]. AML patients with high ASH1L-AS1 expression were more likely to achieve a 5-year leukemia-free survival than those with low expression [17]. Interestingly, ASH1L-AS1 expression could also be used to differentiate patients in complete remission status and those with preliminary diagnosis, further supporting the status of ASH1L-AS1 and APPLE as predictors of poor AML prognosis [17].
circMAPK1, a circORF-encoded peptide, exhibited downregulated expression in gastric cancer (GC) tissue compared to paired adjacent normal tissues [32]. High circMAPK1 expression was also linked with favorable clinicopathologic parameters (e.g., tumor size, lymphatic invasion, clinical stage) as well as longer overall survival [32].
MIAC, a lncORF-encoded peptide, was demonstrated to have higher expression in renal cell carcinoma (RCC) tissue compared with normal tissues in TCGA database screening as well as clinical samples [114]. RCC patients with high MIAC expression demonstrated significantly higher survival rates than those with low expression [114]. MIAC expression was also significantly higher in patients with early stage renal cancer than those in advanced stages [114]. This result emphasizes that MIAC lncORF has potential clinical diagnostic and prognostic value in RCC.
To identify novel prognostic and diagnostic biomarkers for cancer, proteomic approaches can be applied. For example, in a recent study, global micro-/alt-protein quantitation in two human leukemia cell lines, K562 and MOLT4, identified 12 unannotated proteins that were differentially expressed in these cell lines [115]. Further quantification of ncORF protein in clinical samples will identify novel ncORF biomarkers for the diagnosis and prognosis of human cancers.

4. ncORF in Cancer Therapy

4.1. ncORF in Anti-Cancer Drug Response

Drug resistance occurs when cancer cells tolerate a treatment, greatly limiting the efficiency of anti-tumor therapies in many patients [116,117]. Genomic instability is one factor contributing to drug resistance, which can be intrinsic or acquired [117]. Innate resistance exists in patients even before administrating anti-cancer drugs due to pre-existing genetic perturbations, tumor heterogeneity, or activation of defense mechanisms [117]. For example, resistance to cisplatin treatment was found in gastric cancer patients with HER2 overexpression, which upregulated the expression of EMT-related transcriptional factor Snail [118]. Moreover, studies show that Snail and Slug not only mediate EMT but also promote stem-like characteristics, conferring resistance to radiotherapy and chemotherapy [118,119]. Alternatively, relapse to treatment can occur when new mutations are acquired in tumor cells, resulting in a gradual decrease in therapy efficacy [117]. For instance, a point mutation from threonine-to-isoleucine in the BCR-ABL kinase domain can cause resistance or relapse of tyrosine kinase inhibitor treatment in patients with chronic myelogenous leukemia [120]. Several microproteins have been shown to affect drug efficiency in cancers (Figure 5). Radiation sensitivity was enhanced by overexpressing AKT3-174aa in GBM cells, while loss of AKT3-174aa expression regained resistance to radiation [67]. Chemoresistance can be acquired by evading DNA-damage-induced genetic instability [117]. DDUP is a lncRNA-encoded microprotein and functions as a regulator of DDR, promoting resistance to DNA-damaging chemotherapies [91]. Inhibition of DDUP by the ATR inhibitor Berzosertib re-sensitized ovarian cancer cells to cisplatin treatment, indicating its vital role in affecting drug response [91]. The tumor-suppressive role of a microprotein, N1DARP, was recognized in pancreatic cancer since it could suppress Notch1 signaling pathways, which contribute to chemoresistance [121].

4.2. ncORFs as Potential Drug Targets for Cancer Therapy

Given the fact that ncORFs or microproteins play important roles in various human diseases including cancer, their roles as therapeutic targets have been proposed or demonstrated recently [122]. Targeted therapy is a high-precision treatment designed to attack certain molecules that play oncogenic roles in tumors [99]. Its goal is to minimize non-specific cytotoxicity while destroying the growth and survival of cancer cells [99]. Molecular targeted therapeutic agents act on growth factors, cell surface antigens, and signaling pathways that initiate and maintain cancer hallmarks to block cancer progression [116]. Many anti-cancer therapies target different anti-apoptotic components, such as Bcl-2 family proteins, to induce apoptosis in tumor cells [116]. Molecular targeted therapies offer significant advantages in treating different types of cancers, including breast, gastric, colon, and lung cancers [116]. Compared to conventional chemotherapy, inhibitors of EGFR-tyrosine kinase improve clinical outcomes in NSCLC patients with EGFR mutations [116]. Despite the observed advantages, limitations exist and restrict the efficiency of targeted therapy in cancer patients [116]. Since targeted treatment requires specific biomarkers to be effective, patients who do not express such targets will respond poorly [116]. Several treatment options have been proposed for cancer patients [98]. Cancer cells are enriched with ncORF MHC-I-associated peptides (MAPs), suggesting their potential as a source of tumor antigens for immunotherapeutic applications. Additionally, pharmacological targeting of cancer-specific ncRNAs to silence pro-oncogenic microprotein translation may lead to downstream inhibition of previously undruggable targets. Microproteins can also provide valuable insights for drug design by identifying protein–protein interaction hotspots or inhibitor binding sites. Exploring microprotein conjugation to other drugs may offer potential applications in molecular glues or heterobifunctional compounds. Microproteins are also promising candidates as non-antibody binding scaffolds. Furthermore, the differential expression of ncORFs and microproteins in cancer enhances their potential as biomarkers, particularly in cases involving differentially expressed secreted microproteins.

5. Conclusions and Future Perspective

The recognition of ncORFs has attracted increasing attention and uncovers new aspects of regulating biological processes. The protein-coding capacity of the human genome is greatly underestimated and remains to be fully explored. Improved approaches prompt the identification of many ncORF-encoded peptides involved in cancer pathogenesis. Even though an increasing number of proteins expressed from ncORFs are unveiled, only a small set of these peptides are functionally characterized. Regulatory functions of many microproteins translated from ncRNAs are reported to influence tumor progression. Many ncRNA-derived microproteins can act as tumor suppressors or inducers. Thus, understanding the role of these ncRNA-encoded proteins offers critical prospects for cancer therapies.
In this review, we describe different types of ncRNAs reported to have coding potential, mention present techniques used for capturing ncORFs, and summarize functional proteins encoded by ncORFs implicated in cancer research. The discovery of ncORF-encoded microproteins expands our understanding of tumor development as many of them display prognostic and/or carcinogenic roles in human cancers. Despite more and more studies focusing on ncORFs, only a small portion of ncRNA-encoded microproteins have been identified and validated, and their mechanisms of action in cancer remain mostly unclear. Therefore, identifying and characterizing noncanonical proteins may lead to the establishment of novel biomarkers and drug targets for cancer diagnosis and therapeutics.

Author Contributions

Conceptualization, X.Y.; writing—original draft preparation, A.G. and C.C.; writing—review and editing, A.G., C.C. and X.Y.; visualization, A.G. and C.C.; funding acquisition, X.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the Canadian Institute of Health Research (CIHR), grant numbers 186142 and 148629.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Liu, Y.; Zeng, S.; Wu, M. Novel insights into noncanonical open reading frames in cancer. Biochim. Et Biophys. Acta (BBA)—Rev. Cancer 2022, 1877, 188755. [Google Scholar] [CrossRef] [PubMed]
  2. Consortium, I.H.G.S. Finishing the euchromatic sequence of the human genome. Nature 2004, 431, 931–945. [Google Scholar]
  3. Wright, B.W.; Yi, Z.; Weissman, J.S.; Chen, J. The dark proteome: Translation from noncanonical open reading frames. Trends Cell Biol. 2022, 32, 243–258. [Google Scholar] [CrossRef] [PubMed]
  4. Hofman, D.A.; Ruiz-Orera, J.; Yannuzzi, I.; Murugesan, R.; Brown, A.; Clauser, K.R.; Condurat, A.L.; Dinter, J.T.V.; Engels, S.A.G.; Goodale, A.; et al. Translation of non-canonical open reading frames as a cancer cell survival mechanism in childhood medulloblastoma. Mol. Cell 2024, 84, 261–276. [Google Scholar] [CrossRef]
  5. Ferreira, J.P.; Noderer, W.L.; Arce, A.J.D.D.; Wang, C.L. Engineering ribosomal leaky scanning and upstream open reading frames for precise control of protein translation. Bioengineered 2014, 5, 186–192. [Google Scholar] [CrossRef]
  6. Samandi, S.; Roy, A.V.; Delcourt, V.; Lucier, J.-F.; Gagnon, J.; Beaudoin, M.C.; Vanderperre, B.; Breton, M.-A.; Motard, J.; Jacques, J.-F.; et al. Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins. Elife 2017, 6, e27860. [Google Scholar] [CrossRef]
  7. Jayaram, D.R.; Frost, S.; Argov, C.; Liju, V.B.; Anto, N.P.; Muraleedharan, A.; Ben-Ari, A.; Sinay, R.; Smoly, I.; Novoplansky, O.; et al. Unraveling the hidden role of a uORF-encoded peptide as a kinase inhibitor of PKCs. Proc. Natl. Acad. Sci. USA 2021, 118, e2018899118. [Google Scholar] [CrossRef]
  8. Wu, Q.; Wright, M.; Gogol, M.M.; Bradford, W.D.; Zhang, N.; Bazzini, A.A. Translation of small downstream ORFs enhances translation of canonical main open reading frames. EMBO J. 2020, 39, e104763. [Google Scholar] [CrossRef]
  9. Brunet, M.A.; Jacques, J.-F.; Nassari, S.; Tyzack, G.E.; McGoldrick, P.; Zinman, L.; Jean, S.; Robertson, J.; Patani, R.; Roucou, X. The FUS gene is dual-coding with both proteins contributing to FUS-mediated toxicity. EMBO Rep. 2021, 22, e50640. [Google Scholar] [CrossRef]
  10. Valdivia-Francia, F.; Sendoel, A. No country for old methods: New tools for studying microproteins. iScience 2024, 27, 108972. [Google Scholar] [CrossRef]
  11. Hann, S.R.; King, M.W.; Bentley, D.L.; Anderson, C.W.; Eisenman, R.N. A non-AUG translational initiation in c-myc exon 1 generates an N-terminally distinct protein whose synthesis is disrupted in Burkitt’s lymphomas. Cell 1988, 52, 185–195. [Google Scholar] [CrossRef] [PubMed]
  12. Mattick, J.S.; Amaral, P.P.; Carninci, P.; Carpenter, S.; Chang, H.Y.; Chen, L.-L.; Chen, R.; Dean, C.; Dinger, M.E.; Fitzgerald, K.A.; et al. Long non-coding RNAs: Definitions, functions, challenges and recommendations. Nat. Rev. Mol. Cell Biol. 2023, 24, 430–447. [Google Scholar] [CrossRef] [PubMed]
  13. Statello, L.; Guo, C.-J.; Chen, L.-L.; Huarte, M. Gene regulation by long non-coding RNAs and its biological functions. Nat. Rev. Mol. Cell Biol. 2021, 22, 96–118. [Google Scholar] [CrossRef] [PubMed]
  14. Patraquim, P.; Magny, E.G.; Pueyo, J.I.; Platero, A.I.; Couso, J.P. Translation and natural selection of micropeptides from long non-canonical RNAs. Nat. Commun. 2022, 13, 6515. [Google Scholar] [CrossRef]
  15. Ge, Q.; Jia, D.; Cen, D.; Qi, Y.; Shi, C.; Li, J.; Sang, L.; Yang, L.-J.; He, J.; Lin, A.; et al. Micropeptide ASAP encoded by LINC00467 promotes colorectal cancer progression by directly modulating ATP synthase activity. J. Clin. Invest. 2021, 131, e152911. [Google Scholar] [CrossRef] [PubMed]
  16. Wang, Y.; Wu, S.; Zhu, X.; Zhang, L.; Deng, J.; Li, F.; Guo, B.; Zhang, S.; Wu, R.; Zhang, Z.; et al. LncRNA-encoded polypeptide ASRPS inhibits triple-negative breast cancer angiogenesis. J. Exp. Med. 2020, 217, e20190950. [Google Scholar] [CrossRef]
  17. Sun, L.; Wang, W.; Han, C.; Huang, W.; Sun, Y.; Fang, K.; Zeng, Z.; Yang, Q.; Pan, Q.; Chen, T.; et al. The oncomicropeptide APPLE promotes hematopoietic malignancy by enhancing translation initiation. Mol. Cell 2021, 81, 4493–4508. [Google Scholar] [CrossRef]
  18. Guo, B.; Wu, S.; Zhu, X.; Zhang, L.; Deng, J.; Li, F.; Wang, Y.; Zhang, S.; Wu, R.; Lu, J.; et al. Micropeptide CIP2A-BP encoded by LINC00665 inhibits triple-negative breast cancer progression. EMBO J. 2020, 39, e102190. [Google Scholar] [CrossRef]
  19. Ho, L.; Tan, S.Y.X.; Wee, S.; Wu, Y.; Tan, S.J.C.; Ramakrishna, N.B.; Chng, S.C.; Nama, S.; Szczerbinska, I.; Chan, Y.-S.; et al. ELABELA Is an Endogenous Growth Factor that Sustains hESC Self-Renewal via the PI3K/AKT Pathway. Cell Stem Cell. 2015, 17, 435–447. [Google Scholar] [CrossRef]
  20. Li, M.; Li, X.; Zhang, Y.; Wu, H.; Zhou, H.; Ding, X.; Zhang, X.; Jin, X.; Wang, Y.; Yin, X.; et al. Micropeptide MIAC Inhibits HNSCC Progression by Interacting with Aquaporin 2. J. Am. Chem. Soc. 2020, 142, 6708–6716. [Google Scholar] [CrossRef]
  21. Lanz, R.B.; McKenna, N.J.; Onate, S.A.; Albrecht, U.; Wong, J.; Tsai, S.Y.; Tsai, M.J.; O’Malley, B.W. A Steroid Receptor Coactivator, SRA, Functions as an RNA and Is Present in an SRC-1 Complex. Cell 1999, 97, 17–27. [Google Scholar] [CrossRef] [PubMed]
  22. Yan, Y.; Cooper, C.; Hamedani, M.K.; Guppy, B.; Xu, W.; Tsuyuki, D.; Zhang, C.; Nugent, Z.; Blanchard, A.; Davie, J.R.; et al. The steroid receptor RNA activator protein (SRAP) controls cancer cell migration/motility. FEBS Lett. 2015, 589, 4010–4018. [Google Scholar] [CrossRef]
  23. Cheetham, S.W.; Faulkner, G.J.; Dinger, M.E. Overcoming challenges and dogmas to understand the functions of pseudogenes. Nat. Rev. Genet. 2020, 21, 191–201. [Google Scholar] [CrossRef] [PubMed]
  24. Ji, Z.; Song, R.; Regev, A.; Struhl, K. Many lncRNAs, 5′UTRs, and pseudogenes are translated and some are likely to express functional proteins. eLife 2015, 4, e08890. [Google Scholar] [CrossRef] [PubMed]
  25. Ransohoff, J.D.; Wei, Y.; Khavari, P.A. The functions and unique features of long intergenic non-coding RNA. Nat. Rev. Mol. Cell Biol. 2018, 19, 143–157. [Google Scholar] [CrossRef] [PubMed]
  26. Apcher, S.; Millot, G.; Daskalogianni, C.; Scherl, A.; Manoury, B.; Fåhraeus, R. Translation of pre-spliced RNAs in the nuclear compartment generates peptides for the MHC class I pathway. Proc. Natl. Acad. Sci. USA 2013, 110, 17951–17956. [Google Scholar] [CrossRef]
  27. Ratti, M.; Lampis, A.; Ghidini, M.; Salati, M.; Mirchev, M.B.; Valeri, N.; Hahne, J.C. MicroRNAs (miRNAs) and Long Non-Coding RNAs (lncRNAs) as New Tools for Cancer Therapy: First Steps from Bench to Bedside. Target. Oncol. 2020, 15, 261–278. [Google Scholar] [CrossRef] [PubMed]
  28. Motameny, S.; Wolters, S.; Nürnberg, P.; Schumacher, B. Next Generation Sequencing of miRNAs—Strategies, Resources and Methods. Genes 2010, 1, 70–84. [Google Scholar] [CrossRef]
  29. Chen, R.; Wang, S.K.; Belk, J.A.; Amaya, L.; Li, Z.; Cardenas, A.; Abe, B.T.; Chen, C.-K.; Wender, P.A.; Chang, H.Y. Engineering circular RNA for enhanced protein production. Nat. Biotechnol. 2023, 41, 262–272. [Google Scholar] [CrossRef] [PubMed]
  30. Misir, S.; Wu, N.; Yang, B.B. Specific expression and functions of circular RNAs. Cell Death Differ. 2022, 29, 481–491. [Google Scholar] [CrossRef]
  31. Yang, Y.; Fan, X.; Mao, M.; Song, X.; Wu, P.; Zhang, Y.; Jin, Y.; Yang, Y.; Chen, L.-L.; Wang, Y.; et al. Extensive translation of circular RNAs driven by N(6)-methyladenosine. Cell Res. 2017, 27, 626–641. [Google Scholar] [CrossRef] [PubMed]
  32. Jiang, T.; Xia, Y.; Lv, J.; Li, B.; Li, Y.; Wang, S.; Xuan, Z.; Xie, L.; Qiu, S.; He, Z.; et al. A novel protein encoded by circMAPK1 inhibits progression of gastric cancer by suppressing activation of MAPK signaling. Mol. Cancer 2021, 20, 66. [Google Scholar] [CrossRef] [PubMed]
  33. Merry, T.L.; Chan, A.; Woodhead, J.S.T.; Reynolds, J.C.; Kumagai, H.; Kim, S.-J.; Lee, C. Mitochondrial-derived peptides in energy metabolism. Am. J. Physiol. Endocrinol. Metab. 2020, 319, E659–E666. [Google Scholar] [CrossRef] [PubMed]
  34. Yang, H.; Li, Q.; Stroup, E.K.; Wang, S.; Ji, Z. Widespread stable noncanonical peptides identified by integrated analyses of ribosome profiling and ORF features. Nat. Commun. 2024, 15, 1932. [Google Scholar] [CrossRef] [PubMed]
  35. Martinez, T.F.; Chu, Q.; Donaldson, C.; Tan, D.; Shokhirev, M.N.; Saghatelian, A. Accurate annotation of human protein-coding small open reading frames. Nat. Chem. Biol. 2020, 16, 458–468. [Google Scholar] [CrossRef] [PubMed]
  36. Leong, A.Z.-X.; Lee, P.Y.; Mohtar, M.A.; Syafruddin, S.E.; Pung, Y.-F.; Low, T.Y. Short open reading frames (sORFs) and microproteins: An update on their identification and validation measures. J. Biomed. Sci. 2022, 29, 19. [Google Scholar] [CrossRef]
  37. Hanada, K.; Zhang, X.; Borevitz, J.O.; Li, W.-H.; Shiu, S.-H. A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection. Genome Res. 2007, 17, 632–640. [Google Scholar] [CrossRef] [PubMed]
  38. Brito-Estrada, O.; Hassel, K.R.; Makarewich, C.A. An Integrated Approach for Microprotein Identification and Sequence Analysis. J. Vis. Exp. 2022, 185, e63841. [Google Scholar]
  39. Zhu, M.; Gribskov, M. MiPepid: MicroPeptide identification tool using machine learning. BMC Bioinform. 2019, 20, 559. [Google Scholar] [CrossRef]
  40. Skarshewski, A.; Stanton-Cook, M.; Huber, T.; Mansoori, S.A.; Smith, R.; Beatson, S.A.; Rothnagel, J.A. uPEPperoni: An online tool for upstream open reading frame location and analysis of transcript conservation. BMC Bioinform. 2014, 15, 36. [Google Scholar] [CrossRef]
  41. Lin, M.F.; Jungreis, I.; Kellis, M. PhyloCSF: A comparative genomics method to distinguish protein coding and non-coding regions. Bioinformatics 2011, 27, i275–i282. [Google Scholar] [CrossRef]
  42. Uszczynska-Ratajczak, B.; Lagarde, J.; Frankish, A.; Guigó, R.; Johnson, R. Towards a complete map of the human long non-coding RNA transcriptome. Nat. Rev. Genet. 2018, 19, 535–548. [Google Scholar] [CrossRef] [PubMed]
  43. Cunningham, F.; Allen, J.E.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Austine-Orimoloye, O.; Azov, A.G.; Barnes, I.; Bennett, R.; et al. Ensembl 2022. Nucleic Acids Res. 2022, 50, D988–D995. [Google Scholar] [CrossRef]
  44. Ingolia, N.T.; Ghaemmaghami, S.; Newman, J.R.S.; Weissman, J.S. Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling. Science 2009, 324, 218–223. [Google Scholar] [CrossRef] [PubMed]
  45. Giacomini, G.; Graziani, C.; Lachi, V.; Bongini, P.; Pancino, N.; Bianchini, M.; Chiarugi, D.; Valleriani, A.; Andreini, P. A Neural Network Approach for the Analysis of Reproducible Ribo–Seq Profile. Algorithms 2022, 15, 274. [Google Scholar] [CrossRef]
  46. Kearse, M.G.; Wilusz, J.E. Non-AUG translation: A new start for protein synthesis in eukaryotes. Genes Dev. 2017, 31, 1717–1731. [Google Scholar] [CrossRef]
  47. Slavoff, S.A.; Mitchell, A.J.; Schwaid, A.G.; Cabili, M.N.; Ma, J.; Levin, J.Z.; Karger, A.D.; Budnik, B.A.; Rinn, J.L.; Saghatelian, A. Peptidomic discovery of short open reading frame–encoded peptides in human cells. Nat. Chem. Biol. 2013, 9, 59–64. [Google Scholar] [CrossRef] [PubMed]
  48. Guttman, M.; Russell, P.; Ingolia, N.T.; Weissman, J.S.; Lander, E.S. Ribosome Profiling Provides Evidence that Large Noncoding RNAs Do Not Encode Proteins. Cell 2013, 154, 240–251. [Google Scholar] [CrossRef]
  49. Brar, G.A.; Weissman, J.S. Ribosome profiling reveals the what, when, where and how of protein synthesis. Nat. Rev. Mol. Cell Biol. 2015, 16, 651–664. [Google Scholar] [CrossRef]
  50. Aspden, J.L.; Eyre-Walker, Y.C.; Phillips, R.J.; Amin, U.; Mumtaz, M.A.S.; Brocard, M.; Couso, J.-P. Extensive translation of small Open Reading Frames revealed by Poly-Ribo-Seq. Elife 2014, 3, e03528. [Google Scholar] [CrossRef]
  51. Mohsen, J.J.; Martel, A.A.; Slavoff, S.A. Microproteins-Discovery, structure, and function. Proteomics 2023, 23, e2100211. [Google Scholar] [CrossRef] [PubMed]
  52. López, E.; Wang, X.; Madero, L.; López-Pascual, J.; Latterich, M. Functional phosphoproteomic mass spectrometry-based approaches. Clin. Transl. Med. 2012, 1, 20. [Google Scholar] [CrossRef]
  53. Chen, J.; Brunner, A.-D.; Cogan, J.Z.; Nuñez, J.K.; Fields, A.P.; Adamson, B.; Itzhak, D.N.; Li, J.Y.; Mann, M.; Leonetti, M.D.; et al. Pervasive functional translation of noncanonical human open reading frames. Science 2020, 367, 1140–1146. [Google Scholar] [CrossRef] [PubMed]
  54. Prensner, J.R.; Enache, O.M.; Luria, V.; Krug, K.; Clauser, K.R.; Dempster, J.M.; Karger, A.; Wang, L.; Stumbraite, K.; Wang, V.M.; et al. Noncanonical open reading frames encode functional proteins essential for cancer cell survival. Nat. Biotechnol. 2021, 39, 697–704. [Google Scholar] [CrossRef]
  55. Schlesinger, D.; Elsässer, S.J. Revisiting sORFs: Overcoming challenges to identify and characterize functional microproteins. FEBS J. 2022, 289, 53–74. [Google Scholar] [CrossRef]
  56. Hanahan, D.; Weinberg, R.A. The hallmarks of cancer. Cell 2000, 100, 57–70. [Google Scholar] [CrossRef] [PubMed]
  57. Hanahan, D.; Weinberg, R.A. Hallmarks of cancer: The next generation. Cell 2011, 144, 646–674. [Google Scholar] [CrossRef]
  58. Merino-Valverde, I.; Greco, E.; Abad, M. The microproteome of cancer: From invisibility to relevance. Exp. Cell Res. 2020, 392, 111997. [Google Scholar] [CrossRef]
  59. Polycarpou-Schwarz, M.; Groß, M.; Mestdagh, P.; Schott, J.; Grund, S.E.; Hildenbrand, C.; Rom, J.; Aulmann, S.; Sinn, H.-P.; Vandesompele, J.; et al. The cancer-associated microprotein CASIMO1 controls cell proliferation and interacts with squalene epoxidase modulating lipid droplet formation. Oncogene 2018, 37, 4750–4768. [Google Scholar] [CrossRef]
  60. Pang, Y.; Liu, Z.; Han, H.; Wang, B.; Li, W.; Mao, C.; Liu, S. Peptide SMIM30 promotes HCC development by inducing SRC/YES1 membrane anchoring and MAPK pathway activation. J. Hepatol. 2020, 73, 1155–1169. [Google Scholar] [CrossRef]
  61. Leng, F.; Miu, Y.; Zhang, Y.; Luo, H.; Lu, X.; Cheng, H.; Zheng, Z. A micro-peptide encoded by HOXB-AS3 promotes the proliferation and viability of oral squamous cell carcinoma cell lines by directly binding with IGF2BP2 to stabilize c-Myc. Oncol. Lett. 2021, 22, 697. [Google Scholar] [CrossRef]
  62. Huang, J.-Z.; Chen, M.; Chen, D.; Gao, X.-C.; Zhu, S.; Huang, H.; Hu, M.; Zhu, H.; Yan, G.-R. A Peptide Encoded by a Putative lncRNA HOXB-AS3 Suppresses Colon Cancer Growth. Mol. Cell 2017, 68, 171–184. [Google Scholar] [CrossRef] [PubMed]
  63. Lara, S.B.D.; Tran, D.D.H.; Allister, A.B.; Polenkowski, M.; Nashan, B.; Koch, M.; Tamura, T. C20orf204, a hepatocellular carcinoma-specific protein interacts with nucleolin and promotes cell proliferation. Oncogenesis 2021, 10, 31. [Google Scholar] [CrossRef] [PubMed]
  64. Zheng, X.; Chen, L.; Zhou, Y.; Wang, Q.; Zheng, Z.; Xu, B.; Wu, C.; Zhou, Q.; Hu, W.; Wu, C.; et al. A novel protein encoded by a circular RNA circPPP1R12A promotes tumor pathogenesis and metastasis of colon cancer via Hippo-YAP signaling. Mol. Cancer 2019, 18, 47. [Google Scholar] [CrossRef]
  65. Zhang, S.; Zhang, Z.; Liu, X.; Deng, Y.; Zheng, J.; Deng, J.; Wang, Y.; Guo, B.; Li, F.; Chen, X.; et al. LncRNA-Encoded Micropeptide ACLY-BP Drives Lipid Deposition and Cell Proliferation in Clear Cell Renal Cell Carcinoma via Maintenance of ACLY Acetylation. Mol. Cancer Res. 2023, 21, 1064–1078. [Google Scholar] [CrossRef] [PubMed]
  66. Yang, Y.; Gao, X.; Zhang, M.; Yan, S.; Sun, C.; Xiao, F.; Huang, N.; Yang, X.; Zhao, K.; Zhou, H.; et al. Novel Role of FBXW7 Circular RNA in Repressing Glioma Tumorigenesis. J. Natl. Cancer Inst. 2018, 110, 304–315. [Google Scholar] [CrossRef] [PubMed]
  67. Xia, X.; Li, X.; Li, F.; Wu, X.; Zhang, M.; Zhou, H.; Huang, N.; Yang, X.; Xiao, F.; Liu, D.; et al. A novel tumor suppressor protein encoded by circular AKT3 RNA inhibits glioblastoma tumorigenicity by competing with active phosphoinositide-dependent Kinase-1. Mol. Cancer 2019, 18, 131. [Google Scholar] [CrossRef] [PubMed]
  68. Zhang, M.; Zhao, K.; Xu, X.; Yang, Y.; Yan, S.; Wei, P.; Liu, H.; Xu, J.; Xiao, F.; Zhou, H.; et al. A peptide encoded by circular form of LINC-PINT suppresses oncogenic transcriptional elongation in glioblastoma. Nat. Commun. 2018, 9, 4475. [Google Scholar] [CrossRef] [PubMed]
  69. Morgado-Palacin, L.; Brown, J.A.; Martinez, T.F.; Garcia-Pedrero, J.M.; Farhad Forouhar, S.A.Q.; Reglero, C.; Vaughan, J.; Heydary, Y.H.; Donaldson, C.; Rodriguez-Perales, S.; et al. The TINCR uniquitin-like microprotein is a tumor suppressor in squamous cell carcinoma. Nat. Commun. 2023, 14, 1328. [Google Scholar] [CrossRef]
  70. Xu, W.; Deng, B.; Lin, P.; Liu, C.; Li, B.; Huang, Q.; Zhou, H.; Yang, J.; Qu, L. Ribosome profiling analysis identified a KRAS-interacting microprotein that represses oncogenic signaling in hepatocellular carcinoma cells. Sci. China Life Sci. 2020, 63, 529–542. [Google Scholar] [CrossRef]
  71. Vitale, I.; Pietrocola, F.; Guilbaud, E.; Aaronson, S.A. Apoptotic cell death in disease—Current understanding of the NCCD 2023. Cell Death Differ. 2023, 30, 1097–1154. [Google Scholar]
  72. Guo, B.; Zhai, D.; Cabezas, E.; Welsh, K.; Nouraini, S.; Satterthwait, A.C.; Reed, J.C. Humanin peptide suppresses apoptosis by interfering with Bax activation. Nature 2003, 423, 456–461. [Google Scholar] [CrossRef] [PubMed]
  73. Zhang, W.; Guo, Y.; Zhang, G.; Bai, J.; Song, Y.; Song, X.; Zhu, Q.; Bao, X.; Wu, G.; Zhang, C. Peptide encoded by lncRNA BVES-AS1 promotes cell viability, migration, and invasion in colorectal cancer cells via the SRC/mTOR signaling pathway. PLoS ONE 2023, 18, e0287133. [Google Scholar] [CrossRef] [PubMed]
  74. Wu, S.; Zhang, L.; Deng, J.; Guo, B.; Li, F.; Wang, Y.; Wu, R.; Zhang, S.; Lu, J.; Zhou, Y. A Novel Micropeptide Encoded by Y-Linked LINC00278 Links Cigarette Smoking and AR Signaling in Male Esophageal Squamous Cell Carcinoma. Cancer Res. 2020, 80, 2790–2803. [Google Scholar] [CrossRef] [PubMed]
  75. Kang, M.; Tang, B.; Li, J.; Zhou, Z.; Liu, K.; Wang, R.; Jiang, Z.; Bi, F.; Patrick, D.; Kim, D.; et al. Identification of miPEP133 as a novel tumor-suppressor microprotein encoded by miR-34a pri-miRNA. Mol. Cancer 2020, 19, 143. [Google Scholar] [CrossRef]
  76. Fares, J.; Fares, M.Y.; Khachfe, H.H.; Salhab, H.A.; Fares, Y. Molecular principles of metastasis: A hallmark of cancer revisited. Signal Transduct. Target. Ther. 2020, 5, 28. [Google Scholar] [CrossRef] [PubMed]
  77. Friedl, P.; Alexander, S. Cancer invasion and the microenvironment: Plasticity and reciprocity. Cell 2011, 147, 992–1009. [Google Scholar] [CrossRef] [PubMed]
  78. Seyfried, T.N.; Huysentruyt, L.C. On the origin of cancer metastasis. Crit. Rev. Oncog. 2013, 18, 43–73. [Google Scholar] [CrossRef] [PubMed]
  79. Guo, Z.-W.; Meng, Y.; Zhai, X.-M.; Xie, C.; Zhao, N.; Li, M.; Zhou, C.-L.; Li, K.; Liu, T.-C.; Yang, X.-X.; et al. Translated Long Non-Coding Ribonucleic Acid ZFAS1 Promotes Cancer Cell Migration by Elevating Reactive Oxygen Species Production in Hepatocellular Carcinoma. Front. Genet. 2019, 10, 1111. [Google Scholar] [CrossRef]
  80. Pauli, A.; Norris, M.L.; Valen, E.; Chew, G.-L.; Gagnon, J.A.; Zimmerman, S.; Mitchell, A.; Ma, J.; Dubrulle, J.; Reyon, D.; et al. Toddler: An Embryonic Signal That Promotes Cell Movement via Apelin Receptors. Science 2014, 343, 1248636. [Google Scholar] [CrossRef]
  81. Yi, Y.; Tsai, S.-H.; Cheng, J.-C.; Wang, E.Y.; Anglesio, M.S.; Cochrane, D.R.; Fuller, M.; Gibb, E.A.; Wei, W.; Huntsman, D.G.; et al. APELA promotes tumor growth and cell migration in ovarian cancer in a p53-dependent manner. Gynecol. Oncol. 2017, 147, 663–671. [Google Scholar] [CrossRef] [PubMed]
  82. Meng, K.; Lu, S.; Li, Y.-Y.; Hu, L.-L.; Zhang, J.; Cao, Y.; Wang, Y.; Zhang, C.Z.; He, Q.-Y. LINC00493-encoded microprotein SMIM26 exerts anti-metastatic activity in renal cell carcinoma. EMBO Rep. 2023, 24, e56282. [Google Scholar] [CrossRef] [PubMed]
  83. Geiger, T.R.; Peeper, D.S. Metastasis mechanisms. Biochim. Et Biophys. Acta (BBA)-Rev. Cancer 2009, 1796, 293–308. [Google Scholar] [CrossRef] [PubMed]
  84. Wu, S.; Guo, B.; Zhang, L.; Zhu, X.; Zhao, P.; Deng, J.; Zheng, J.; Li, F.; Wang, Y.; Zhang, S.; et al. A micropeptide XBP1SBM encoded by lncRNA promotes angiogenesis and metastasis of TNBC via XBP1s pathway. Oncogene 2022, 41, 2163–2172. [Google Scholar] [CrossRef]
  85. Adam, J.K.; Odhav, B.; Bhoola, K.D. Immune responses in cancer. Pharmacol. Ther. 2003, 99, 113–132. [Google Scholar] [CrossRef] [PubMed]
  86. Pei, X.; Sun, Q.; Zhang, Y.; Wang, P.; Peng, X.; Guo, C.; Xu, E.; Zheng, Y.; Mo, X.; Ma, J.; et al. PC3-Secreted Microprotein Is a Novel Chemoattractant Protein and Functions as a High-Affinity Ligand for CC Chemokine Receptor 2. J. Immunol. 2014, 192, 1878–1886. [Google Scholar] [CrossRef]
  87. Jackson, R.; Kroehling, L.; Khitun, A.; Bailis, W. The translation of non-canonical open reading frames controls mucosal immunity. Nature 2018, 564, 434–438. [Google Scholar] [CrossRef]
  88. Niu, L.; Lou, F.; Sun, Y.; Sun, L.; Cai, X.; Liu, Z. A micropeptide encoded by lncRNA MIR155HG suppresses autoimmune inflammation via modulating antigen presentation. Sci. Adv. 2020, 6, eaaz2059. [Google Scholar] [CrossRef]
  89. Chong, C.; Müller, M.; Pak, H.; Harnett, D.; Huber, F.; Grun, D.; Leleu, M.; Auger, A.; Arnaud, M.; Stevenson, B.J.; et al. Integrated proteogenomic deep sequencing and analytics accurately identify non-canonical peptides in tumor immunopeptidomes. Nat. Commun. 2020, 11, 1293. [Google Scholar] [CrossRef]
  90. Irajizad, E.; Fahrmann, J.; Long, J.; Vykoukal, J.; Kobayashi, M.; Capello, M.; Yu, C.; Cai, Y.; Hsiao, F.; Patel, N.; et al. A Comprehensive Search of Non-Canonical Proteins in Non-Small Cell Lung Cancer and Their Impact on the Immune Response. Int. J. Mol. Sci. 2022, 23, 8933. [Google Scholar] [CrossRef]
  91. Yu, R.; Hu, Y.; Zhang, S.; Li, X.; Tang, M.; Yang, M.; Wu, X.; Li, Z.; Liao, X.; Xu, Y.; et al. LncRNA CTBP1-DT-encoded microprotein DDUP sustains DNA damage response signalling to trigger dual DNA repair mechanisms. Nucleic Acids Res. 2022, 50, 8060–8079. [Google Scholar] [CrossRef]
  92. Slavoff, S.; Heo, J.; Budnik, B.; Hanakahi, L.; Saghatelian, A. A human short open reading frame (sORF)-encoded polypeptide that stimulates DNA end joining. J. Biol. Chem. 2014, 289, 10950–10957. [Google Scholar] [CrossRef] [PubMed]
  93. Zhang, C.; Zhou, B.; Gu, F.; Liu, H.; Wu, H.; Yao, F.; Zheng, H.; Fu, H.; Chong, W.; Cai, S.; et al. Micropeptide PACMP inhibition elicits synthetic lethal effects by decreasing CtIP and poly(ADP-ribosyl)ation. Mol. Cell 2022, 82, 1297–1312. [Google Scholar] [CrossRef] [PubMed]
  94. Deberardinis, R.; Chandel, N. Fundamentals of cancer metabolism. Sci. Adv. 2016, 2, e1600200. [Google Scholar] [CrossRef]
  95. Huang, N.; Li, F.; Zhang, M.; Zhou, H.; Chen, Z.; Ma, X.; Yang, L.; Wu, X.; Zhong, J.; Xiao, F.; et al. An Upstream Open Reading Frame in Phosphatase and Tensin Homolog Encodes a Circuit Breaker of Lactate Metabolism. Cell Metab. 2021, 33, 128–144. [Google Scholar] [CrossRef]
  96. Zhu, K.-G.; Yang, J.; Zhu, Y.; Zhu, Q.; Pan, W.; Deng, S.; He, Y.; Zuo, D.; Wang, P.; Han, Y.; et al. The microprotein encoded by exosomal lncAKR1C2 promotes gastric cancer lymph node metastasis by regulating fatty acid metabolism. Cell Death Dis. 2023, 14, 708. [Google Scholar] [CrossRef] [PubMed]
  97. Carracedo, A.; Cantley, L.; Pandolfi, P. Cancer metabolism: Fatty acid oxidation in the limelight. Nat. Rev. Cancer 2013, 13, 227–232. [Google Scholar] [CrossRef] [PubMed]
  98. Posner, Z.; Yannuzzi, I.; Prensner, J.R. Shining a light on the dark proteome: Non-canonical open reading frames and their encoded miniproteins as a new frontier in cancer biology. Protein Sci. 2023, 32, e4708. [Google Scholar] [CrossRef] [PubMed]
  99. Zhu, S.; Wang, J.; He, Y.; Meng, N.; Yan, G.-R. Peptides/Proteins Encoded by Non-coding RNA: A Novel Resource Bank for Drug Targets and Biomarkers. Front. Pharmacol. 2018, 9, 1295. [Google Scholar] [CrossRef]
  100. Erady, C.; Boxall, A.; Puntambekar, S.; Jagannathan, N.S.; Chauhan, R.; Chong, D.; Meena, N.; Kulkarni, A.; Kasabe, B.; Bhayankaram, K.P.; et al. Pan-cancer analysis of transcripts encoding novel open-reading frames (nORFs) and their potential biological functions. NPJ Genom. Med. 2021, 6, 4. [Google Scholar] [CrossRef]
  101. Zhou, H.; Wu, Y.; Cai, J.; Zhang, D.; Lan, D.; Dai, X.; Liu, S.; Song, T.; Wang, X.; Kong, Q.; et al. Micropeptides: Potential treatment strategies for cancer. Cancer Cell International. Cancer Cell Int. 2024, 24, 134. [Google Scholar] [CrossRef]
  102. Setrerrahmane, S.; Li, M.; Zoghbi, A.; Lv, X.; Zhang, S.; Zhao, W.; Lu, J.; Craik, D.J.; Xu, H. Cancer-related micropeptides encoded by ncRNAs: Promising drug targets and prognostic biomarkers. Cancer Lett. 2022, 547, 215723. [Google Scholar] [CrossRef] [PubMed]
  103. Carlomagno, N.; Incollingo, P.; Tammaro, V.; Peluso, G.; Rupealta, N.; Chiacchio, G.; Sotelo, M.L.S.; Minieri, G.; Pisani, A.; Riccio, E.; et al. Diagnostic, Predictive, Prognostic, and Therapeutic Molecular Biomarkers in Third Millennium: A Breakthrough in Gastric Cancer. BioMed Res. Int. 2017, 2017, 7869802. [Google Scholar] [CrossRef] [PubMed]
  104. Xu, S.; Jia, G.; Zhang, H.; Wang, L.; Cong, Y.; Lv, M.; Xu, J.; Ruan, H.; Jia, X.; Xu, P.; et al. LncRNA HOXB-AS3 promotes growth, invasion and migration of epithelial ovarian cancer by altering glycolysis. Life Sci. 2021, 264, 118636. [Google Scholar] [CrossRef] [PubMed]
  105. Jiang, W.; Kai, J.; Li, D.; Wei, Z.; Yang, Y.; Wang, W. lncRNA HOXB-AS3 exacerbates proliferation, migration, and invasion of lung cancer via activating the PI3K-AKT pathway. J. Cell. Physiol. 2020, 235, 7194–7203. [Google Scholar] [CrossRef] [PubMed]
  106. Papaioannou, D.; Petri, A.; Dovey, O.M.; Terreri, S.; Wang, E.; Collins, F.A.; Woodward, L.A.; Walker, A.E.; Nicolet, D.; Pepe, F.; et al. The long non-coding RNA HOXB-AS3 regulates ribosomal RNA transcription in NPM1-mutated acute myeloid leukemia. Nat. Commun. 2019, 10, 5351. [Google Scholar] [CrossRef]
  107. Xing, Y.; Sun, X.; Li, F.; Jiang, X.; Jiang, A.; Li, X.; Lv, R.; Shao, L. Long non-coding RNA (lncRNA) HOXB-AS3 promotes cell proliferation and inhibits apoptosis by regulating ADAM9 expression through targeting miR-498-5p in endometrial carcinoma. J. Int. Med. Res. 2021, 49, 03000605211013548. [Google Scholar] [CrossRef]
  108. Meng, N.; Chen, M.; Chen, D.; Chen, X.H.; Wang, J.Z.; Zhu, S.; He, Y.T.; Zhang, X.L.; Lu, R.X.; Yan, G.R. Small Protein Hidden in lncRNA LOC90024 Promotes “Cancerous” RNA Splicing and Tumorigenesis. Adv. Sci. 2020, 7, 1903233. [Google Scholar] [CrossRef]
  109. Zhu, S.; Wang, J.-Z.; Chen, D.; He, Y.-T.; Meng, N.; Chen, M.; Lu, R.-X.; Chen, X.-H.; Zhang, X.-L.; Yan, G.-R. An oncopeptide regulates m6A recognition by the m6A reader IGF2BP1 and tumorigenesis. Nat. Commun. 2020, 11, 1685. [Google Scholar] [CrossRef]
  110. Zhang, M.; Huang, N.; Yang, X.; Luo, J.; Yan, S.; Xiao, F.; Chen, W.; Gao, X.; Zhao, K.; Zhou, H.; et al. A novel protein encoded by the circular form of the SHPRH gene suppresses glioma tumorigenesis. Oncogene 2018, 37, 1805–1814. [Google Scholar] [CrossRef]
  111. Xiao, M.-H.; Lin, Y.-F.; Xie, P.-P.; Chen, H.-X.; Deng, J.-W.; Zhang, W.; Zhao, N.; Xie, C.; Meng, Y.; Liu, X.; et al. Downregulation of a mitochondrial micropeptide, MPM, promotes hepatoma metastasis by enhancing mitochondrial complex I activity. Mol. Ther. 2022, 30, 714–725. [Google Scholar] [CrossRef] [PubMed]
  112. Ren, L.; Qing, X.; Wei, J.; Mo, H.; Liu, Y.; Zhi, Y.; Lu, W.; Zheng, M.; Zhang, W.; Chen, Y.; et al. The DDUP protein encoded by the DNA damage-induced CTBP1-DT lncRNA confers cisplatin resistance in ovarian cancer. Cell Death Dis. 2023, 14, 568. [Google Scholar] [CrossRef] [PubMed]
  113. Xiao, J.; Howard, L.; Wan, J.; Wiggins, E.; Vidal, A.; Cohen, P.; Freedland, S.J. Low circulating levels of the mitochondrial-peptide hormone SHLP2: Novel biomarker for prostate cancer risk. Oncotarget 2017, 8, 94900–94909. [Google Scholar] [CrossRef] [PubMed]
  114. Li, M.; Liu, G.; Jin, X.; Guo, H.; Setrerrahmane, S.; Xu, X.; Li, T.; Lin, Y.; Xu, H. Micropeptide MIAC inhibits the tumor progression by interacting with AQP2 and inhibiting EREG/EGFR signaling in renal cell carcinoma. Mol. Cancer 2022, 21, 181. [Google Scholar] [CrossRef] [PubMed]
  115. Cao, X.; Khitun, A.; Na, Z.; Dumitrescu, D.G.; Kubica, M.; Olatunji, E.; Slavoff, S.A. Comparative Proteomic Profiling of Unannotated Microproteins and Alternative Proteins in Human Cell Lines. J. Proteome Res. 2020, 19, 3418–3426. [Google Scholar] [CrossRef]
  116. Lee, Y.T.; Tan, Y.J.; Oon, C.E. Molecular targeted therapy: Treating cancer with specificity. Eur. J. Pharmacol. 2018, 834, 188–196. [Google Scholar] [CrossRef]
  117. Wang, X.; Zhang, H.; Chen, X. Drug resistance and combating drug resistance in cancer. Cancer Drug Resist. 2019, 2, 141–160. [Google Scholar] [CrossRef]
  118. Huang, D.; Duan, H.; Huang, H.; Tong, X.; Han, Y.; Ru, G.; Qu, L.; Shou, C.; Zhao, Z. Cisplatin resistance in gastric cancer cells is associated with HER2 upregulation-induced epithelial-mesenchymal transition. Sci. Rep. 2016, 6, 20502. [Google Scholar] [CrossRef] [PubMed]
  119. Kurrey, N.K.; Jalgaonkar, S.P.; Joglekar, A.V.; Ghanate, A.D.; Chaskar, P.D.; Doiphode, R.Y.; Bapat, S.A. Snail and Slug mediate radioresistance and chemoresistance by antagonizing p53-mediated apoptosis and acquiring a stem-like phenotype in ovarian cancer cells. Stem Cells 2009, 27, 2059–2068. [Google Scholar] [CrossRef]
  120. Quintás-Cardama, A.; Kantarjian, H.M.; Cortes, J.E. Mechanisms of primary and secondary resistance to imatinib in chronic myeloid leukemia. Cancer Control 2009, 16, 122–131. [Google Scholar] [CrossRef]
  121. Zhai, S.; Lin, J.; Ji, Y.; Zhang, R.; Zhang, Z.; Cao, Y.; Liu, Y.; Tang, X.; Liu, J.; Liu, P.; et al. A microprotein N1DARP encoded by LINC00261 promotes Notch1 intracellular domain (N1ICD) degradation via disrupting USP10-N1ICD interaction to inhibit chemoresistance in Notch1-hyperactivated pancreatic cancer. Cell Discov. 2023, 9, 95. [Google Scholar] [CrossRef]
  122. Hassel, K.R.; Brito-Estrada, O.; Makarewich, C.A. Microproteins: Overlooked regulators of physiology and disease. iScience 2023, 26, 106781. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Different types of RNAs with coding capacity. Distinct categories of non-canonical open reading frames (ncORFs) have been identified from non-coding RNAs, which include 5′UTR upstream ORF (uORF), 3′ UTR downstream ORF (dORF), long non-coding ORF (lncORF), and long intergenic non-coding ORF (lincORF) from long non-coding RNAs (lncRNAs), miRNA ORF (miORF) from miRNA, and ORF derived from circular RNAs (circORF).
Figure 1. Different types of RNAs with coding capacity. Distinct categories of non-canonical open reading frames (ncORFs) have been identified from non-coding RNAs, which include 5′UTR upstream ORF (uORF), 3′ UTR downstream ORF (dORF), long non-coding ORF (lncORF), and long intergenic non-coding ORF (lincORF) from long non-coding RNAs (lncRNAs), miRNA ORF (miORF) from miRNA, and ORF derived from circular RNAs (circORF).
Cancers 16 02660 g001
Figure 2. Regulation of cell proliferation and cell death by ncORF. ncORFs are indicated in blue color background. Cancers 16 02660 i001 activation; Cancers 16 02660 i002 inhibition.
Figure 2. Regulation of cell proliferation and cell death by ncORF. ncORFs are indicated in blue color background. Cancers 16 02660 i001 activation; Cancers 16 02660 i002 inhibition.
Cancers 16 02660 g002
Figure 3. Regulation of cell migration/invasion, EMT, angiogenesis, immunity, genetic instability, and metabolism by ncORFs. Labels are as described in the legend of Figure 2.
Figure 3. Regulation of cell migration/invasion, EMT, angiogenesis, immunity, genetic instability, and metabolism by ncORFs. Labels are as described in the legend of Figure 2.
Cancers 16 02660 g003
Figure 4. ncORFs as prognosis and diagnosis biomarkers. The prognostic and diagnostic roles of microproteins encoded by ncORFs in different human cancers is shown with dashed lines. AML, acute myeloid leukemia; BC, breast cancer; CRC, colorectal carcinoma; CSCC, cutaneious squamous cell carcinoma; GBM, glioblastoma; GC, gastric cancer; HCC, hepatocellular carcinoma; LAC, lung adenocarcinoma; NPC, nasopharyngeal carcinoma; OC, ovarian cancer; PC, prostate cancer; RCC, renal cell carcinoma.
Figure 4. ncORFs as prognosis and diagnosis biomarkers. The prognostic and diagnostic roles of microproteins encoded by ncORFs in different human cancers is shown with dashed lines. AML, acute myeloid leukemia; BC, breast cancer; CRC, colorectal carcinoma; CSCC, cutaneious squamous cell carcinoma; GBM, glioblastoma; GC, gastric cancer; HCC, hepatocellular carcinoma; LAC, lung adenocarcinoma; NPC, nasopharyngeal carcinoma; OC, ovarian cancer; PC, prostate cancer; RCC, renal cell carcinoma.
Cancers 16 02660 g004
Figure 5. The contribution of ncORFs-encoded microproteins in anti-cancer drug responses. Labels are as described in legend of Figure 2.
Figure 5. The contribution of ncORFs-encoded microproteins in anti-cancer drug responses. Labels are as described in legend of Figure 2.
Cancers 16 02660 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ge, A.; Chan, C.; Yang, X. Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy. Cancers 2024, 16, 2660. https://doi.org/10.3390/cancers16152660

AMA Style

Ge A, Chan C, Yang X. Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy. Cancers. 2024; 16(15):2660. https://doi.org/10.3390/cancers16152660

Chicago/Turabian Style

Ge, Anni, Curtis Chan, and Xiaolong Yang. 2024. "Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy" Cancers 16, no. 15: 2660. https://doi.org/10.3390/cancers16152660

APA Style

Ge, A., Chan, C., & Yang, X. (2024). Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy. Cancers, 16(15), 2660. https://doi.org/10.3390/cancers16152660

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop