1. Introduction
Recent data implies that at least 80% of the human genome is transcribed, producing about 20,000 messenger RNAs (mRNAs), over 2300 microRNAs (miRNAs), a set of structural RNAs (such as ribosomal RNAs (rRNAs), transfer RNAs (tRNAs), and small nuclear RNAs (snRNAs)), and at least 50,000 long noncoding RNAs (lncRNAs) [
1,
2,
3]. LncRNAs comprise a diverse group of non-protein coding transcripts that are longer than 200 nucleotides in length, lack an open reading frame, and regulate gene expression at many levels by affecting transcription, cellular localization, mRNA stability, translation, and other post-transcriptional events [
4,
5,
6,
7]. As a result, lncRNAs regulate a broad range of biological processes, including aging, cell growth and differentiation, hematopoiesis, and immune response [
8,
9,
10,
11]. Accordingly, numerous studies have reported highly diverse roles for lncRNAs in cancer [
12,
13,
14], including development and maintenance of resistance to anti-cancer therapy [
15,
16,
17,
18]. Most prominent examples involve well-characterized lncRNAs, such as CDKN2B antisense RNA 1 (
CDKN2B-AS1, also known as
ANRIL) [
19], growth arrest specific 5 (
GAS5) [
20,
21], H19 imprinted maternally expressed transcript (
H19) [
22,
23], HOX transcript antisense RNA (
HOTAIR) [
24,
25], metastasis associated lung adenocarcinoma transcript 1 (
MALAT1) [
26,
27], and X inactive specific transcript (
XIST) [
28,
29]. However, the functions of lncRNAs in the pathogenesis of B-cell lymphomas are still poorly understood.
Lymphomas are a collective term used for malignancies of the lymph system, and they are divided into Hodgkin’s lymphomas (10% of lymphomas) and non-Hodgkin lymphomas (NHL), which account for ca. 90% of all lymphomas [
30,
31,
32]. NHLs comprise a diverse group of lymphoid cancers with approx. 85%–90% of NHL cases originating from B cells, while the remaining NHLs derive from T cells or NK cells [
33,
34]. Diffuse large B-cell lymphoma (DLBCL) is the most common form of NHL, accounting for up to 40% of all lymphomas, with an incidence rate of approximately five to six cases per 100,000 Americans per year [
35]. The 5-year survival rate ranges from 50%–80%, depending on risk profile, and the first-line treatment is the R-CHOP regimen (Rituximab plus Cyclophosphamide, Hydroxydaunorubicin, Oncovin, Prednisone combination therapy) [
36]. Rituximab, a chimeric anti-CD20 monoclonal antibody [
37] was added to the standard CHOP chemotherapy regimen in 2003, which improved the two-year progression-free survival from 51% to 61% and the overall survival rate from 52% to 78% [
38,
39]. Nevertheless, 30%–40% of the patients eventually relapse or are refractory to R-CHOP, experiencing poor two-year outcomes with overall survival rates of only 20%–40% [
40,
41], which calls for improved molecular understanding of the disease and new therapeutic strategies for the treatment of DLBCL. To this end, lncRNAs might be a key to further understand the diagnostic and prognostic measures for DLBCL to promote better treatment options for DLBCL patients. In this article, we summarize the current status of lncRNAs in DLBCL and provide a survey for lncRNAs in DLBCL cell lines to assist further research into the biological functions of lncRNAs in the pathogenesis of DLBCL.
2. General Functions of lncRNAs
LncRNAs exert their functions by binding to other macromolecules: DNA, RNA, or proteins. At the level of transcription, a lncRNA can form a scaffold for the epigenetic or transcription factor complex to assist its activation or repression of specific set of genes. The best investigated example is the binding between histone H3 lysine 27 methyltransferase, polycomb repressive complex 2 (PRC2) (i.e., the functional enzymatic component, enhancer of zeste homolog 2 (EZH2)) and lncRNAs such as
HOTAIR [
42], taurine up-regulated 1 (
TUG1) [
43], and
XIST [
44]. However, recent studies suggest that EZH2 binds many RNAs promiscuously, including lncRNAs [
45,
46,
47,
48,
49,
50]. Besides being transcriptional regulators, some lncRNAs can bind other RNAs to exert post-transcriptional regulation. For example, lncRNAs have been suggested to function as competitive endogenous RNAs (ceRNAs) to sequester miRNAs thereby influencing the amount of proteins being produced from the target mRNAs. Such lncRNAs are more commonly known as miRNA sponges, which include the
H19 lncRNA sponging
miR-675 [
51], the hepatocellular carcinoma up-regulated long non-coding RNA (
HULC) binding to
miR-372 [
52], and the ZNFX1 antisense RNA 1 (
ZFAS1) sequestering
miR-150 [
53]. As for the translational control, lncRNAs can bind to RNA-binding proteins, which regulate mRNA translation. For example, in murine cardiomyocytes, the paternally imprinted lncRNA gene, antisense of IGF2R non-protein coding RNA (
Airn, also known as
Air), binds to the RNA-binding protein insulin like growth factor 2 mRNA binding protein 2 (Igf2bp2) to regulate the translational efficiency of hundreds of mRNAs to control the homeostasis of cardiomyocytes. In addition, lncRNA sequences themselves may be important for genomic imprinting, an epigenetic phenomenon through which genes are expressed in a parent-of-origin dependent manner, as in the case of the paternal imprinting lncRNA
Airn [
54] and the maternal imprinting lncRNA
H19 [
55]). Genomic lncRNA sequences may also contain important regulatory sequences, such as enhancers, as in the case of Hand2, opposite strand 1 (
Hand2os1, also known as
Upperhand [
56]) and Hand2 downstream lncRNA (
Hdnr, also known as
Handsdown [
57])].
Because of the broad definition of lncRNAs as non-protein-coding transcripts longer than 200 nucleotides in length, pseudogenes, which arise from the duplication of DNA sequences or retrotransposition and integration into the genome [
58], are currently considered as a new category of lncRNAs [
59]. Furthermore, circular RNAs (circRNAs) have emerged as an abundant class of covalently closed non-coding RNA molecules in eukaryotes that arise from backsplicing of protein-coding and non-coding exons and/or introns [
60,
61,
62]. Due to the lack of free 5′ and 3′ ends, circRNAs are more resistant to exonuclease-mediated degradation compared to linear RNAs (e.g., mRNAs). Because of their stability, circRNAs accumulate in the blood, saliva, and urine, which have made them attractive candidates for biomarker discovery [
63,
64]. Finally, some lncRNAs have been shown to encode small peptides, named micropeptides. The examples of micropeptides are myomixer, myoblast fusion factor (
MYMX) [
65], and myoregulin (
MRLN) [
66] which regulate muscle formation and activity, respectively. Taken together, lncRNAs comprise a heterogeneous class of non-protein coding transcripts with highly diverse roles in biological processes. However, additional functional studies are required to establish the functions of lncRNAs in human diseases, such as cancer.
3. LncRNAs in DLBCL
Due to the increased interest to study lncRNAs in recent years, researchers have also identified a number of differentially expressed lncRNAs in DLBCL patients compared to healthy donors or in DLBCL cell lines [
67,
68,
69,
70]. For example, the p53-activated lncRNA, promoter of CDKN1A antisense DNA damage activated RNA (
PANDA), was reported to inactivate the MAPK/ERK signaling pathway to suppress cell growth by a G0/G1 cell cycle arrest [
68] (
Figure 1A). By comparison, the firre intergenic repeating RNA element (
FIRRE) lncRNA was shown to function as an oncogene by activating Wnt/β-catenin signaling pathway via promoting nuclear translocation of β-catenin [
71] (
Figure 1B). Other lncRNAs, acting as post-transcriptional regulators, include the DBH antisense RNA 1 (
DBH-AS1) binding to the RNA-binding protein, BUD13 homolog (BUD13), controlling the translation of fibronectin 1 (
FN1) to positively regulate the proliferation, migration, and invasion of DLBCL cells [
72] (
Figure 1C). Another example is the EZH2-mediated lncRNA, FAS antisense RNA 1 (
FAS-AS1), which binds to the RNA-binding protein, RNA binding motif protein 5 (RBM5), to control alternative splicing of Fas cell surface death receptor (
FAS) mRNA [
73]. Yet, other lncRNAs were shown to directly control protein modifications to influence tumor growth, as in the case of
TUG1, which reduces protein levels of the MET proto-oncogene, receptor tyrosine kinase (MET) by promoting its ubiquitination [
74].
Among various functions of lncRNAs, there is an increasing trend to investigate lncRNAs as miRNA sponges by viewing lncRNAs as an extra layer of the post-transcriptional regulatory machinery to fine-tune miRNA-mediated control of protein abundance. For example, the nuclear paraspeckle assembly transcript 1 (
NEAT1) was shown to bind
miR-34b-5p to affect the proliferation of DLBCL cell lines by targeting the GLI family zinc finger 1 (
GLI1) [
75]. Two lncRNAs have been implicated in the proliferation of DLBCL cells, the paternally expressed 10 (
PEG10) sponging
miR-101-3p, which targets kinesin family member 2A (
KIF2A) [
76], and SMAD5 antisense RNA 1 (
SMAD5-AS1) that binds
miR-135b-5bp to regulate the translation of the APC regulator of the WNT signaling pathway (
APC) [
77].
Small nucleolar RNAs (snoRNAs) are a class of small RNA molecules that assist in chemical modifications of other regulatory RNAs (e.g., rRNAs and tRNAs) [
78,
79]. Similar to miRNAs, snoRNAs are processed from much longer host genes, which are categorized as lncRNAs. In DLBCL, three snoRNA host genes have been proposed to function as miRNA sponges: small nucleolar RNA host gene 12 (
SNHG12) sequesters
miR-195 to control the cell growth, migration, and invasion of DLBCL cells in vitro [
80]; small nucleolar RNA host gene 14 (SNHG14) sponges
miR-5590-3p to upregulate Zinc finger E-box binding homeobox 1 (
ZEB1); and small nucleolar RNA host gene 16 (
SNHG16) sequesters
miR-497-5p, to derepress the Pim-1 proto-oncogene, serine/threonine kinase (
PIM1).
Since a single miRNA can modulate hundreds of mRNAs, it is not surprising that one miRNA can potentially be sequestered by several lncRNAs.
MiR-195, which is a member of the
miR-15/107 family, is a known tumor suppressor, whose dysregulation is linked to human diseases, including Alzheimer’s disease, cardiac hypertrophy, and many cancers [
81,
82]. As mentioned above [
80],
miR-195 is sequestered by
SNHG12. In DLBCL, another lncRNA,
MALAT1, was shown to regulate expression of the CD274 molecule (
PD-L1) via
miR-195 [
83]. If the search is extended to outside of DLBCL, more than two dozen lncRNAs are suggested to sponge
miR-195, including AFAP1 antisense RNA 1 (
AFAP1-AS1) [
84], AGAP2 antisense RNA 1 (
AGAP2-AS1) [
85], cytoskeleton regulator RNA (
CYTOR) [
86], maternally expressed 3 (
MEG3) [
87],
NEAT1 [
88], OIP5 antisense RNA 1 (
OIP5-AS1) [
89], urothelial cancer associated 1 (
UCA1) [
90,
91], and
XIST [
92] to name a few (
Figure 1D). Thus, it is clear that the studying of lncRNAs as miRNA sponges is far more complex than one would hope for, which suggests that it would be very challenging to use lncRNAs as miRNA sponges for therapeutic purposes.
4. Differential Expression of lncRNAs in Rituximab Sensitive and Resistant DLBCL Cell Lines
Understanding the role of lncRNAs in rituximab-resistant DLBCL phenotype could potentially guide the development of improved DLBCL therapies, thereby increase the survival rate of DLBCL patients, and cut the treatment costs. As evident from the current findings of lncRNAs in DLBCL, the number of lncRNAs identified and characterized is still limited. To provide further insights into the role of lncRNAs in DLBCL, we analyzed the transcription profiles of DLBCL cell lines, which are either sensitive or resistant to rituximab, by RNA-sequencing (
Figure 2A). These line cells are from different individuals and were not treated with rituximab to uncover the intrinsic transcriptomic differences that underlie drug resistance. We uncovered 195 up- and 428 down-regulated genes in rituximab-resistant DLCBL cell lines compared to sensitive cell lines (
Figure 2B), suggesting that there are large gene expression differences inherent to rituximab resistance. When Gene Ontology (GO) analysis for differentially expressed genes (both up- and down-regulated genes combined) was performed, several key signaling pathways known to be involved in tumor development were enriched (
Figure 2C); most notably the GTPase signaling pathway [
93,
94] (
Figure 2D), confirming the intrinsic differences in DLBCL cell lines compared.
Of the 458 differentially expressed genes, 123 are classified as lncRNAs by the latest annotation provided by the Ensembl database (GRCh38 version 100), comprising 54 up- and 69-down regulated lncRNAs (
Supplementary Table S1). To initiate the dissection of the molecular signatures of the differentially expressed lncRNAs, additional bioinformatic analyses were performed. First, the promoter regions of the differentially expressed lncRNAs were examined for potential binding sites of transcription factors to uncover gene regulatory networks of lncRNA expression. Among the 350 transcription factors with potential binding sites in the promoter regions, 10 transcription factors were differentially expressed in rituximab-resistant compared to rituximab-sensitive DLCBL cell lines (
p < 0.05) (
Figure 3A;
Supplementary Table S2). Among the differentially expressed transcription factors, the homeodomain transcription factor Meis homeobox 1 (MEIS1) is predicted to bind most sites in the highest number of promoter regions of the differentially expressed lncRNAs. Although MEIS1 is down-regulated in rituximab-resistant DLCBL cell lines, a recent study reported that the MYC proto-oncogene, bHLH transcription factor (MYC)-dependent down-regulation of MEIS1 is linked to the tumor development and progression via elevated homeobox B13 (HOXB13) expression and androgen receptor (AR) activity in prostate cancers [
97], suggesting that the resistance to rituximab may be linked to the down-regulation of MEIS1, which, in turn, could regulate the expression of downstream lncRNAs. Furthermore, the Kruppel like factor 5 (KLF5) is the only up-regulated transcription factor in rituximab-resistant DLCBL cell lines. Interestingly, a previous report found KLF5 to be upregulated in human breast cancer cells treated with the HER2/epidermal growth factor receptor inhibitor, lapatinib [
98]. Together with its family member KLF4, KLF5 was shown to induce expression of the anti-apoptotic factor, MCL1 (a BCL2 family membe
r), to coordinate the gene regulatory program in resistance to lapatinib. Thus, further studies are warranted to uncover the importance of KLF5 in DLCBL in relation to regulation of lncRNA expression and rituximab resistance.
Besides lncRNAs functioning as miRNA sponges, increasing evidence suggests that many lncRNAs sequester RNA-binding proteins (RBPs) to influence mRNA degradation and stability. Such lncRNAs are collectively called RBP sponges [
101]. Interestingly, the up-regulated lncRNAs in rituximab-resistant DLCBL cell lines harbor many putative RBP binding sites within their sequences (
Supplementary Table S3). In particular, the cholesterol-induced regulator of metabolism RNA (
CHROMR, also known as
CHROME and
PRKRA-AS1) has the fourth highest number of RBP binding sites among all up-regulated lncRNAs examined (
Supplementary Table S3). A previous study showed that
CHROMR expression is increased in the plasma and atherosclerotic plaques of individuals with coronary artery disease [
102]. Mechanistically,
CHROMR either sequesters or degrades a set of miRNAs (i.e.,
miR-27b,
miR-33a,
miR-33b, and
miR-128) to repress the expression of genes mediating cholesterol transport in human hepatocytes and macrophages [
102]. As cholesterol metabolism has been reported to drive tumor growth and invasion [
103], it would be of high interest to study the intertwined link between cholesterol metabolism and DLCBL [
104]. Given that many RBPs are involved in cholesterol metabolism and
CHROMR is 3.4-fold up-regulated in rituximab-resistant DLCBL cell lines [
103], in addition to being miRNA sponge,
CHROMR could function as an RBP sponge to modulate the translation of genes mediating cholesterol transport (
Figure 3B).