Expression of lncRNAs in Low-Grade Gliomas and Glioblastoma Multiforme: An In Silico Analysis

In an in silico study, Anindya Dutta and colleagues study the expression of long noncoding RNAs in brain tumors and normal brain tissue and the possible implications for disease prognosis.

Published in the journal: Expression of lncRNAs in Low-Grade Gliomas and Glioblastoma Multiforme: An In Silico Analysis. PLoS Med 13(12): e32767. doi:10.1371/journal.pmed.1002192
Category: Research Article
doi: 10.1371/journal.pmed.1002192


In an in silico study, Anindya Dutta and colleagues study the expression of long noncoding RNAs in brain tumors and normal brain tissue and the possible implications for disease prognosis.


Malignant gliomas are the most common aggressive primary brain tumor, with nearly 23,000 new cases diagnosed each year in the US [1]. The most aggressive malignant gliomas, anaplastic astrocytoma and glioblastoma multiforme (GBM), have 5-y survival rates of 23% and 5%, respectively. World Health Organization grade II and III gliomas are less aggressive than grade IV glioblastomas (GBMs), and have been grouped together by The Cancer Genome Atlas (TCGA) as lower grade gliomas (LGGs). Once thought to be a single disease, GBMs are now recognized as having a considerable level of intertumor heterogeneity, and studies have found that GBMs can be subdivided into four subtypes, proneural, neural, classical, and mesenchymal, based on their transcriptional profile [2,3]. Importantly, these subtypes are associated with differing clinical outcomes, including varying responses to intensive therapy and differences in overall survival [3]. Similar to GBMs, LGGs can be categorized into distinct subtypes, IDH1/2 mut, IDH1/2 mut 1p19q codeletion, and IDH1/2 wt, based on IDH1/2 mutational status and the presence of a codeletion of 1p19q [4]. Each subtype has distinct clinical phenotypes, with the IDH1/2 wt subtype being the most aggressive and dissimilar to the other LGG subtypes [4,5]. Although knowledge of tumor subtype has clinical utility, the best prognostic indicator for patients with glial tumors is the mutational status of IDH1 and IDH2 [6]. In LGGs, patients with wild-type IDH1/2 have a median survival of 1.7 y, while those with mutant IDH1/2 have a median survival between 6.3 and 8.0 y. In GBMs, the corresponding median survival estimates are 1.1 and 2.1 y for wild-type and mutant IDH1/2, respectively [4].

Proteins have been thought to be the primary functional effectors of cells until relatively recently, when the roles of noncoding RNAs (ncRNAs) began to be appreciated for their contributions to most biological processes. Spurred by large sequencing consortia such as ENCODE and FANTOM, interest in ncRNAs has grown rapidly, in part due to the discovery that the vast majority of the mammalian genome is transcribed and that most of the resulting transcripts do not code for proteins [79].

Long noncoding RNAs (lncRNAs) are a class of ncRNAs greater than 200 bp in length that do not code for a protein. lncRNAs have mechanistically diverse functions in the cell, and in the nucleus have been shown to regulate gene expression either in cis or in trans by recruiting chromatin-modifying complexes to promoters of target genes [10,11]. Also, lncRNAs have been found to regulate gene expression by promoting long-distance genomic interactions [12]. Other, mainly cytoplasmic, lncRNAs have been shown to regulate the protein concentrations produced from target genes in part by affecting mRNA stability or the translational efficiency of an mRNA [1315].

Recent work has shown that lncRNAs play a critical role in various biological pathways including the immune system [16], muscle differentiation [17,18], neural lineage commitment, lineage specification, and synaptogenesis [1923]. In addition to their role in normal physiological processes, lncRNAs are also important regulators of disease processes [24,25]. In cancer, lncRNAs can act as either tumor suppressors or oncogenes, and have been shown to regulate tumor growth and metastasis in breast, prostate, and liver cancer [10,2628]. Although some lncRNAs have been linked to brain tumor development and pathogenesis, the overall study of lncRNAs in brain tumors has lagged behind [2932]. In this study, we sought to categorize all dysregulated lncRNAs in glial tumors and to identify lncRNAs that are associated with patient prognosis.


Ethics Statement

All patient samples, including 15 primary GBM specimens and five normal brain specimens, were obtained from consented patients undergoing surgical treatment at the University of Virginia and were acquired in accordance with a protocol approved by the University of Virginia’s institutional review board.

Planning the Analysis

There was no protocol or prospective analysis plan for the study. Unaligned sequencing reads for TCGA GBMs and LGGs were downloaded from the Cancer Genomics Hub. Normal brain sample SRA files (SRP033725, SRP045638, SRP044668, and SRP048683) [3336] were downloaded from the Sequence Read Archive (SRA) database on October 4, 2014. Most RNA sequencing (RNA-seq) samples in TCGA originate from patients from the US; however, TCGA collects patient samples from other countries as well, including Canada, Russia, and Italy. The LGG survival algorithm was devised in November 2015; 70% of patients were used as a test set, and 30% were retained for a validation set. Following suggestions by a reviewer in September 2016, this division was subsequently changed to 60% of patients in the test set and the remaining 40% in the validation set.

Identification of Novel lncRNAs and Quantification of lncRNA Abundance

The aforementioned SRA files were converted to fastq files using the SRA Toolkit v2.3.5. All fastq files were aligned to the hg38 reference genome with Tophat2 using default settings [37]. Novel transcripts (transcripts not found in reference transcript annotation files: GENCODE and RefSeq) were identified in each sample using Cufflinks2 de novo assembly [38]. A consensus transcript assembly was generated using Cuffmerge. Novel transcripts whose genomic coordinates did not intersect with known transcripts from a custom GTF file consisting of transcripts from GENCODE v21, RefSeq, and Cabili et al. [39] were kept for further validation.

We determined the coding potential for each novel transcript using CPAT (Coding-Potential Assessment Tool) and intersection with a mass spectrometry database. First, the in silico coding potential of each novel transcript was assessed using CPAT [40], and any transcripts with a CPAT score above 0.5242 were considered transcripts of unknown coding potential and were not included in downstream analysis. Second, we mapped all unique peptides from the ProteomicsDB [41] mass spectrometry database to all known proteins. All potential ORFs within each novel transcript were translated, and all unmapped peptides were mapped on the translated ORFs. Any novel transcript with more than one mapped peptide was not considered for downstream analysis. All novel transcripts that met these criteria were considered as novel lncRNAs and added to our finalized GTF file. Using the finalized GTF file, the expression of all genes was quantified using Cuffquant and Cuffnorm.

Validating lncRNA Expression in Clinical Samples

Fresh-frozen GBM and normal brain tissue samples were obtained from the University of Virginia, and RNA was isolated using Trizol (Thermo Fisher). RNA treated with DNase (RQ1 Promega, Thermo Fisher) was used for reverse transcription with SuperScript III (Thermo Fisher). Quantitative real-time PCR (RT-PCR) was performed on tissue cDNA using SYBR Green (Thermo Fisher), and the expression of LINC00152, TUNAR, and LINC01476 was normalized to that of the housekeeping gene encoding actin.

Identifying Differentially Expressed lncRNAs and lncRNAs Associated with Mutation Status and Subtype

To identify differentially expressed lncRNAs, we selected only lncRNAs with a median expression greater than 0.5 fragments per kilobase of exon per million fragments mapped (FPKM); 4,288 lncRNAs in LGGs and 3,297 lncRNAs in GBMs met this threshold and were used in our downstream analysis. Expression values for each lncRNA in 170 GBM samples, 497 LGG samples (we removed 16 samples from our initial pool of 513 LGGs due to >15% of transcripts having expression values three standard deviations above or below the mean expression for all LGGs), and 78 normal tissue samples were used to calculate the Kolmogorov-Smirnov test (KS test) statistic [42]. lncRNAs with a Benjamini-Hochberg-corrected false discovery rate (FDR) [43] of <0.05 and a fold change greater than or equal to four were considered differentially expressed. The correlation between copy number variation (CNV) and lncRNA dysregulation was determined by calculating the Spearman correlation coefficient between lncRNA expression and the copy number segment mean (Tier 3 TCGA data accessed from the Broad GDAC Firehose; of the genomic region that gives rise to each lncRNA. lncRNAs with Spearman correlation coefficients greater than or equal to 0.2 were considered to be correlated. The overlap between GBM and LGG differentially expressed lncRNAs was measured using the Jaccard index, a statistic that compares the similarity and divergence of two datasets, defined as the intersection of two datasets divided by the union of the datasets:

To identify lncRNAs associated with individual somatic mutations, we separated patients into two groups based on the presence of nonsynonymous mutation or a non-inframe insertion or deletion in commonly mutated protein-coding genes (prevalence of 5% or greater). Differential expression was measured using the same statistical method mentioned above, with a log2 fold change of greater than 0.5 as a cutoff.

Subtype-associated lncRNAs were selected by separating tumors into groups based on their subtype and using the KS test statistic to determine if the expression of a given lncRNA was different in a specific subtype (FDR < 0.05). Specifically, a lncRNA was considered subtype specific only if its expression was statistically different from each other subtype in a paired comparison. GBMs were separated into subtypes based on predesignated subtyping (Cancer Genome Browser). LGGs subtypes were determined by identifying the mutational status of IDH1 or IDH2 from TCGA’s preprocessed mutation calling data and identifying LGGs with 1p19q deletions from TCGA’s preprocessed CNV data. The statistical significance of the overlap between GBM and LGG subtype-specific lncRNAs was determined by comparing the observed lncRNA overlap to the overlap obtained through 1,000 random iterations of the two sample sets, keeping the number of lncRNAs in each sample equal to that in the observed subtype-specific lncRNA sets.

LGG Survival Prediction

To create a lncRNA survival model for LGGs, we first randomly selected 60% of patients to serve as a test set and reserved the remaining 40% of patients for independent validation. Random subsamples of 66% of the test set of patients were subjected to a multivariate Cox regression [44] survival model. This was repeated with 100 random subsamples from the test set of patients. Age, grade, sex, IDH1/2 mutational status (S7 Table), and inverse normalized lncRNA expression levels were used as variables in the LGG survival model. In total, 64 lncRNAs had statistically significant Cox coefficients in 80% of the 100 subsamples and were included in our survival algorithm. To combine the predictive power of the prognostic lncRNAs, the following steps were taken. For every patient in our test set, the expression of each prognostic lncRNA was compared to the average expression of that lncRNA in patients from the test set. If the absolute value of the expression Z-score of a given lncRNA was ≥1, the median Cox coefficient of that lncRNA from the 100 subsamples was added to a summed Cox coefficient, and this was repeated for each of the prognostic lncRNAs (S2 and S3 Figs). Patients were divided into two groups based on whether the summed Cox coefficient was positive (poor prognosis) or negative (good prognosis), consistent with the interpretation that a Cox coefficient > 0 indicates a poor outcome and a Cox coefficient < 0 indicates a good outcome. The survival differences of the groups were displayed on a Kaplan-Meier survival curve. We independently applied this survival algorithm, as mentioned above, to the validation patient set (the retained 40% of patients).

lncRNAs Associated with Survival in GBM and GBM Subtypes

Prognostic lncRNAs were identified using a Cox proportional hazard model similar to that stated above, except that we included age, sex, and inverse normalized lncRNA expression in the survival model. lncRNAs that were associated with prognosis (p-value < 0.05) were then separated based on whether they predicted a poor or good prognosis. We predicted the possible pathways that each lncRNA is involved in using guilt-by-association analysis, as previously described [45]. To identify lncRNAs that predict survival in each subtype, we performed Cox regression for each lncRNA in a given subtype (based on subtypes specified by the Cancer Genome Browser) and selected only lncRNAs with a p-value of ≤0.05.


Identifying Novel lncRNAs

In order to identify and catalog all novel lncRNAs (unannotated lncRNAs) in brain cancers, we used the Tuxedo Suite [37,38] to align, assemble, and quantify the expression of novel and annotated transcripts from 170 GBMs and 497 LGGs originating from TCGA and 78 normal brain samples from both TCGA and publicly available datasets (Fig 1A; S1 Table). We initially filtered all novel transcripts from the consensus transcriptome that (1) overlapped with any annotated transcript, (2) were less than 200 bp, or (3) did not contain a splice junction. We next assessed the coding potential of all novel transcripts using both in silico predictions as well as intersection with a human proteome database with peptide data from over 100 cell lines and 60 tissues that, importantly, includes brain tissue [41].

Identification of novel lncRNAs in glial tumors and normal brain RNA-seq samples.
Fig. 1. Identification of novel lncRNAs in glial tumors and normal brain RNA-seq samples.
(A) Overview of analysis pipeline for identifying novel lncRNAs and determining their associations with clinically relevant phenotypes. (B) Cumulative distribution function plot of CPAT scores demonstrates that the majority of novel transcripts are predicted to not code for proteins (CPAT score < 0.5242). (C) Metagene plot of H3K4me3 ChIP-seq data from U87 cell samples shows enrichment in promoters of protein-coding genes and novel lncRNAs but not in a randomized genomic control. GBM, glioblastoma multiforme; LGG, lower grade glioma; lncRNA, long noncoding RNA; RNA-seq, RNA sequencing.

CPAT (Coding-Potential Assessment Tool) determines the coding potential of a novel transcript based on relative ORF size, codon bias, and nucleotide hexamer bias. Any transcripts with a CPAT score ≥ 0.5242—a threshold that separates noncoding RNAs and protein-coding genes [46]—were removed from further consideration (Fig 1B). We next sought to determine if there was any biological evidence of protein products derived from the novel transcripts by parsing data from ProteomicsDB [41]. Peptides from the database were first aligned to all known proteins, and any unaligned peptide was then aligned to all translated ORFs within the novel transcripts. Any transcript with two or more mapped peptides, suggesting that it could potentially be a novel protein-coding gene, was removed from downstream analysis. Only seven novel transcripts had more than two aligned peptides and were removed from our downstream analysis, which is in line with other studies reporting low levels of spurious ribosomal associations with ncRNAs [47]. After filtering our list of novel transcripts, we identified over 2,700 novel lncRNAs. After cataloging the novel lncRNAs expressed in LGGs, GBMs, and normal brain, the analyses were carried out with the entire pool of lncRNAs (both novel and annotated) whose average expression was greater than 0.5 FPKM.

Similar to protein-coding genes, lncRNAs are often transcribed by RNA polymerase II and share similar active chromatin marks of H3K4me3 on their promoters. We therefore tested whether H3K4me3 was located at the promoters of the novel lncRNAs identified in this study, using H3K4me3 ChIP-seq data from U87 cells. The promoters of the novel lncRNAs were enriched in H3K4me3 chromatin marks relative to a randomized genomic control, although to a lesser extent than the promoters of protein-coding genes (Fig 1C).

Dysregulation of lncRNAs in Brain Cancers

Although recent work has begun to address the role of lncRNAs in brain tumors, very few lncRNAs have been found to be dysregulated in glial tumors [30,48]. Therefore, we sought to form a comprehensive list of all lncRNAs whose expression is significantly altered in brain tumors. TCGA RNA-seq data for glial tumors have very few accompanying normal brain samples, making comparisons between normal and tumor groups difficult due to a lack of adequate sample size. To bolster our ability to identify dysregulated lncRNAs in glial tumors, we included RNA-seq data from publicly available normal brain samples that were obtained from regions of the brain where glial tumors commonly arise (e.g., cortex, and excluding regions such as hippocampus and cerebellum; see Methods).

Using our normal brain samples as a comparison, we tested whether any lncRNAs were over- or underexpressed in GBMs or LGGs relative to normal brain. We identified 454 upregulated and 642 downregulated lncRNAs in GBMs and 403 upregulated and 340 downregulated lncRNAs in LGGs that all had FDRs of <0.05, a statistic that takes into account errors that may arise from multiple testing (see Methods), and had fold changes greater than four (S1 Table). Of these dysregulated lncRNAs, over 80 were newly identified in this study. We tested whether the expression differences of the dysregulated lncRNAs could be explained by genomic CNV in GBMs and LGGs. Consistent with previous studies [49], only a fraction (19% and 20% from GBMs and LGGs, respectively) of the differentially expressed lncRNAs were associated (Spearman correlation coefficient of 0.2 or greater) with tumor CNV (S1 Fig). This suggests that other mechanisms, in addition to CNV, play a role in regulating the changes in lncRNA expression observed in glial tumors.

We have highlighted several differentially expressed lncRNAs in GBMs and LGGs in Fig 2A and 2B, respectively. Of note, our analysis confirmed previous work that identified a lncRNA, CRNDE, that is upregulated in a number of tumors, including GBMs [29]; in our analysis, CRNDE is upregulated over 40-fold in GBMs compared to normal brain (Fig 2A). Furthermore, we also identified TUNAR as being severely downregulated in all glial tumors, almost 45-fold in GBMs and 14-fold in LGGs (Fig 2A and 2B). This is interesting, as other work has shown that TUNAR is a crucial positive regulator of neuronal development and differentiation in zebrafish, mice, and humans, which suggests that brain tumors require the downregulation of TUNAR in order to gain oncogenic properties and escape the restrictions on neuronal cell growth [50,51]. In order to further validate our analysis, we measured by RT-PCR the expression of one lncRNA, LINC00152, which is upregulated 20-fold in GBMs in TCGA data (Fig 2A). Using normal brain and GBM tumor tissue from patients at the University of Virginia, we validated the altered expression of three lncRNAs, LINC00152, TUNAR, and LINC01476. LINC00152 was found to have 3-fold higher expression in tumor tissue relative to normal brain tissue (Fig 2C). TUNAR and LINC01476, which both have lower expression in GBMs relative to normal brain tissue in TCGA, were found to have 12-fold and nearly 100-fold higher expression in normal brain tissue compared to GBM tissue, respectively (Fig 2C).

Large alterations in lncRNA expression in glial tumors.
Fig. 2. Large alterations in lncRNA expression in glial tumors.
(A) Boxplot of ten candidate lncRNAs that are differentially expressed in GBMs compared to normal tissue (blue = upregulated, red = downregulated, grey = no change). (B) Boxplot of nine candidate lncRNAs that are differentially expressed in LGGs compared to normal tissue (blue = upregulated, red = downregulated, grey = no change). (C) Real-time PCR of LINC00152, LINC01476, and TUNAR in 15 GBM and five normal brain samples confirms upregulation of LINC00152 and downregulation of LINC01476 and TUNAR in GBMs compared to normal brain. Expression values for each lncRNA are normalized to that of the gene encoding actin. (D) Large overlap in differentially expressed lncRNAs between GBMs and LGGs. GBM, glioblastoma multiforme; LGG, lower grade glioma; lncRNA, long noncoding RNA.

We next tested whether there is any overlap between the differentially expressed lncRNAs in GBMs and LGGs. Interestingly, there was a large degree of overlap in both the upregulated and downregulated lncRNAs, with a Jaccard index (described in Methods) of 0.4 and 0.45, respectively (Fig 2D). Unlike other tumors, whose tumor grades are more commonly viewed as being along a disease continuum, GBMs and their grade II and III counterparts are not typically regarded as being different stages of a single disease [49]. However, our results suggest that there is a great deal of similarity in the lncRNA profile of GBMs and LGGs. Some of the overlap could be due to the need of glial tumors to downregulate genes related to the differentiation of glia or neurons, though it is unlikely that such de-differentiation would account for such a high degree of overlap between LGGs and GBMs.

lncRNAs Associated with Patient Tumor Mutation Status

Somatic mutations are well-known drivers of tumorigenesis, and their profound effects on the cell’s transcriptional landscape have been well characterized [5254]. Although most studies have focused on changes in protein-coding gene expression, recent work has begun to show that somatic mutations can lead to large alterations in lncRNA expression as well [5557]. Through the TCGA consortium, many recurrent somatic mutations have been identified in GBMs and LGGs, many of which are shared between the two groups [4,58]. In order to determine what effect these mutations might have on the lncRNA transcriptome, for each commonly mutated gene (S7 Table), we separated patients into groups based on their tumor mutational status and then tested whether the expression of any lncRNA is altered in GBMs and LGGs harboring each common somatic mutation.

We identified hundreds of lncRNAs that were differentially expressed (as described in Methods) in mutated versus non-mutated GBMs and LGGs (Fig 3A and 3C; S2 and S3 Tables). Interestingly, in GBMs there was little overlap in the differentially expressed lncRNAs between different somatic mutation groups (Fig 3B). In contrast, LGGs had a higher degree of overlap between mutation-associated lncRNAs (Fig 3D).

Many lncRNAs are associated with common mutations in GBMs and LGGs.
Fig. 3. Many lncRNAs are associated with common mutations in GBMs and LGGs.
(A) Stacked bar graph of mutation-associated lncRNAs in GBMs. (B) Minimal overlap between mutation-associated lncRNAs in GBMs (red = upregulated, blue = downregulated, grey = no change). (C) Stacked bar graph of mutation-associated lncRNAs in LGGs shows robust deregulation depending on tumor mutational background. (D) Moderate overlap between mutation-associated lncRNA expression trends in GBMs; however, each group of mutation-associated lncRNAs represents a distinct set of GBMs (red = upregulated, blue = downregulated, grey = no change). GBM, glioblastoma multiforme; LGG, lower grade glioma; lncRNA, long noncoding RNA.

lncRNAs Associated with Cancer Subtypes

Work by TCGA and others has found that both GBMs and LGGs are not homogenous collections of tumors, but can rather be categorized into separate subtypes [25]. Each of the glial tumor subtypes is clinically distinct, and understanding the lncRNAs associated with a particular subtype could help to better distinguish between the groups or possibly identify novel therapeutic targets. To this end, we separated patients based on their tumor subtype (S7 Table) and determined whether there were any lncRNAs that were specifically expressed in a given subtype. We identified 64, 211, 95, and 71 lncRNAs that were specifically up- or downregulated in neural, proneural, mesenchymal, and classical GBMs, respectively (Fig 4A; S4 Table). Thirteen of these lncRNAs were novel lncRNAs identified in this study. Furthermore, 1,357, 1,216, and 466 lncRNAs were specifically up- or downregulated in IDH1/2 wt, IDH1/2 mut, and IDH1/2 mut 1p19q codeletion LGGs, respectively (Fig 4B; S5 Table).

Identification of subtype-associated lncRNAs reveals similarities between GBM and LGG subtypes.
Fig. 4. Identification of subtype-associated lncRNAs reveals similarities between GBM and LGG subtypes.
(A) Heatmap of all lncRNAs in GBMs with subtype-specific expression patterns demonstrates large expression differences between GBM subtypes. (B) Heatmap of all lncRNAs in LGGs with subtype-specific expression patterns demonstrates large expression differences between LGG subtypes. (C) Overlap of IDH1/2 wt LGG subtype-specific lncRNAs with GBM subtype-specific lncRNAs reveals similarities between mesenchymal GBMs and IDH1/2 wt LGGs. (D) Overlap of GBM mesenchymal subtype-specific lncRNAs with each group of LGG subtype-specific lncRNAs reveals similarities between mesenchymal GBMs and IDH1/2 wt LGGs. * p < 0.00005, ** p < 0.00001. GBM, glioblastoma multiforme; LGG, lower grade glioma; lncRNA, long noncoding RNA.

Traditionally, GBMs and LGGs have been viewed as distinct oncological entities; however, recent work has begun to suggest that IDH1/2 wt LGGs might be more similar to GBMs than to their IDH1/2 mut LGG counterparts [49]. In order to better understand these similarities, we tested whether there is significant overlap (as described in Methods) between the differentially expressed genes (DEGs) of the IDH1/2 wt LGGs and the DEGs for each GBM subtype. Although there was no statistically significant overlap in DEGs between the neural GBM subtype and the IDH1/2 wt LGG subtype, the proneural GBM subtype had much less overlap with the IDH1/2 wt LGG subtype than would be expected by random chance (Fig 4C). This finding is consistent with the fact that proneural GBMs frequently have point mutations in IDH1/2 [3]. There was a slight increase in the overlap between classical GBM subtype DEGs and IDH1/2 wt LGG DEGs compared to the random model; however, this difference was not statistically significant (p = 0.055). Surprisingly, DEGs in mesenchymal GBMs had much higher overlap with DEGs in IDH1/2 wt LGGs compared to the random model (Fig 4C). We next determined whether the overlap between mesenchymal GBMs and IDH1/2 wt LGGs is specific to this LGG subtype or is found with the other LGG subtypes, by measuring the overlap of mesenchymal differentially expressed lncRNAs with differentially expressed lncRNAs from each LGG subtype. In contrast to the greater degree of overlap with the IDH1/2 wt subtype, both the IDH1/2 mut and IDH1/2 mut 1p19q codeletion subtypes had less overlap than would be expected by random chance (Fig 4D). These similarities in the lncRNA profiles of IDH1/2 wt LGGs and mesenchymal GBMs suggest that LGGs with wild-type IDH1/2 may progress to mesenchymal GBMs.

lncRNA Expression and Survival in LGG Patients

The main prognostic variable for patients with glial tumors is the mutational status of IDH1 or IDH2. In LGGs, recent work has shown that patients whose tumors also harbor 1p19q codeletions have a slightly better overall survival than patients with IDH1/2 mut tumors without 1p19q codeletions [4]. We decided to test whether the expression of lncRNAs can be used to separate patients into distinct survival groups, independent of IDH1/2 mutational status. To this end, we performed survival analysis utilizing a multivariate Cox proportional hazard model that included IDH1/2 mutational state, age, sex, tumor grade, and lncRNA expression as independent variables in the survival model. It is common to find extreme outliers in large RNA-seq datasets, which can negatively impact survival regression analysis. In order to correct for these outliers, the expression values for each lncRNA were inverse normal transformed, a procedure that increases the sensitivity and specificity of regression analysis using RNA-seq expression values [59]. To attempt to separate patients into distinct prognostic groups using lncRNAs, we randomly assigned 60% of LGG patients with complete clinical data (269 patients) to a test set, on which we performed Cox regression to identify lncRNAs associated with survival in this patient cohort. We then selected all lncRNAs that were significantly associated with survival and created a survival algorithm with variables for each lncRNA that were weighted based on each lncRNA’s contribution to overall survival (see Methods). This survival algorithm was then applied to the remaining 40% of patients (180 patients) who constituted the validation set (Figs 5A, S2 and S3).

Expression of a subset of lncRNAs is associated with survival in LGG patients.
Fig. 5. Expression of a subset of lncRNAs is associated with survival in LGG patients.
(A) Schematic of survival analysis: 60% of LGG patients were randomly selected to be the test set used to find survival-associated lncRNAs using Cox regression analysis. A summed Cox coefficient derived from 64 survival-associated lncRNAs (selected as in S3 Fig) was used to stratify patients in the test set into two survival groups. This same set of 64 lncRNAs was then used to derive the summed Cox coefficient in the validation set to separate them into two survival groups. (B) Survival plot shows the effective separation of patients from the test set into two distinct survival groups, good prognosis (GoodProg) and poor prognosis (BadProg), based on each patient’s summed Cox coefficient of the 64 lncRNAs (hazard ratio [HR] = 2.168, 95% CI = 1.765–2.807, p < 0.001). (C) The summed Cox coefficient for the same 64 lncRNAs is capable of separating patients from the validation set into two groups with very distinct survival probabilities (HR = 1.921, 95% CI = 1.333–2.767, p < 0.001). LGG, lower grade glioma; lncRNA, long noncoding RNA.

After performing Cox regression on our test set, we identified 64 lncRNAs that were consistently associated with survival. These lncRNAs were included in our survival algorithm, which was then applied to the test set. Each patient received a score, based on how many prognostic lncRNAs met our expression cutoff (see Methods), and patients were then divided into two groups, good prognosis and poor prognosis. Our algorithm separated the test set into groups of 85 and 184 patients with median survival times of 1,209 and 4,084 d, respectively (HR = 2.168, 95% CI = 1.765–2.807, p < 0.001) (Fig 5B; S6 and S8 Tables). We next applied this survival algorithm to the validation set and were successfully able to separate patients into distinct groups of 66 and 114 patients with median survival times of 2,235 and 4,412 d, respectively (HR = 1.921, 95% CI = 1.333–2.767, p < 0.001) (Fig 5C; S6 and S8 Tables).

Identifying lncRNAs Associated with Survival in GBMs

We next sought to identify all lncRNAs that were associated with overall survival in patients with GBMs. Using Cox regression we identified 584 lncRNAs that were associated with a poor prognosis and 282 lncRNAs that were associated with better survival outcomes (S9 Table). A subset of these lncRNAs were independently used to separate GBM patients based on lncRNA expression levels in the top third and bottom third of patients (55 patients), and Kaplan-Meier plots show that these groups were associated with prognosis with statistical significance (Fig 6A and 6B). Patients with high expression of RP11-334C17.6 had a median survival time of 485 d, while patients with low expression had a median survival time of 380 d (HR = 0.728, 95% CI = 0.6011–0.883, p = 0.00122) (Fig 6A). Patients with high and low expression of BTAT10 had median survival times of 335 and 485 d, respectively (HR = 1.298, 95% CI = 1.0881–1.548, p = 0.00374) (Fig 6B). However, unlike in the LGGs, we have not yet succeeded in combining the individually predictive lncRNAs into a survival algorithm that can predict prognosis in GBMs with statistical significance.

Survival-associated lncRNAs in GBMs and GBM subtypes.
Fig. 6. Survival-associated lncRNAs in GBMs and GBM subtypes.
(A and B) Representative survival plots of lncRNAs that predict survival in GBMs: RP11-334C17.6 (HR = 0.728, 95% CI = 0.6011–0.883, p = 0.00122) (A) and BTAT10 (HR = 1.298, 95% CI = 1.0881–1.548, p = 0.00374) (B). (C and D) lncRNAs associated with a poor prognosis (C) and good prognosis (D) in individual subtypes show minimal overlap between subtypes. GBM, glioblastoma multiforme; HR, hazard ratio; lncRNA, long noncoding RNA.

Unlike for proteins, ascertaining the function of a lncRNA based on sequence composition is extremely difficult. However, studies have shown that it is possible to infer what biological pathways a lncRNA might function in using guilt-by-association analysis, a technique that infers association of a lncRNA with a biological pathway based on the pathway enrichment of protein-coding genes whose expression is highly correlated with the lncRNA [45]. We used guilt-by-association analysis to determine what biological pathways are enriched in our positive and negative prognostic lncRNA groups. Interestingly, lncRNAs that are associated with a better prognosis in GBMs are more likely to be associated with signaling pathways, showing enrichment in protein kinase and phosphorylation pathways as well as signal transduction pathways. Conversely, lncRNAs that are associated with poor patient outcomes are highly associated with cell cycle pathways, immune response, and chromosome organization (Table 1).

Tab. 1. Enriched biological pathways associated with prognostic lncRNAs by guilt-by-association analysis.
Enriched biological pathways associated with prognostic lncRNAs by guilt-by-association analysis.
lncRNA, long noncoding RNA.

We next subdivided all of the GBM tumors into their respective subtypes and performed Cox regression with all lncRNAs in each subtype: 165, 128, 88, and 385 lncRNAs were associated with prognosis in classical, mesenchymal, neural, and proneural GBM subtypes, respectively. Given the transcriptional, clinical, and phenotypic differences between the subtypes, we then tested if there was any overlap in the identities of the positive and negative prognostic lncRNAs between subtypes. There was very little overlap noted between the prognostic lncRNAs in each subtype (Fig 6C and 6D), consistent with the hypothesis that the GBM subtypes arise from distinct mutational backgrounds and have very different biology.


Our analysis of RNA-seq data for grade II, III, and IV brain tumors and normal brain tissue has identified hundreds of dysregulated lncRNAs in glial tumors, many of which are associated with tumor subtype or mutational status. Using Cox regression, we identified a panel of 64 lncRNAs that are associated with survival in LGG patients. We also identified lncRNAs that are similarly associated with prognosis in each GBM subtype and found remarkably little overlap of prognostic lncRNAs between GBM subtypes.

The growing appreciation for the important roles that lncRNAs play in tumor development and progression necessitates having a means of prioritizing which lncRNAs should be studied in a given cancer type. Global analyses have been performed for tumor types other than GBMs and LGGs, such as squamous cell lung carcinomas and adenocarcinomas, as well as meta-analyses of all tumors within TCGA [46,57,60]. Although meta-analyses of lncRNAs are extremely important, they have not been especially informative for brain tumors for several reasons. First, due to the broad nature of the analyses, it is not possible to focus on the specific nuances of each tumor type (i.e., subtypes). Second, due to the limited number of normal brain samples in TCGA, GBMs and LGGs were not included in many of the meta-analyses, which relied on comparisons with a reference panel of normal tissues. Lastly, the effect of lncRNA expression on survival in individual tumor types was not a main focus of the studies [46,60]. This is important because, depending on the tumor context, a given lncRNA may act as a tumor suppressor or oncogene [61,62]. By focusing specifically on brain tumors and including over 70 normal brain tissue samples, our analysis provides unique insights into the roles of lncRNAs in aggressive brain cancers.

In addition to studying the roles of annotated lncRNAs, we identified 2,706 novel multi-exon lncRNAs that are present in either normal brain tissue or brain tumors, but are not annotated in the commonly used lncRNA databases (GENCODE and RefSeq). Many of these novel lncRNAs were differentially expressed in brain tumors and were associated with clinically relevant mutations. Although the exact mechanisms leading to altered lncRNA expression are not known, roughly 20% of differentially expressed lncRNAs were weakly correlated with chromosomal amplifications and deletions. We also identified several hundred lncRNAs that were associated with GBM and LGG subtypes. It is well known that IDH1/2 wt LGGs have clinical phenotypes and genomic alterations similar to those of primary GBMs [5]. Interestingly, the intersection of subtype-associated lncRNAs between GBMs and LGGs revealed transcriptional similarities between IDH1/2 wt LGGs and mesenchymal GBMs. Although our analysis suggests an evolutionary link between mesenchymal GBMs and IDH1/2 wt LGGs, it does not preclude other tumor evolutionary pathways leading to the formation of mesenchymal GBMs. Other groups have found evidence that suggests non-GCIMP (non-glioma-CpG island methylator phenotype) mesenchymal GBMs evolve from a proneural GBM precursor [63]. However, this evolutionary pathway does not explain the origin of all mesenchymal GBMs. Our analysis suggests that some mesenchymal GBMs might arise from undetected IDH1/2 wt LGGs, which at clinical presentation would appear to be mesenchymal GBMs.

There are several limitations to this analysis. One limitation is that, because RNA-seq data from TCGA were derived from bulk tumor specimens, we are unable to decipher whether the differences in expression that we have identified are a reflection of alterations in tumor cells’ transcriptional programs or a reflection of tumor heterogeneity and changes in the stromal composition of each individual tumor. Furthermore, although we validated the expression changes of some lncRNAs using independent patient-derived samples, more work is needed to confirm the expression differences we identified for lncRNAs between gliomas and normal brain, and among tumor subtypes and tumors of different mutational status. Another limitation is that the generation and testing of our survival algorithm were performed on the same dataset. Although the “validation” dataset was blinded in the algorithm generation, further validation of our survival analysis in a truly independent dataset is needed to determine the significance and robustness of the survival algorithm. Independent validation of lncRNAs that are associated with survival in GBMs and GBM subtypes is also needed.

As stated earlier, IDH1/2 mutational status is the primary prognostic indicator for glial tumors. Using multivariate survival analysis, we have shown that a panel of lncRNAs can be used to separate LGG patients into distinct prognostic groups. This group of lncRNAs could potentially be used to help identify at-risk patients who might require more intensive therapies, although further validation in an independent dataset is needed to fully test the utility of the survival algorithm. Furthermore, we have also found several hundred lncRNAs that are prognostic in GBMs as a whole, as well as in individual subtypes. In summary, we have performed the first global analysis, to our knowledge, of lncRNAs in LGGs and GBMs. Our analysis can serve as a valuable resource for those working in the field to prioritize which lncRNAs to study in brain cancers.

Supporting Information

Attachment 4

Attachment 5

Attachment 6

Attachment 7

Attachment 8

Attachment 9

Attachment 12

Attachment 13


1. Ostrom QT, Gittleman H, Liao P, Rouse C, Chen Y, Dowling J, et al. CBTRUS statistical report: primary brain and central nervous system tumors diagnosed in the United States in 2007–2011. Neuro Oncol. 2014 Oct;16(Suppl 4):iv1–63.

2. Phillips HS, Kharbanda S, Chen R, Forrest WF, Soriano RH, Wu TD, et al. Molecular subclasses of high-grade glioma predict prognosis, delineate a pattern of disease progression, and resemble stages in neurogenesis. Cancer Cell. 2006;9(3):157–73. doi: 10.1016/j.ccr.2006.02.019 16530701

3. Verhaak RGW, Hoadley KA, Purdom E, Wang V, Qi Y, Wilkerson MD, et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell. 2010;17(1):98–110. doi: 10.1016/j.ccr.2009.12.020 20129251

4. Cancer Genome Atlas Research Network, Brat DJ, Verhaak RG, Aldape KD, Yung WK, Salama SR, et al. Comprehensive, integrative genomic analysis of diffuse lower-grade gliomas. N Engl J Med. 2015;372(26):2481–98. doi: 10.1056/NEJMoa1402121 26061751

5. Suzuki H, Aoki K, Chiba K, Sato Y, Shiozawa Y, Shiraishi Y, et al. Mutational landscape and clonal architecture in grade II and III gliomas. Nat Genet. 2015;47(5):458–68. doi: 10.1038/ng.3273 25848751

6. van den Bent MJ, Dubbink HJ, Marie Y, Brandes AA, Taphoorn MJB, Wesseling P, et al. IDH1 and IDH2 mutations are prognostic but not predictive for outcome in anaplastic oligodendroglial tumors: a report of the European Organization for Research and Treatment of Cancer Brain Tumor Group. Clin Cancer Res. 2010;16(5):1597–604. doi: 10.1158/1078-0432.CCR-09-2902 20160062

7. ENCODE Project Consortium, Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007;447(7146):799–816. doi: 10.1038/nature05874 17571346

8. Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, et al. The transcriptional landscape of the mammalian genome. Science. 2005;309(5740):1559–63. doi: 10.1126/science.1112014 16141072

9. Katayama S, Tomaru Y, Kasukawa T, Waki K, Nakanishi M, Nakamura M, et al. Antisense transcription in the mammalian transcriptome. Science. 2005;309(5740):1564–6. doi: 10.1126/science.1112009 16141073

10. Gupta RA, Shah N, Wang KC, Kim J, Horlings HM, Wong DJ, et al. Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 2010;464(7291):1071–6. doi: 10.1038/nature08975 20393566

11. Trimarchi T, Bilal E, Ntziachristos P, Fabbri G, Dalla-Favera R, Tsirigos A, et al. Genome-wide mapping and characterization of notch-regulated long noncoding RNAs in acute leukemia. Cell. 2014;158(3):593–606. doi: 10.1016/j.cell.2014.05.049 25083870

12. Hacisuleyman E, Goff LA, Trapnell C, Williams A, Henao-Mejia J, Sun L, et al. Topological organization of multichromosomal regions by the long intergenic noncoding RNA Firre. Nat Struct Mol Biol. 2014;21(2):198–206. doi: 10.1038/nsmb.2764 24463464

13. Carrieri C, Cimatti L, Biagioli M, Beugnet A, Zucchelli S, Fedele S, et al. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature. 2012;491(7424):454–7. doi: 10.1038/nature11508 23064229

14. Kretz M, Siprashvili Z, Chu C, Webster DE, Zehnder A, Qu K, et al. Control of somatic tissue differentiation by the long non-coding RNA TINCR. Nature. 2013;493(7431):231–5. doi: 10.1038/nature11661 23201690

15. Yoon J-H, Abdelmohsen K, Srikantan S, Yang X, Martindale JL, De S, et al. LincRNA-p21 suppresses target mRNA translation. Mol Cell. 2012;47(4):648–55. doi: 10.1016/j.molcel.2012.06.027 22841487

16. Gomez JA, Wapinski OL, Yang YW, Bureau J-F, Gopinath S, Monack DM, et al. The NeST long ncRNA controls microbial susceptibility and epigenetic activation of the interferon-γ locus. Cell. 2016;152(4):743–54.

17. Mousavi K, Zare H, Dell’Orso S, Grontved L, Gutierrez-Cruz G, Derfoul A, et al. ERNAs promote transcription by establishing chromatin accessibility at defined genomic loci. Mol Cell. 2013;51(5):606–17. doi: 10.1016/j.molcel.2013.07.022 23993744

18. Mueller AC, Cichewicz MA, Dey BK, Layer R, Reon BJ, Gagan JR, et al. MUNC, a long noncoding RNA that facilitates the function of MyoD in skeletal myogenesis. Mol Cell Biol. 2015;35(3):498–513. doi: 10.1128/MCB.01079-14 25403490

19. Ng SY, Johnson, Stanton LW. Human long non-coding RNAs promote pluripotency and neuronal differentiation by association with chromatin modifiers and transcription factors. EMBO J. 2012;31(3):522–33. doi: 10.1038/emboj.2011.459 22193719

20. Ramos AD, Andersen RE, Liu SJ, Nowakowski TJ, Hong SJ, Gertz CC, et al. The long noncoding RNA Pnky regulates neuronal differentiation of embryonic and postnatal neural stem cells. Cell Stem Cell. 2015;16(4):439–47. doi: 10.1016/j.stem.2015.02.007 25800779

21. Rapicavoli NA, Poth EM, Blackshaw S. The long noncoding RNA RNCR2 directs mouse retinal cell specification. BMC Dev Biol. 2010;10(1):1–10.

22. Aprea J, Prenninger S, Dori M, Ghosh T, Monasor LS, Wessendorf E, et al. Transcriptome sequencing during mouse brain development identifies long non‐coding RNAs functionally involved in neurogenic commitment. EMBO J. 2013;32(24):3145–60. doi: 10.1038/emboj.2013.245 24240175

23. Bernard D, Prasanth K V, Tripathi V, Colasse S, Nakamura T, Xuan Z, et al. A long nuclear-retained non-coding RNA regulates synaptogenesis by modulating gene expression. EMBO J. 2010;29(18):3082–93. doi: 10.1038/emboj.2010.199 20729808

24. Kerin T, Ramanathan A, Rivas K, Grepo N, Coetzee GA, Campbell DB. A noncoding RNA antisense to moesin at 5p14.1 in autism. Sci Transl Med. 2012;4(128):128ra40. doi: 10.1126/scitranslmed.3003479 22491950

25. Zhao X, Tang Z, Zhang H, Atianjoh FE, Zhao J-Y, Liang L, et al. A long noncoding RNA contributes to neuropathic pain by silencing Kcna2 in primary afferent neurons. Nat Neurosci. 2013;16(8):1024–31. doi: 10.1038/nn.3438 23792947

26. Yuan JH, Yang F, Wang F, Ma JZ, Guo YJ, Tao QF, et al. A long noncoding RNA activated by TGF-β promotes the invasion-metastasis cascade in hepatocellular carcinoma. Cancer Cell. 2014;25(5):666–81. doi: 10.1016/j.ccr.2014.03.010 24768205

27. Sakurai K, Reon BJ, Anaya J, Dutta A. The lncRNA DRAIC/PCAT29 locus constitutes a tumor-suppressive nexus. Mol Cancer Res. 2015;13(5):828–38. doi: 10.1158/1541-7786.MCR-15-0016-T 25700553

28. Malik R, Patel L, Prensner JR, Shi Y, Iyer M, Subramaniyan S, et al. The lncRNA PCAT29 inhibits oncogenic phenotypes in prostate cancer. Mol Cancer Res. 2014;12(8):1081–7. doi: 10.1158/1541-7786.MCR-14-0257 25030374

29. Ellis BC, Molloy PL, Graham LD. CRNDE: a long non-coding RNA involved in cancer, neurobiology and development. Front Genet. 2012;3:270. doi: 10.3389/fgene.2012.00270 23226159

30. Wang Y, Wang Y, Li J, Zhang Y, Yin H, Han B. CRNDE, a long-noncoding RNA, promotes glioma cell growth and invasion through mTOR signaling. Cancer Lett. 2015;367(2):122–8. doi: 10.1016/j.canlet.2015.03.027 25813405

31. Zhang K, Sun X, Zhou X, Han L, Chen L. Long non-coding RNA HOTAIR promotes glioblastoma cell cycle progression in an EZH2 dependent manner. Oncotarget. 2014;6(1):537–46.

32. Zhang X, Kiang KM, Zhang GP, Leung GK. long non-coding RNAs dysregulation and function in glioblastoma stem cells. Noncoding RNA. 2015;1(1):69–86.

33. Gill BJ, Pisapia DJ, Malone HR, Goldstein H, Lei L, Sonabend A, et al. MRI-localized biopsies reveal subtype-specific differences in molecular and cellular composition at the margins of glioblastoma. Proc Natl Acad Sci U S A. 2014;111(34):12550–5. doi: 10.1073/pnas.1405839111 25114226

34. Akula N, Barb J, Jiang X, Wendland JR, Choi KH, Sen SK, et al. RNA-sequencing of the brain transcriptome implicates dysregulation of neuroplasticity, circadian rhythms and GTPase binding in bipolar disorder. Mol Psychiatry. 2014;19(11):1179–85. doi: 10.1038/mp.2013.170 24393808

35. Jaffe AE, Shin J, Collado-Torres L, Leek JT, Tao R, Li C, et al. Developmental regulation of human cortex transcription and its clinical relevance at single base resolution. Nat Neurosci. 2015;18(1):154–61. doi: 10.1038/nn.3898 25501035

36. Li J, Shi M, Ma Z, Zhao S, Euskirchen G, Ziskin J, et al. Integrated systems analysis reveals a molecular network underlying autism spectrum disorders. Mol Syst Biol. 2014;10(12):774.

37. Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg S. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14(4):R36. doi: 10.1186/gb-2013-14-4-r36 23618408

38. Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotech. 2013;31(1):46–53.

39. Cabili MN, Trapnell C, Goff L, Koziol M, Tazon-Vega B, Regev A, et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 2011;25(18):1915–27. doi: 10.1101/gad.17446611 21890647

40. Wang L, Park HJ, Dasari S, Wang S, Kocher J-P, Li W. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Res. 2013;41(6):e74. doi: 10.1093/nar/gkt006 23335781

41. Wilhelm M, Schlegl J, Hahne H, Gholami AM, Lieberenz M, Savitski MM, et al. Mass-spectrometry-based draft of the human proteome. Nature. 2014;509(7502):582–7. doi: 10.1038/nature13319 24870543

42. Massey FJ Jr. The Kolmogorov-Smirnov test for goodness of fit. J Am Stat Assoc. 1951;46(253):68–78.

43. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995;57(1):289–300.

44. Cox DR. Regression models and life-tables. J R Stat Soc Ser B. 1972;34(2):187–220.

45. Guttman M, Amit I, Garber M, French C, Lin MF, Feldser D, et al. Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature. 2009;458(7235):223–7. doi: 10.1038/nature07672 19182780

46. Iyer MK, Niknafs YS, Malik R, Singhal U, Sahu A, Hosono Y, et al. The landscape of long noncoding RNAs in the human transcriptome. Nat Genet. 2015;47(3):199–208. doi: 10.1038/ng.3192 25599403

47. Sauvageau M, Goff LA, Lodato S, Bonev B, Groff AF, Gerhardinger C, et al. Multiple knockout mouse models reveal lincRNAs are required for life and brain development. Elife. 2013;2013(2):1–24.

48. Shi Y, Wang Y, Luan W, Wang P, Tao T, Zhang J, et al. Long non-coding RNA H19 promotes glioma cell invasion by deriving miR-675. PLoS ONE. 2014;9(1):e86295. doi: 10.1371/journal.pone.0086295 24466011

49. Wen PY, Kesari S. Malignant gliomas in adults. N Engl J Med. 2008;359(5):492–507. doi: 10.1056/NEJMra0708126 18669428

50. Lin N, Chang KY, Li Z, Gates K, Rana Z, Dang J, et al. An evolutionarily conserved long noncoding RNA TUNA controls pluripotency and neural lineage commitment. Mol Cell. 2014;53(6):1005–19. doi: 10.1016/j.molcel.2014.01.021 24530304

51. Ulitsky I, Shkumatava A, Jan CH, Sive H, Bartel DP. Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell. 2011;147(7):1537–50. doi: 10.1016/j.cell.2011.11.055 22196729

52. Gerstung M, Pellagatti A, Malcovati L, Giagounidis A, Porta MG, Jädersten M, et al. Combining gene mutation with gene expression data improves outcome prediction in myelodysplastic syndromes. Nat Commun. 2015;6:5901. doi: 10.1038/ncomms6901 25574665

53. Stead LF, Berri S, Wood HM, Egan P, Conway C, Daly C, et al. The transcriptional consequences of somatic amplifications, deletions, and rearrangements in a human lung squamous cell carcinoma. Neoplasia. 2012;14(11):1075–86. 23226101

54. Bashashati A, Haffari G, Ding J, Ha G, Lui K, Rosner J, et al. DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer. Genome Biol. 2012;13(12):R124. doi: 10.1186/gb-2012-13-12-r124 23383675

55. Garzon R, Volinia S, Papaioannou D, Nicolet D, Kohlschmidt J, Yan PS, et al. Expression and prognostic impact of lncRNAs in acute myeloid leukemia. Proc Natl Acad Sci U S A. 2014;111(52):18679–84. doi: 10.1073/pnas.1422050112 25512507

56. Flockhart RJ, Webster DE, Qu K, Mascarenhas N, Kovalski J, Kretz M, et al. BRAFV600E remodels the melanocyte transcriptome and induces BANCR to regulate melanoma cell migration. Genome Res. 2012;22(6):1006–14. doi: 10.1101/gr.140061.112 22581800

57. White N, Cabanski C, Silva-Fisher J, Dang H, Govindan R, Maher C. Transcriptome sequencing reveals altered long intergenic non-coding RNAs in lung cancer. Genome Biol. 2014;15(8):429. doi: 10.1186/s13059-014-0429-8 25116943

58. Brennan CW, Verhaak RGW, McKenna A, Campos B, Noushmehr H, Salama SR, et al. The somatic genomic landscape of glioblastoma. Cell. 2013;155(2):462–77. doi: 10.1016/j.cell.2013.09.034 24120142

59. Zwiener I, Frisch B, Binder H. Transforming RNA-Seq data to improve the performance of prognostic gene signatures. PLoS ONE. 2014;9(1):e85150. doi: 10.1371/journal.pone.0085150 24416353

60. Yan X, Hu Z, Feng Y, Hu X, Yuan J, Zhao SD, et al. Comprehensive genomic characterization of long non-coding RNAs across human cancers. Cancer Cell. 2015;28(4):529–40. doi: 10.1016/j.ccell.2015.09.006 26461095

61. Matouk IJ, DeGroot N, Mezan S, Ayesh S, Abu-lail R, Hochberg A, et al. The H19 non-coding RNA is essential for human tumor growth. PLoS ONE. 2007;2(9):e845. doi: 10.1371/journal.pone.0000845 17786216

62. Yoshimizu T, Miroglio A, Ripoche M-A, Gabory A, Vernucci M, Riccio A, et al. The H19 locus acts in vivo as a tumor suppressor. Proc Natl Acad Sci U S A. 2008;105(34):12417–22. doi: 10.1073/pnas.0801540105 18719115

63. Ozawa T, Riester M, Cheng Y-K, Huse JT, Squatrito M, Helmy K, et al. Most human non-GCIMP glioblastoma subtypes evolve from a common proneural-like precursor glioma. Cancer Cell. 2014;26(2):288–300. doi: 10.1016/j.ccr.2014.06.005 25117714

Interné lekárstvo

Článok vyšiel v časopise

PLOS Medicine

2016 Číslo 12
Najčítanejšie tento týždeň
Najčítanejšie v tomto čísle
Zabudnuté heslo

Zadajte e-mailovú adresu, s ktorou ste vytvárali účet. Budú Vám na ňu zasielané informácie k nastaveniu nového hesla.


Nemáte účet?  Registrujte sa