A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset

dc.contributor.authorZaheed, Oza
dc.contributor.authorSamson, Julia
dc.contributor.authorDean, Kellie
dc.contributor.funderUniversity College Corken
dc.date.accessioned2022-11-04T12:35:37Z
dc.date.available2022-11-04T12:35:37Z
dc.date.issued2020-03-17
dc.date.updated2022-10-25T17:39:35Z
dc.description.abstractBreast cancer research has traditionally centred on genomic alterations, hormone receptor status and changes in cancer-related proteins to provide new avenues for targeted therapies. Due to advances in next generation sequencing technologies, there has been the emergence of long, non-coding RNAs (lncRNAs) as regulators of normal cellular events, with links to various disease states, including breast cancer. Here we describe our bioinformatic analyses of a previously published RNA sequencing (RNA-seq) dataset to identify lncRNAs with altered expression levels in a subset of breast cancer cell lines. Using a previously published RNA-seq dataset of 675 cancer cell lines, a subset of 18 cell lines was selected for our analyses that included 16 breast cancer lines, one ductal carcinoma in situ line and one normal-like breast epithelial cell line. Principal component analysis demonstrated correlation with well-established categorisation methods of breast cancer (i.e. luminal A/B, HER2 enriched and basal-like A/B). Through detailed comparison of differentially expressed lncRNAs in each breast cancer sub-type with normal-like breast epithelial cells, we identified 15 lncRNAs with consistently altered expression, including three uncharacterised lncRNAs. Utilising data from The Cancer Genome Atlas (TCGA) and The Genotype Tissue Expression (GETx) project via Gene Expression Profiling Interactive Analysis (GEPIA2), we assessed clinical relevance of several identified lncRNAs with invasive breast cancer. Lastly, we determined the relative expression level of six lncRNAs across a spectrum of breast cancer cell lines to experimentally confirm the findings of our bioinformatic analyses. Overall, we show that the use of existing RNA-seq datasets, if re-analysed with modern bioinformatic tools, can provide a valuable resource to identify lncRNAs that could have important biological roles in oncogenesis and tumour progression.en
dc.description.sponsorshipUniversity College Cork (Translational Research Access Programme, School of Medicine, UCC)en
dc.description.statusPeer revieweden
dc.description.versionPublished Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationZaheed, O., Samson, J. and Dean, K. (2020) ‘A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset’, Non-coding RNA Research, 5(2), pp. 48–59. https://doi.org/10.1016/j.ncrna.2020.02.004.en
dc.identifier.doi10.1016/j.ncrna.2020.02.004en
dc.identifier.endpage59en
dc.identifier.issn2468-0540
dc.identifier.issued2en
dc.identifier.journaltitleNon-Coding Rna Researchen
dc.identifier.startpage48en
dc.identifier.urihttps://hdl.handle.net/10468/13815
dc.identifier.volume5en
dc.language.isoenen
dc.publisherElsevieren
dc.relation.urihttps://doi.org/10.1016/j.ncrna.2020.02.004
dc.rights© 2020 Production and hosting by Elsevier B.V. on behalf of KeAi Communications Co., Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/BY-NC-ND/4.0/).en
dc.rights.urihttp://creativecommons.org/licenses/BY-NC-ND/4.0/en
dc.subjectBioinformaticsen
dc.subjectBreast canceren
dc.subjectDuctal carcinoma in situen
dc.subjectLong non-coding RNAs (lncRNAs) RNA sequencing (RNA-seq)en
dc.subjectQuantitative reverse transcriptase polymerase chain reaction (qRT-PCR)en
dc.titleA bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataseten
dc.typeArticle (peer-reviewed)en
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
3-Zaheed_NonCodingRNA2020.pdf
Size:
2.42 MB
Format:
Adobe Portable Document Format
Description:
Published version
Loading...
Thumbnail Image
Name:
1-s2.0-S2468054020300706-main.pdf
Size:
245.59 KB
Format:
Adobe Portable Document Format
Description:
Erratum
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: