Cancer proteogenomics – identifying coding, long non-coding RNAs relevant in cancer

dc.check.chapterOfThesisNAen
dc.check.date2028-05-31
dc.contributor.advisorDean, Kellie
dc.contributor.advisorBaranov, Pavel V.
dc.contributor.advisorexternalMészáros, Bálint
dc.contributor.authorZaheed, Oza
dc.contributor.funderScience Foundation Ireland
dc.date.accessioned2025-02-06T12:01:03Z
dc.date.available2025-02-06T12:01:03Z
dc.date.issued2024en
dc.date.submitted2024
dc.description.abstractLong non-coding RNAs (lncRNAs) are RNA molecules exceeding 500 nucleotides that were traditionally considered non-functional and not translated (Mattick et al., 2023). However, recent advancements in next-generation sequencing have unveiled pivotal roles for lncRNAs in cellular processes, especially in disease contexts such as cancer. My work proposes that the non-coding transcriptome significantly impacts cancer cell biology, influencing aggressiveness and offering potential applications in patient diagnosis, disease stratification, and treatment. Cancer cells, characterised by altered metabolism and adaptations to cellular stress, may tip the balance between non-coding and coding functions. This shift could potentially result in the production of peptides from RNAs previously annotated as non-coding. Emerging evidence suggests that some of these 'non-coding' RNAs particularly lncRNAs may, in fact, encode proteins, often originating from small open reading frames (sORFs). Examples of these peptides generated from sORFs within non-coding transcripts span across species, from plants to humans, and many exhibit critical biological functions. My research focuses on identifying bifunctional RNAs that exhibit dual non-coding and coding functions in cancer. Leveraging RNA sequencing (RNA-seq) and ribosome profiling (Ribo-seq) datasets, I aim to challenge the conventional classification of 'noncoding' RNAs. My hypothesis posits that a single RNA molecule may serve both noncoding and coding roles, influenced by contextual cues and cellular conditions. To explore this hypothesis, I propose a multi-omic approach, aiming to identify potential peptides with pivotal roles in oncogenesis and cancer progression. This approach uncovered a total of 400 lncRNAs that were differentially expressed in malignant breast cancer cell lines. From this list of differentially expressed long non-coding RNAs, 64 candidate translated open reading frames were identified. By employing sequence analysis in combination with proteomic evidence and translated open reading frame curation, a scoring system was created to rank these candidates on the likelihood of producing a detectable microprotein. A total of nine parameters were assessed and fit into evidencebased categories including, translation, sequence information, and proteomics. This scoring system identified ten candidates most likely to produce a stable detectable microprotein with the three best scoring candidates arising from the lncRNAs ENSG00000253477, LINC02163, and MRPS30-DT. This integrated methodology holds promise for uncovering hidden coding potential within lncRNAs, offering novel insights into therapeutic targets and biomarkers. By examining the interplay between non-coding and coding functions in cancer, this research seeks to reshape our understanding of lncRNAs and their applications in cancer diagnosis and treatment.en
dc.description.statusNot peer revieweden
dc.description.versionAccepted Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationZaheed Maheswaran, O. B. 2024. Cancer proteogenomics – identifying coding, long non-coding RNAs relevant in cancer. PhD Thesis, University College Cork.
dc.identifier.endpage207
dc.identifier.urihttps://hdl.handle.net/10468/16986
dc.language.isoenen
dc.publisherUniversity College Corken
dc.relation.projectinfo:eu-repo/grantAgreement/SFI/SFI-NSF Student Mobility Programme/18/CRT/6214 (S1)/IE/18/CRT/6214 Supplement/
dc.rights© 2024, Oza Zaheed.
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectMicroproteins
dc.subjectSmall open reading frames
dc.subjectTranslation
dc.subjectlncRNA
dc.subjectBreast cancer
dc.subjectRibosome profiling
dc.subjectProtein structure prediction
dc.titleCancer proteogenomics – identifying coding, long non-coding RNAs relevant in cancer
dc.typeDoctoral thesisen
dc.type.qualificationlevelDoctoralen
dc.type.qualificationnamePhD - Doctor of Philosophyen
Files
Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
ZaheedO_PhD2024.docx
Size:
19.66 MB
Format:
Microsoft Word XML
Description:
Full Text E-thesis (Word)
Loading...
Thumbnail Image
Name:
ZaheedO_PhD2024.pdf
Size:
6.32 MB
Format:
Adobe Portable Document Format
Description:
Full Text E-thesis (edits included)
Loading...
Thumbnail Image
Name:
ZaheedO_PhD2024_Submission for examination form.pdf
Size:
1.75 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
ZaheedO_PhD2024.pdf
Size:
6.31 MB
Format:
Adobe Portable Document Format
Description:
Full Text E-thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
5.2 KB
Format:
Item-specific license agreed upon to submission
Description: