Where to get raw rna sequencing files for a large database of cancer tissues/cell lines?
1
0
Entering edit mode
2.0 years ago

Hello,

I'd like some general advice on where to start a project. I am interested in the expression of a particular type satellite DNA in disease states. Normally, this satellite dna is heterochromatin and silenced. But it does seem to be abnormally expressed as a lncRNA in various disease states, notably some cancers.

My goal is to see if this satellite lncRNA is expressed in certain types of cancer.

This is not an actual RNA that's included in any reference transcriptome, It's also a simple pentamer repeat. So I would need to access the raw reads from sequencing experiments and align to some ideal repeat.

I see there are many massive databases that could be useful here. Has anyone needed to access/process raw reads from these databases? I'm wondering what will be the best bang for my buck (time spent...).

Any help, experience, intuition would be very valuable!

transcriptome cancer RNA transcriptomics • 529 views
ADD COMMENT
0
Entering edit mode
2.0 years ago
Vitor1 ▴ 120

I don't know if i get this correctly, but for getting the raw reads there is plenty of options.

There is the SRA (https://www.ncbi.nlm.nih.gov/sra), ENA (https://www.ebi.ac.uk/ena/browser/), EGA (https://ega-archive.org/), dbGAP (https://www.ncbi.nlm.nih.gov/gap/), these are the most used, along with TCGA and contain all kind of data, RNA-SEQ, WES, etc. And there are many other options available (https://en.wikipedia.org/wiki/List_of_databases_for_oncogenomic_research, though I didn't read the wiki page).

There is also a great post on how to access these files (How to get FASTQ reads from the Short Read Archive (SRA))

ADD COMMENT

Login before adding your answer.

Traffic: 1858 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6