How To Transform The Colorspace Data Obtained From Sra Into A Format Suitable For The Tophat Rna-Seq Pipeline?
1
0
Entering edit mode
9.5 years ago
dfernan ▴ 710

Hi,

I need some help trying to know how to analyze RNA-Seq data from ABI 5500 solid sequencing data.

I need to develop the following pipeline:

2. Use SRA toolkit to transform the data into color fasta, qualities or some file format that tophat can run - here I have spent most of the time without luck! Any ideas on which command to use from the SRA toolkit to get the data in a good format for TopHat?
3. How to properly run Tophat with the previous data?

for 3, once I have a good command for the SRA part 2 I plan to run the following command for tophat, does it look good?

module load bio/bowtie-0.12.7
tophat2 --color --quals --library-type fr-secondstrand -G ucsc_mm9.gtf -o <pathtohere>/bowtie-indexes/mm9/mm9c <pathtohere>/fq/SRR534610.csfasta <pathtohere>/rnaseq/fq/SRR534610.qual


Thanks!

tophat sra geo bowtie rna-seq • 5.5k views
1
Entering edit mode

So you got several errors - don't you think that posting these might help people diagnose your problem? Are you sure your genome indexes are from a colorspace build?

0
Entering edit mode

@daniel thanks, yes I did get that error as well, then tried the color indexed still get problems. Posting on all the trials and errors I think would make this Q more like a debugging question. However, I am looking for experience and suggestions - mainly on how to use SRAtoolkit to transform SRA data to cfasta and quals format, not the alternative python stuff I tried out of ignorance regarding the SRA toolkit step, step 2 in the analysis.

1
Entering edit mode

still not clear what you are asking, what do your csfasta and quality files contain? If these are the colorspace data then there is nothing else that the SRA tools can do for you and involving them only confounds the question. If your question is how to run tophat on colorspace reads then make sure to read the manual, especially where it says that you must make sure to invoke bowtie1 in the pipeline, other issues may also apply.

1
Entering edit mode

@irtsvan Thanks! Ok I tried clarifying the question a bit more without specifying what I tried before, I think now should be better...

0
Entering edit mode

@Istvan Albert basically I have SRA files, not fastac, qual files, and I tried looking into the SRA documentation but it does not look very detailed... not clear to me which ones of the SRA tools I should use for appropiate transform of the data to bowtie/tophat suite.

2
Entering edit mode
9.5 years ago
Dan ▴ 520