which annotation to use and is it advisable to use bam to fastq
2
0
Entering edit mode
6.8 years ago

I need to reanalyze certain dataset(rna seq paired end stranded, 100bp) that has been already analyzed using a very old pipeline using mm9 reference (for all downstream analysis) and the files have been stored as bam(after alignment)

  1. While using alignment tools like star or hisat2, which annotation should one use the very latest one for eg mm10.p7 or mm9? what are the factors that may influence analysis downstream
  2. Since i do not have access to the original instrument data or the fastq files i would have to convert bam to fastq using bedtools, is this a best practice? would i loose any information when i re align the fastqs? what are the things i need to factor if I choose to do this? incase there is soft or hard clipping will this affect the conversion to fastq?
RNA-Seq alignment • 1.5k views
ADD COMMENT
2
Entering edit mode
6.8 years ago
dyollluap ▴ 310
  1. If you have the resources I'd suggest doing both mm9 and mm10. Future proofing and also back compatible using the same pipeline.

  2. You should find the relevant sequence run details, instrument data, etc., in the bam header @RG lines which can be used to deconvolute the bams to original fastq.

ADD COMMENT
2
Entering edit mode
6.8 years ago

You also have samtools fastq to convert bam to fastq. Softclipping will not affect the fastq, but hardclipping would.

ADD COMMENT

Login before adding your answer.

Traffic: 1830 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6