Question: which annotation to use and is it advisable to use bam to fastq
gravatar for novicebioinforesearcher
2.4 years ago by

I need to reanalyze certain dataset(rna seq paired end stranded, 100bp) that has been already analyzed using a very old pipeline using mm9 reference (for all downstream analysis) and the files have been stored as bam(after alignment)

  1. While using alignment tools like star or hisat2, which annotation should one use the very latest one for eg mm10.p7 or mm9? what are the factors that may influence analysis downstream
  2. Since i do not have access to the original instrument data or the fastq files i would have to convert bam to fastq using bedtools, is this a best practice? would i loose any information when i re align the fastqs? what are the things i need to factor if I choose to do this? incase there is soft or hard clipping will this affect the conversion to fastq?
rna-seq alignment • 687 views
ADD COMMENTlink modified 2.4 years ago by WouterDeCoster42k • written 2.4 years ago by novicebioinforesearcher50
gravatar for dyollluap
2.4 years ago by
USA, California, Bay Area
dyollluap300 wrote:
  1. If you have the resources I'd suggest doing both mm9 and mm10. Future proofing and also back compatible using the same pipeline.

  2. You should find the relevant sequence run details, instrument data, etc., in the bam header @RG lines which can be used to deconvolute the bams to original fastq.

ADD COMMENTlink written 2.4 years ago by dyollluap300
gravatar for WouterDeCoster
2.4 years ago by
WouterDeCoster42k wrote:

You also have samtools fastq to convert bam to fastq. Softclipping will not affect the fastq, but hardclipping would.

ADD COMMENTlink written 2.4 years ago by WouterDeCoster42k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1817 users visited in the last hour