Question: Which annotation to use, and is it advisable to convert BAM to FASTQ?

I need to reanalyze a dataset (RNA-seq, paired-end, stranded, 100 bp) that was previously analyzed with a very old pipeline using the mm9 reference (for all downstream analysis). The files have been stored as BAM (after alignment).

  1. When using alignment tools like STAR or HISAT2, which annotation should one use: the very latest one, e.g. mm10.p7, or mm9? What factors may influence the downstream analysis?
  2. Since I do not have access to the original instrument data or the FASTQ files, I would have to convert BAM to FASTQ using bedtools. Is this a best practice? Would I lose any information when I realign the FASTQs? What do I need to take into account if I choose to do this? If there is soft or hard clipping, will this affect the conversion to FASTQ?
rna-seq alignment • 593 views
modified 22 months ago by WouterDeCoster • written 22 months ago by novicebioinforesearcher
22 months ago, dyollluap (USA, California, Bay Area) wrote:
  1. If you have the resources, I'd suggest doing both mm9 and mm10 with the same pipeline: mm10 for future-proofing, mm9 for backward compatibility with the existing results.

  2. You should find the relevant sequencing run details, instrument data, etc., in the @RG lines of the BAM header, which can be used to deconvolute the BAMs into the original FASTQ files.
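To make the @RG suggestion concrete, here is a minimal sketch that pulls the read-group fields out of header text such as the output of `samtools view -H`. The function name `parse_read_groups` and the example header are made up for illustration, and whether your BAMs carry @RG lines at all depends on how the old pipeline wrote them.

```python
def parse_read_groups(header_text):
    """Return one dict per @RG line, mapping each tag (ID, SM, PU, ...) to its value."""
    read_groups = []
    for line in header_text.splitlines():
        if not line.startswith("@RG"):
            continue
        fields = line.split("\t")[1:]
        # Each field looks like "TAG:value". Split on the first colon only,
        # because values (e.g. PU) may contain colons themselves.
        read_groups.append(dict(f.split(":", 1) for f in fields if ":" in f))
    return read_groups


# Hypothetical header text, invented for this example.
example_header = (
    "@HD\tVN:1.4\tSO:coordinate\n"
    "@SQ\tSN:chr1\tLN:1000\n"
    "@RG\tID:lane1\tSM:sampleA\tPL:ILLUMINA\tPU:FLOWCELL1.1\n"
    "@RG\tID:lane2\tSM:sampleA\tPL:ILLUMINA\tPU:FLOWCELL1.2\n"
)

for rg in parse_read_groups(example_header):
    print(rg["ID"], rg.get("SM"), rg.get("PU"))
```

If several read groups show up, the reads can be split per read group before realignment so that each lane or run ends up in its own FASTQ pair.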

22 months ago, WouterDeCoster (Belgium) wrote:

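You can also use samtools fastq to convert BAM to FASTQ. Soft clipping will not affect the FASTQ, but hard clipping would: soft-clipped bases are still stored in the read sequence of the BAM record, whereas hard-clipped bases are removed from it.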
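A small sketch of why the two kinds of clipping behave differently, assuming you have the FLAG, SEQ, QUAL and CIGAR fields of a SAM record as plain strings and integers. The helper names are hypothetical. It also shows the one transformation needed to get back the read as sequenced: records aligned to the reverse strand (FLAG bit 0x10) are stored reverse-complemented.

```python
import re

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def hard_clipped_bases(cigar):
    """Number of bases missing from SEQ because of hard clipping (H)."""
    return sum(int(n) for n, op in CIGAR_OP.findall(cigar) if op == "H")


def stored_read_length(cigar):
    """Number of bases present in SEQ: M, I, S, = and X consume the read; H does not."""
    return sum(int(n) for n, op in CIGAR_OP.findall(cigar) if op in "MIS=X")


def original_read(flag, seq, qual):
    """Return (seq, qual) in the orientation the sequencer produced."""
    if flag & 0x10:  # read is stored reverse-complemented
        return seq.translate(COMPLEMENT)[::-1], qual[::-1]
    return seq, qual


# Soft clip: all 100 bases are still in the record.
print(stored_read_length("5S95M"), hard_clipped_bases("5S95M"))
# Hard clip: only 95 bases are stored, 5 cannot be recovered from this record.
print(stored_read_length("5H95M"), hard_clipped_bases("5H95M"))
```

Counting records with a non-zero `hard_clipped_bases` value is a quick way to check whether a given BAM can be converted back without losing bases.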
