I need to reanalyze certain dataset(rna seq paired end stranded, 100bp) that has been already analyzed using a very old pipeline using mm9 reference (for all downstream analysis) and the files have been stored as bam(after alignment)
- While using alignment tools like star or hisat2, which annotation should one use the very latest one for eg mm10.p7 or mm9? what are the factors that may influence analysis downstream
- Since i do not have access to the original instrument data or the fastq files i would have to convert bam to fastq using bedtools, is this a best practice? would i loose any information when i re align the fastqs? what are the things i need to factor if I choose to do this? incase there is soft or hard clipping will this affect the conversion to fastq?