Realign Aligned.sortedByCoord.out.bam to Aligned.toTranscriptome.out.bam for RSEM input
0
3
Entering edit mode
5.4 years ago
mkvalluru ▴ 30

Hi,

I was using STAR-RSEM to quantify isoforms (primary input fastq files). Recently I have received Aligned.sortedByCoord.out.bam from the collaborators. However, I can't use these files as an input in RSEM. I have used samtools fastq to convert bam to fastq files (paired). I have used the preexisting data set for the preliminary analysis.

samtools: samtools fastq -1 Aligned.sortedByCoord.out.R1.fastq -2 Aligned.sortedByCoord.out.R2.fastq -O -t Aligned.sortedByCoord.out.bam

STAR alignment using fastq R1&R2 (samtools generated fastq files from Aligned.sortedByCoord.out.bam) generated higher unmapped reads %

        Number of input reads | 23575225
                  Average input read length |   186
               Uniquely mapped reads number |   5556353 
                    Uniquely mapped reads % |   23.57%
             % of reads unmapped: too short |   69.93%

STAR alignment using original fastq R1&R2 generated higher Uniquely mapped reads %

          Number of input reads |   23575225 
                  Average input read length |   186
               Uniquely mapped reads number |   20455386
                    Uniquely mapped reads % |   86.77%
            % of reads unmapped: too short |    7.45%

I do not know why there is a huge difference. I do not know if I am missing any steps?

-Manoj

RNA-Seq alignment assembly genome sequencing • 3.1k views
ADD COMMENT

Login before adding your answer.

Traffic: 1709 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6