Errors with RSEM/bowtie2
0
0
Entering edit mode
1 day ago
Olivia • 0

Hi,

I keep getting an error message after running the align_estimate_abundance step for RNA-seq, specifically using RSEM and bowtie2. I have previously run Trinity with the Trimmomatic step as well as CD-HIT-EST.

The error message is this:

CMD: set -o pipefail && bowtie2 --no-mixed --no-discordant --gbar 1000 --end-to-end -k 200  -q -X 800 -x /fs/ess/PAS1182/Olivia/NYCHA/cd_hit_est/CD_HIT_EST_NYCHA_1.fasta.bowtie2 -1 /tmp/slurmtmp.37615063/trinity_output37615063/unfixrm_1048_bed_260_317_S183_L006_R1_001.cor.fq.P.qtrim.gz -2 /tmp/slurmtmp.37615063/trinity_output37615063/unfixrm_1048_bed_260_317_S183_L006_R2_001.cor.fq.P.qtrim.gz -p 80 | samtools view -@ 80 -F 4 -S -b | samtools sort -@ 80 -n -o bowtie2.bam 
Error, fewer reads in file specified with -2 than in file specified with -1
(ERR): bowtie2-align died with signal 6 (ABRT) (core dumped)
Error, cmd: set -o pipefail && bowtie2 --no-mixed --no-discordant --gbar 1000 --end-to-end -k 200  -q -X 800 -x /fs/ess/PAS1182/Olivia/NYCHA/cd_hit_est/CD_HIT_EST_NYCHA_1.fasta.bowtie2 -1 /tmp/slurmtmp.37615063/trinity_output37615063/unfixrm_1048_bed_260_317_S183_L006_R1_001.cor.fq.P.qtrim.gz -2 /tmp/slurmtmp.37615063/trinity_output37615063/unfixrm_1048_bed_260_317_S183_L006_R2_001.cor.fq.P.qtrim.gz -p 80 | samtools view -@ 80 -F 4 -S -b | samtools sort -@ 80 -n -o bowtie2.bam  died with ret: 256 at /users/PAS1182/ofarinas621/local/src/trinityrnaseq-v2.15.2/util/align_and_estimate_abundance.pl line 729.

Some other posts have said that it could be mismatched reads, but I don't know how to fix this, given that I already did the trimming steps.

Any help would be greatly appreciated.

Thanks,

Olivia

RNA-seq bowtie2 RSEM Trinity • 274 views
ADD COMMENT
0
Entering edit mode

I have previously run Trinity with the Trimmomatic step as well as CD-HIT-EST.

I assume you did the above to generate the transcriptome, which are then using with bowtie2 after creating an index from it?

ADD REPLY
0
Entering edit mode

Yes, I already made the transcriptome with these tools, and the CD-HIT-EST fasta file is the transcript file that I am using for RSEM. Then, I used samtools and bowtie2 to make the index from the CD-HIT-EST fasta file through this step:

module load bowtie2/2.5.1

bowtie2-build CD-HIT-EST.fasta CD-HIT-EST_INDEX.fasta 

But, from my understanding, I am supposed to use the original CD-HIT-EST.fasta file (as shown above) as in the input for RSEM and bowtie2, not the index?

ADD REPLY
0
Entering edit mode

That is good confirmation. Bacl to the error message.

I don't know how to fix this, given that I already did the trimming steps.

DId you scan/trim your fastq data files independently? If so, they could be out of sync. You need to process paired-end data files together when scanning/trimming so that any read that is eliminated from one file needs to have its mate removed from the other file to keep data in sync. You can use repair.sh from BBMap tools to bring your data files back in sync or scan/trim them as a pair to keep data in sync.

ADD REPLY
0
Entering edit mode

I am not sure what you mean by this, but I believe that is what I did. With Trinity, I matched all the paired reads together, so they should have been done concurrently.

I just ran these commands below to see if the reads match, which so far, for three pairs of samples, they do not. But what is interesting is that the reverse read of one matches the forward read of different sample. Could this be an issue caused by the sequencing facility or just that they are in fact out of sync?

zcat unfixrm_1048_bed_260_317_S183_L006_R1_001.cor.fq.P.qtrim.gz | wc -l
zcat unfixrm_1048_bed_260_317_S183_L006_R2_001.cor.fq.P.qtrim.gz | wc -l
50368840
49042836

zcat unfixrm_1048_bed_260_317_S35_L007_R1_001.cor.fq.P.qtrim.gz | wc -l
zcat unfixrm_1048_bed_260_317_S35_L007_R2_001.cor.fq.P.qtrim.gz | wc -l
49042836
42063144

zcat unfixrm_1048_hall_252_325_S166_L006_R1_001.cor.fq.P.qtrim.gz | wc -l
zcat unfixrm_1048_hall_252_325_S166_L006_R2_001.cor.fq.P.qtrim.gz | wc -l
42063144
42065588
ADD REPLY

Login before adding your answer.

Traffic: 4568 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6