Entering edit mode
2.0 years ago
Sarah
▴
10
Hi everyone, I'm processing BAM files using htseq-count and it takes very long time to produce the counts for each file. It is about pair-end reads (around 50 million sequence each). It takes 75 minutes to count this pair; is that normal? Thanks.
htseq-count --max-reads-in-buffer=24000000000 -s no -r pos -t exon -i gene_id -f bam Sample1_Aligned.sortedByCoord.out.bam Homo_sapiens.GRCh38.106.gtf > Sample1-output_basename.counts
Thank you. I used
featureCounts
as you suggested. However, the outputcount.txt
is a text file. The question now is how can I read it in R? Do you have any suggestion?featureCounts is so much faster! Thank you, ATPoint!