running multiple RSeQC jobs
4.3 years ago
Apoorva ▴ 310

Hi all,

I want to check the integrity of my RNA-Seq samples. I usually use tin.py from the RSeQC package for this, but it's very slow. Is there an alternative tool I can use? Or is there a parameter I can add to run it on multiple BAM files simultaneously?

Thanks

RNA-Seq RSeQC tin.py • 1.9k views

Check GNU parallel, see the posts here, here and here
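
If you would rather stay in Python than use GNU Parallel, the same idea (one tin.py process per BAM file, several running at once) can be sketched as below. This is only a minimal sketch under some assumptions: `tin.py` is on your `PATH`, each BAM is sorted and indexed, and `tin_command` and `run_all` are hypothetical helper names. The `-i` (input BAM) and `-r` (reference BED12) options are the ones documented for tin.py.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def tin_command(bam, bed, tin_script="tin.py"):
    """Build the command line for one tin.py run (-i input BAM, -r reference BED12)."""
    return [tin_script, "-i", str(bam), "-r", str(bed)]


def run_all(commands, max_workers=4):
    """Run the commands concurrently and return their exit codes in input order."""

    def run_one(cmd):
        # Each worker thread just waits on an external process,
        # so threads are enough here; no multiprocessing is needed.
        return subprocess.run(cmd, check=False).returncode

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_one, commands))
```

Usage would look something like `run_all([tin_command(b, "reference.bed") for b in bam_paths], max_workers=4)`, where `reference.bed` and `bam_paths` are placeholders for your own files. Keep `max_workers` at or below the number of free cores, and lower it if the disk becomes the bottleneck.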


Is RSeQC running slowly because it is spending its time reading files or analyzing them? GNU Parallel won't get around I/O-bound work.
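
One rough way to answer that question is to compare the CPU time a run consumes with its wall-clock time. The sketch below is an illustration only: `cpu_fraction` is a hypothetical helper, it relies on `os.times()` reporting child-process CPU time (which works on Unix-like systems), and you would normally try it on a small test BAM rather than the full file.

```python
import os
import subprocess
import time


def cpu_fraction(cmd):
    """Run cmd and return (wall_seconds, cpu_seconds, cpu_seconds / wall_seconds)."""
    before = os.times()
    start = time.perf_counter()
    subprocess.run(cmd, check=False)
    wall = time.perf_counter() - start
    after = os.times()
    # CPU time (user + system) used by child processes during the run.
    cpu = (after.children_user - before.children_user) + (
        after.children_system - before.children_system
    )
    ratio = cpu / wall if wall > 0 else 0.0
    return wall, cpu, ratio
```

For a single-threaded program, a ratio close to 1 suggests the job is CPU-bound, so running several jobs side by side should help. A ratio well below 1 suggests it spends much of its time waiting, most likely on disk, and parallel jobs may help far less.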


But tin.py is quite slow at computing TIN because it processes transcripts sequentially. I have a large BAM file of ~35 GB, and it took 18 hours to process. Is there a way to speed it up using multithreading or multiprocessing? Should I split my BAM file into smaller per-chromosome BAM files and then run tin.py on each?
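
For reference, the per-chromosome split I have in mind would look roughly like the sketch below. It is untested against tin.py itself and makes a few assumptions: the BAM is coordinate-sorted and indexed, `samtools` is installed, and the helper names are hypothetical. The annotation is split as well, so that each job only evaluates transcripts on its own chromosome. The per-chromosome TIN tables would then have to be concatenated, and any sample-level summary recomputed from the combined table.

```python
from collections import defaultdict


def split_bed_by_chrom(bed_lines):
    """Group BED lines by their first column (chromosome name)."""
    groups = defaultdict(list)
    for line in bed_lines:
        # Skip blank lines and BED header/comment lines.
        if not line.strip() or line.startswith(("#", "track", "browser")):
            continue
        chrom = line.split(None, 1)[0]
        groups[chrom].append(line)
    return dict(groups)


def samtools_split_command(bam, chrom, out_bam):
    """Command that extracts one chromosome from a sorted, indexed BAM."""
    return ["samtools", "view", "-b", "-o", str(out_bam), str(bam), chrom]
```

Each per-chromosome BAM would also need its own index (`samtools index`) before tin.py can read it.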
