How to calculate the TIN score for a GSE dataset as a post-hoc quality check on RNA-seq data
4.5 years ago

I am working on the GSE102741 dataset, for which I have both the raw gene counts and the log2 RPKM values. I want to assess the quality of the dataset using PCA, following the approach in the linked paper. How can I calculate the TIN score? Are there any online tools for this? More generally, can somebody explain how to assess the quality of a dataset starting from raw gene counts or log2 RPKM values, or suggest better guidelines for this kind of quality analysis?

RNA-Seq Quality • 1.9k views
4.5 years ago

The Materials and Methods section of the paper you linked says:

The quality of the RNA-seq data was measured using the transcript integrity number (TIN) score calculated by RSeQC (version 2.6.4; tin.py) (http://rseqc.sourceforge.net/#tin-py)

I would start there.
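Note that tin.py works on aligned reads (a sorted, indexed BAM file) plus a gene model in BED12 format, not on a count or RPKM table, so you would need alignments for each sample. As a minimal sketch, assuming RSeQC is installed and `tin.py` is on your PATH, the call could be assembled like this. The file names `sample.bam` and `gene_model.bed` and the helper `build_tin_command` are placeholders for illustration, not anything from the paper.

```python
import shlex


def build_tin_command(bam_path, bed_path, tin_script="tin.py"):
    """Return the argument list for one RSeQC tin.py run.

    -i is the input BAM (sorted and indexed) and -r is the reference
    gene model in BED12 format. Both paths are placeholders here.
    """
    return [tin_script, "-i", bam_path, "-r", bed_path]


cmd = build_tin_command("sample.bam", "gene_model.bed")
print(shlex.join(cmd))

# To actually run it (requires RSeQC and the input files to exist):
# import subprocess
# subprocess.run(cmd, check=True)
```

The per-sample median TIN values can then be compared across samples, or set against the PCA components, as the paper describes.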


But tin.py is quite slow at computing TIN because it processes transcripts sequentially. I have a large BAM file of ~35 GB, and it took 18 hours to process. Is there a way to speed it up using multithreading or multiprocessing?
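One possible workaround, sketched below under stated assumptions, is to split the BED gene model into chunks and run several tin.py processes side by side, since TIN is computed per transcript. This is not a built-in RSeQC feature. The helpers `split_bed`, `run_chunk` and `parallel_tin` are hypothetical names. The sketch also assumes that tin.py names its output files after the BAM file and writes them to the current working directory, which is why each chunk runs in its own directory.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def split_bed(bed_lines, n_chunks):
    """Distribute BED records round-robin into at most n_chunks lists."""
    records = [
        ln.rstrip("\n")
        for ln in bed_lines
        if ln.strip() and not ln.startswith(("#", "track", "browser"))
    ]
    chunks = [records[i::n_chunks] for i in range(n_chunks)]
    return [c for c in chunks if c]  # drop empty chunks


def run_chunk(bam_path, chunk_records, workdir, tin_script="tin.py"):
    """Write one BED chunk into its own directory and run tin.py there."""
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    bed_path = workdir / "chunk.bed"
    bed_path.write_text("\n".join(chunk_records) + "\n")
    # Assumption: outputs land in the cwd, so a separate directory per
    # chunk keeps the runs from overwriting each other's files.
    subprocess.run(
        [tin_script, "-i", str(Path(bam_path).resolve()),
         "-r", str(bed_path.resolve())],
        cwd=workdir,
        check=True,
    )
    return workdir


def parallel_tin(bam_path, bed_path, out_root, n_chunks=8):
    """Run tin.py on each BED chunk concurrently; return the output dirs."""
    with open(bed_path) as fh:
        chunks = split_bed(fh.readlines(), n_chunks)
    if not chunks:
        return []
    # Threads suffice here: each one only waits on an external process.
    with ThreadPoolExecutor(max_workers=len(chunks)) as pool:
        futures = [
            pool.submit(run_chunk, bam_path, c, Path(out_root) / f"chunk_{i}")
            for i, c in enumerate(chunks)
        ]
        return [f.result() for f in futures]


# Example call (placeholder paths; requires RSeQC and an indexed BAM):
# parallel_tin("sample.bam", "gene_model.bed", "tin_chunks", n_chunks=8)
```

The per-transcript tables from each chunk would then need to be concatenated and the sample-level summary (for example the median TIN) recomputed from the merged table, because each chunk's own summary covers only its subset of transcripts. The speed-up may also be limited by disk I/O, since all processes read the same BAM file.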
