Question: Sub-sampling RNA-seq data
9 days ago, mohammedtoufiq91 wrote:


Does sub-sampling an unaligned FASTQ file give results similar to sub-sampling the corresponding aligned SAM/BAM file? Which of the two is the better place to sub-sample?

Thank you, Toufiq

Tags: downsampling, rna-seq, sam, bam, fastq

Not part of your question, but if you are interested in expression, the easiest option would probably be to sample the counts file. I think all approaches give more or less the same result, but you have not told us anything about the aim of the analysis.

written 9 days ago by WouterDeCoster
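To illustrate the suggestion of sampling the counts file, here is a minimal sketch, assuming the counts for one sample are available as a gene-to-count mapping. The function name `downsample_counts` and the toy numbers are made up for illustration; `random.sample(..., counts=...)` needs Python 3.9 or later. Reads are drawn without replacement, which mimics removing counted reads at random.

```python
import random
from collections import Counter


def downsample_counts(counts, fraction, seed=0):
    """Keep roughly `fraction` of the counted reads, drawn without replacement.

    `counts` maps gene name -> integer read count (hypothetical input format).
    """
    rng = random.Random(seed)          # fixed seed so the draw is reproducible
    genes = list(counts)
    total = sum(counts.values())
    target = round(total * fraction)   # number of reads to keep
    # counts= treats each gene as repeated by its read count,
    # so every counted read has the same chance of being kept.
    picked = rng.sample(genes, k=target, counts=[counts[g] for g in genes])
    sampled = Counter(picked)
    return {g: sampled.get(g, 0) for g in genes}


# Toy example: gene names from this thread plus placeholders, invented counts.
toy = {"HBB": 600, "HBA1": 150, "HBA2": 150, "GENE_X": 80, "GENE_Y": 20}
half = downsample_counts(toy, 0.5, seed=1)
print(half)
```

This draws one gene label per kept read, so for tens of millions of reads a vectorised hypergeometric or binomial draw would probably be faster.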

Thank you. We have generated a set of RNA-seq samples from blood (not globin-depleted). These are human paired-end samples with a read length of 150 bp. After alignment against the hg19 genome, the alignment rate ranges from 84% to 91% across samples. Quantification gave a rough estimate that 30-62% of the read counts correspond to the globin genes (HBA1, HBA2, HBB); these would be filtered out before normalization. After this, differential expression and fold-change comparison between three subjects would be performed. We are currently doing a technical validation to check whether the RNA-seq analysis and results are comparable at a depth of 100M versus 50M reads.

written 9 days ago by mohammedtoufiq91
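For the 100M versus 50M comparison on paired-end data, the main thing to get right when sub-sampling at the FASTQ level is that both mates of a pair are kept or dropped together. Below is a minimal sketch of that idea, assuming plain 4-line FASTQ records and the same read order in both files; the helper names (`read_records`, `subsample_pairs`, `toy_fastq`) and the toy reads are hypothetical. Existing tools such as seqtk and samtools offer seeded sub-sampling and would normally be the practical choice.

```python
import io
import random
from itertools import islice


def read_records(handle):
    """Yield FASTQ records as 4-line tuples (assumes unwrapped 4-line records)."""
    while True:
        record = tuple(islice(handle, 4))
        if not record:
            return
        yield record


def subsample_pairs(r1, r2, out1, out2, fraction, seed=0):
    """Keep each read pair with probability `fraction`; mates stay in sync."""
    rng = random.Random(seed)  # one generator decides for both mates
    kept = 0
    for rec1, rec2 in zip(read_records(r1), read_records(r2)):
        if rng.random() < fraction:
            out1.writelines(rec1)
            out2.writelines(rec2)
            kept += 1
    return kept


def toy_fastq(mate, n):
    """Build an in-memory FASTQ with n invented reads for the given mate."""
    text = "".join(f"@read{i}/{mate}\nACGT\n+\nIIII\n" for i in range(n))
    return io.StringIO(text)


out1, out2 = io.StringIO(), io.StringIO()
kept = subsample_pairs(toy_fastq(1, 100), toy_fastq(2, 100),
                       out1, out2, fraction=0.5, seed=42)
print(kept, "pairs kept out of 100")
```

With real files the in-memory handles would be replaced by opened FASTQ files (for example via `gzip.open(path, "rt")`). The kept number of pairs is only approximately `fraction` of the input, since each pair is decided independently.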

I think this could be useful:

written 9 days ago by WouterDeCoster

Thank you. This was helpful.

written 8 days ago by mohammedtoufiq91