Question: bam-readcount and paired end data
0
gravatar for chrys
2.4 years ago by
chrys40
Germany
chrys40 wrote:

EDIT1: Variant calling was done with two commonly used variant callers ( VarScan and Strelka ). The data is paired end RNA-Seq data. The presumptive variants are from exome sequencing.

Hi guys,

so I am trying to count reads over a presumptive SNP position with their respective nucleotides.
E.g the REF is A, the ALT is C and the position has 50 reads for A, and 23 reads for C.

I am currently using bam-readcount for this purpose but I am little worried that my results might be skewed.

I am note sure if ( because I did not find it in the documentation ) bam-readcount handles paired-end data.

For example if two overlapping mates cover the base, and they both show the ALT base, shouldn't they be technically only counted as one occurrence since the fragment has only been sequenced once but from two sites ?

What happens if one mate only does overlap the ALT position ? How is that counted ?

I assume that bam-readcount is aware of the pairs and counts overlapping pairs at a position as one while single read also counts as 1.

After doing some checking, I found that igv count always reports about twice as many reads at a given position as bam-readcount which leads me to believe that possibly, bam-readcount is actually doing what I assumed but I am not sure.

Maybe some of you have an idea if I am right or can suggest a different approach.

Or maybe I am entirely wrong and you guys can point me in the right direction.

Thanks!

snp rna-seq • 846 views
ADD COMMENTlink modified 2.4 years ago • written 2.4 years ago by chrys40
0
gravatar for toralmanvar
2.4 years ago by
toralmanvar900
toralmanvar900 wrote:

I think better approach would be to first call the SNP from BAM file using software like samtools or GATK. VCF file thus generated after calling SNP posses the information of number of reads representing your reference and alternate allele along with many more important information.

ADD COMMENTlink written 2.4 years ago by toralmanvar900

I added some information as an edit. Especially the fact, that the SNPs were discovered using exome data and variant callers. I am trying to check if these SNPs are actually expressed.

ADD REPLYlink written 2.4 years ago by chrys40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1764 users visited in the last hour