Tajima'S D Test-Ngs Data
0
1
Entering edit mode
11.6 years ago
jackuser1979 ▴ 890

I have RNA seq illumina paired-end reads mapped to reference genome using bowtie and have bam file from samtools. I am interested in calculating Tajima's D test using DNAsp. As DNAsp software accepts FASTA file, I have converted bam file into fasta format.

samtools view filename.bam | \awk '{OFS="\t"; print ">"$1"\n"$10}' - > filename.fas

I have Imported this converted fasta file into DNAsp for Neutrality-based statistical test. Does this conversion and doing calculation is correct way of doing or I should do multiple alignment using clustal to get fasta file?

biology genetics • 5.0k views
ADD COMMENT
1
Entering edit mode

I don't think you can compute a TD with just the reads, you need to generate a modified sequence with your variants and then compute the TD comparing with your original reference.

ADD REPLY
0
Entering edit mode

Why not? If the reads are aligned to the reference using bowtie & samtools, we get aligned file in bam format just like clustal alignment.

ADD REPLY
0
Entering edit mode

Yes, it seems to me that DNAsp needs aligned complete sequences to perform the TD test... check the manual. BTW, the problem will be the same when using tools like PAML...

ADD REPLY
0
Entering edit mode

I'm just wondering how to generate the modified sequence with the variant from NGS dat, I mean VCF/AM/reference mapping file. Any idea ?

ADD REPLY

Login before adding your answer.

Traffic: 2255 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6