Counting number of mutations per read compared to reference

0

Entering edit mode

2.5 years ago

x_ma_x • 0

Apologies if question is basic, but I'm very new to NGS analysis!

I have some Miseq DNA sequencing data (in .fastq format) which I can align to sam/bam. What I would like to get from this data is how many mutations (if any) there are for every read, and ideally also the type of mutations present in every read. In other words, some quantitative measure of sequence similarity. What's the easiest way of getting such data? The objective is to assess random mutagenesis efficiency.

As an extra, I'd like to get same data but after translation of my sequence into protein (to detect nonsense/silent mutations etc.

mutation bam ngs • 600 views

ADD COMMENT • link updated 2.5 years ago by GenoMax 141k • written 2.5 years ago by x_ma_x • 0

0

Entering edit mode

@Pierre has a tool that does this: http://lindenb.github.io/jvarkit/Sam2Tsv.html

You will need to annotate the VCF/SNP's independently to get the information about effects of those SNP.

ADD REPLY • link 2.5 years ago by GenoMax 141k

Login before adding your answer.