Question: Extract each annotation in 'info' column in snpEff output
1
gravatar for jan
3.6 years ago by
jan110
Malaysia
jan110 wrote:

Hi,

I have used snpEff to annotate my vcf file. I previously use Annovar to annotate my vcf file and the  fields are nicely separated in tabular forms, hence making it easy to extract information for further analysis.

snpEff annotation is written in 'info' column in one line separated by a pipe ' | '  , making it difficult to extract certain information . Is there a tool to separate each annotation into separate tab ? I tried to write a python script (with a very limited knowledge about programming ) but it gets messy .

sequencing output snpeff • 1.9k views
ADD COMMENTlink written 3.6 years ago by jan110
1

awk would probably be able to do it efficiently. Exactly which information do you want? Do you need the tab-delimited stuff at the beginning, or the semi-colon-delimited values, etc?

A simple way would be to just to specify your field separators at the start of an awk command, and split everything into tab-delimited columns.

ADD REPLYlink modified 3.6 years ago • written 3.6 years ago by Joseph Pearson430
2

I just found out about snpSift and the tool suits my work

ADD REPLYlink written 3.6 years ago by jan110
1

That looks like a useful tool, thank you for making me aware of snpSift!

ADD REPLYlink written 3.6 years ago by Joseph Pearson430
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 811 users visited in the last hour