Put taxonomy code abreviattion in biom table
0
0
Entering edit mode
8.6 years ago

Dear all, I want to help, if possible

How do I change the taxonomy using cut (shell), awk and seed:

my actual taxonomy, has no abbreviation code on the begin, I would like to put the code (d: domain, p: phylum, etc) in each taxonomy level,

For example

I have: Eukaryota;__Opisthokonta;__Metazoa;__Arthropoda;__Hexapoda;__Insecta

I need to look like this: d__ Eukaryota;__Opisthokonta;p__Arthropoda;__Hexapoda;c__Insecta,

I think how to connecting awk and sed pipe, but do not know how to pass the values of the variable awk to sed;

The green gene database already inserted such abbreviations, but I'm using the database silva does not contain this. its difficult the visualization of plots results. Any suggestion or help?

Thanks!

silva-database qiime awk • 1.8k views
ADD COMMENT
1
Entering edit mode

You'll end up needing to create tables of domains, phyla, etc. for this to work. That ends up making things complicated enough that you're better off using python or perl, since awk/sed start getting pretty unwieldy.

ADD REPLY

Login before adding your answer.

Traffic: 2507 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6