Remove accession numbers form nexus tree files
2
0
Entering edit mode
7.5 years ago
AB • 0

Hi,

I'm relatively new to bioinformatics and am trying to work out how to remove the accession numbers from a Nexus formatted tree.

e.g.

#NEXUS
BEGIN TREES;
    TREE 1 = (CY116033b|A/swine/England/195852/1992,(((CY038004b|A/swine/En$
END;

I want to remove every accession number between the ( and | as this is causing issues with downstream programs.

I've tried sed (commands such as sed -e 's/.*\([A-Z][A-Z][0-9][0-9][0-9][0-9][0-9][0-9]\).*/\1/g' but can't seem to get it to work.

Does anyone know how to do this?

Any help would be much appreciated!

AB

Accession numbers Nexus • 1.3k views
ADD COMMENT
0
Entering edit mode
7.3 years ago
 sed -r  's/\([A-Za-z0-9]+\|/(/g'

?

ADD COMMENT
0
Entering edit mode
7.3 years ago
$ echo "(CY116033b|A/swine/England/195852/1992,(((CY038004b|A/swine/En" | sed 's/(\w\+|//g' 
A/swine/England/195852/1992,((A/swine/En
ADD COMMENT

Login before adding your answer.

Traffic: 3998 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6