Question: How to fetch sequences from Proteinortho5 output containing all test species and no duplication in each genome
0
gravatar for saadleeshehreen
7 months ago by
saadleeshehreen60 wrote:

Hi, I want to construct a phylogenic tree on 100 Pseudomonas aeruginosa genomes. Before constructing the tree, I want to first cluster those genomes on the basis of homology and for this purpose, I am using ProteinOrtho5 software. After running the software with synteny option I want to extract protein sequences from the output those only containing all test species and no duplication in each genome. I understand I need to run grab_protein.pl on myproject.poff to do this but how can customize/filter the output before running grab_protein.pl? I followed as following proteinOrtho out put help required But grep '^4\t4' output.proteinortho didn't print anything for me. As I tested the software on 3 species, I modified it from 3 to 4. Anyone help about the filtering? Cheers

proteinortho5 • 260 views
ADD COMMENTlink modified 25 days ago by AlishaQ0 • written 7 months ago by saadleeshehreen60
0
gravatar for h.mon
7 months ago by
h.mon26k
Brazil
h.mon26k wrote:

If you run on three species, your command should be:

grep '^3\t3' output.proteinortho
ADD COMMENTlink written 7 months ago by h.mon26k
0
gravatar for AlishaQ
25 days ago by
AlishaQ0
AlishaQ0 wrote:

grep $'^3\t3' output.proteinortho

ADD COMMENTlink written 25 days ago by AlishaQ0

My flavor of Linux required the $ character in front - yours might too.

ADD REPLYlink written 10 days ago by AlishaQ0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 915 users visited in the last hour