How to use awk to look for the lowest e-value field?

Off topic:How to use awk to look for the lowest e-value field?

0

Entering edit mode

6.4 years ago

jxi21 • 0

Hello!, I am trying to parse some results given by HMMER and in the tblout file I was able to isolate the matches I want.

Nonetheless, the same value is being repeated several times if it matches to just one profile.

For example, this is one read is repeated 3 times:

SRR6033660.161030 FAM007172 4e-15 4.2e-15 63.4 63.4
SRR6033660.1458607 FAM019859 2.5e-12 2.7e-12 55.0 54.9
SRR6033660.1458607 FAM015326 4e-14 4.2e-14 58.8 58.7
SRR6033660.1458607 FAM000764 7.5e-25 8.1e-25 94.6 94.5

It matches to 3 families, nonetheless I just want to select the row which has the lowest e-values (3rd and 4th columns)

How can I write an awk command that gives me this output?

SRR6033660.161030 FAM007172 4e-15 4.2e-15 63.4 63.4
SRR6033660.1458607 FAM000764 7.5e-25 8.1e-25 94.6 94.5

Thanks!

awk sequencing • 620 views

ADD COMMENT • link updated 6.4 years ago by Ram 45k • written 6.4 years ago by jxi21 • 0

This thread is not open. No new answers may be added