Dear all can anyone tell me how to parse this type of file using either perl or awk
AUO97_RS0005
AUO97_RS0005 alpha hydrolase wp_567465 GI:54365463
AUO97_RS0007
AUO97_RS0007 beta hydrolase wp_567465 GI:65456475
AUO97_RS0020
AUO97_RS0020 gamma hydrolase wp_567465 GI:4536473
I want to retrieve only only those values having data in next columns and remove duplicates. Output file:
AUO97_RS0005 alpha hydrolase wp_567465 GI:54365463
AUO97_RS0007 beta hydrolase wp_567465 GI:65456475
AUO97_RS0020 gamma hydrolase wp_567465 GI:4536473
Your formatting makes any effort on our side impossible. Please put the example into code blocks to preserve new lines and other formatting.
Your question is unanswerable as written.
file is like this: