Entering edit mode
                    5.7 years ago
        waqaskhokhar999
        
    
        ▴
    
    160
    I have a file that contains multiple sequences in Fasta format. I am interested to translate them and want only protein sequences for the longest ORF. I can do this using ORFFinder but I have to process each sequence individually as it doesn't support multiple sequences. Is there any alternative of ORFFinder that support multiple sequences.?
Sample input:
>BnaA02g12810D
ATGGCTTCCGTTATGCTCTCTTCCGCTACAATGGCCTCTTCTCCGGCTCATGCCACAATGGTCGCACCATTCATCGGACTTAAGTCCTCTGCTGCTTTCCCAGTGACATGTAAGGCCAACACCAAAGTTACTTCCATCACAAGCAACGGCGGAAGAGTTAACTGCATGAAGGTGTGGCCTCCAGTTGGCAAGAAGAAGTTTGAGACTCTCTCTTACCTTCCTGACCTTACCGATGTCGAAATAGCCAAGGAAGTTGACTACCTTATCCGCAACAAGTGGACTCCATGTATTGAATTCGAGTTGGAGCACGGTTTTGTATACCGTGAGCATGGAAACATCCCTGGATACTATGATGGACGATACTGGACAATGTGGAAGCTTCCTTTGTTCGGATGTACTGACTCAGCTCAGGTGTTGAAGGAAGTGCAAGAATGCAAAAAGGAGTACCCCAACGCCTTCATTAGGATCATCGGATTCGACAACAATCGTCAAGCCCAGTGCATCAGTTTCATCGCCTACAAGCCACCAAGCTTCACTAATGCTTAATTACACAGCTTCATTGCTTTGTGTAAACAACAAAACTTTATCCTTCCCTGCCTTTGATTTATCATCTTTTTATATATTTTATCTTTTGTTGTAATTTCCGGATTTAATCTTTGTTTTCCGGGTTGCAAGATATTTTCTTTTGGGTCCTCAAATGTCCTAAAAAATAAATATGTAATGTTATAAAAATATATTATTTTGAATTTTG
>BnaA02g12810D_S31
ATGGCTTCCGTTATGCTCTCTTCCGCTACAATGGCCTCTTCTCCGGCTCATGCCACAATGGTCGCACCATTCATCGGACTTAAGTCCTCTGCTGCTTTCCCAGTGACATGTAAGGCCAACACCAAAGTTACTTCCATCACAAGCAACGGCGGAAGAGTTAACTGCATGAAGGTGTGGCCTCCAGTTGGCAAGAAGAAGTTTGAGACTCTCTCTTACCTTCCTGACCTTACCGATGTCGAAATAGCCAAGGAAGTTGACTACCTTATCCGCAACAAGTGGACTCCATGTATTGAATTCGAGTTGGAGCACGGTTTTGTATACCGTGAGCATGGAAACATCCCTGGATACTATGATGGACGATACTGGACAATGTGGAAGCTTCCTTTGTTCGGATGTACTGACTCAGCTCAGGTGTTGAAGGAAGTGCAAGAATGCAAAAAGGAGTACCCCAACGCCTTCATTAGGATCATCGGATTCGACAACAATCGTCAAGCCCAGTGCATCAGTTTCATCGCCTACAAGCCACCAAGCTTCACTAATGCTTAATTACACAGCTTCATTGCTTTGTGTAAACAACAAAACTTTATCCTTCCCTG
Desired output:
>BnaA02g12810D
MASVMLSSATMASSPAHATMVAPFIGLKSSAAFPVTCKANTKVTSITSNGGRVNCMKVWPPVGKKKFETLSYLPDLTDVEIAKEVDYLIRNKWTPCIEFELEHGFVYREHGNIPGYYDGRYWTMWKLPLFGCTDSAQVLKEVQECKKEYPNAFIRIIGFDNNRQAQCISFIAYKPPSFTNA
>BnaA02g12810D_S31
MASVMLSSATMASSPAHATMVAPFIGLKSSAAFPVTCKANTKVTSITSNGGRVNCMKVWPPVGKKKFETLSYLPDLTDVEIAKEVDYLIRNKWTPCIEFELEHGFVYREHGNIPGYYDGRYWTMWKLPLFGCTDSAQVLKEVQECKKEYPNAFIRIIGFDNNRQAQCISFIAYKPPSFTNA
Many thanks, Can you please suggest me how I can design loop for it?