Alternative of ORFFinder for multiple nucleotide sequences
1
0
Entering edit mode
4.1 years ago

I have a file that contains multiple sequences in Fasta format. I am interested to translate them and want only protein sequences for the longest ORF. I can do this using ORFFinder but I have to process each sequence individually as it doesn't support multiple sequences. Is there any alternative of ORFFinder that support multiple sequences.?

Sample input:

>BnaA02g12810D

ATGGCTTCCGTTATGCTCTCTTCCGCTACAATGGCCTCTTCTCCGGCTCATGCCACAATGGTCGCACCATTCATCGGACTTAAGTCCTCTGCTGCTTTCCCAGTGACATGTAAGGCCAACACCAAAGTTACTTCCATCACAAGCAACGGCGGAAGAGTTAACTGCATGAAGGTGTGGCCTCCAGTTGGCAAGAAGAAGTTTGAGACTCTCTCTTACCTTCCTGACCTTACCGATGTCGAAATAGCCAAGGAAGTTGACTACCTTATCCGCAACAAGTGGACTCCATGTATTGAATTCGAGTTGGAGCACGGTTTTGTATACCGTGAGCATGGAAACATCCCTGGATACTATGATGGACGATACTGGACAATGTGGAAGCTTCCTTTGTTCGGATGTACTGACTCAGCTCAGGTGTTGAAGGAAGTGCAAGAATGCAAAAAGGAGTACCCCAACGCCTTCATTAGGATCATCGGATTCGACAACAATCGTCAAGCCCAGTGCATCAGTTTCATCGCCTACAAGCCACCAAGCTTCACTAATGCTTAATTACACAGCTTCATTGCTTTGTGTAAACAACAAAACTTTATCCTTCCCTGCCTTTGATTTATCATCTTTTTATATATTTTATCTTTTGTTGTAATTTCCGGATTTAATCTTTGTTTTCCGGGTTGCAAGATATTTTCTTTTGGGTCCTCAAATGTCCTAAAAAATAAATATGTAATGTTATAAAAATATATTATTTTGAATTTTG

>BnaA02g12810D_S31

ATGGCTTCCGTTATGCTCTCTTCCGCTACAATGGCCTCTTCTCCGGCTCATGCCACAATGGTCGCACCATTCATCGGACTTAAGTCCTCTGCTGCTTTCCCAGTGACATGTAAGGCCAACACCAAAGTTACTTCCATCACAAGCAACGGCGGAAGAGTTAACTGCATGAAGGTGTGGCCTCCAGTTGGCAAGAAGAAGTTTGAGACTCTCTCTTACCTTCCTGACCTTACCGATGTCGAAATAGCCAAGGAAGTTGACTACCTTATCCGCAACAAGTGGACTCCATGTATTGAATTCGAGTTGGAGCACGGTTTTGTATACCGTGAGCATGGAAACATCCCTGGATACTATGATGGACGATACTGGACAATGTGGAAGCTTCCTTTGTTCGGATGTACTGACTCAGCTCAGGTGTTGAAGGAAGTGCAAGAATGCAAAAAGGAGTACCCCAACGCCTTCATTAGGATCATCGGATTCGACAACAATCGTCAAGCCCAGTGCATCAGTTTCATCGCCTACAAGCCACCAAGCTTCACTAATGCTTAATTACACAGCTTCATTGCTTTGTGTAAACAACAAAACTTTATCCTTCCCTG

Desired output:

>BnaA02g12810D
MASVMLSSATMASSPAHATMVAPFIGLKSSAAFPVTCKANTKVTSITSNGGRVNCMKVWPPVGKKKFETLSYLPDLTDVEIAKEVDYLIRNKWTPCIEFELEHGFVYREHGNIPGYYDGRYWTMWKLPLFGCTDSAQVLKEVQECKKEYPNAFIRIIGFDNNRQAQCISFIAYKPPSFTNA

>BnaA02g12810D_S31
MASVMLSSATMASSPAHATMVAPFIGLKSSAAFPVTCKANTKVTSITSNGGRVNCMKVWPPVGKKKFETLSYLPDLTDVEIAKEVDYLIRNKWTPCIEFELEHGFVYREHGNIPGYYDGRYWTMWKLPLFGCTDSAQVLKEVQECKKEYPNAFIRIIGFDNNRQAQCISFIAYKPPSFTNA
RNA-Seq ORF translation • 1.0k views
ADD COMMENT
2
Entering edit mode
4.1 years ago
GenoMax 141k

You could download linux version available here and run that in a loop for your sequences. There is also EMBOSS getorf program that may be of interest.

ADD COMMENT
0
Entering edit mode

Many thanks, Can you please suggest me how I can design loop for it?

ADD REPLY

Login before adding your answer.

Traffic: 2526 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6