Extracting amino acid positions from a fasta sequence
0
0
Entering edit mode
3.8 years ago
akang ▴ 110

I need a suggestion regarding extracting positions from a nucleotide fasta file.

I have a fasta file like below. I am aligning the file ,then converting the sequences into amino acid sequence and finally extracting specific amino acids (say position 1,3,4) from the amino acid data. But since their is an insertion in one sequences it messes up the positions in others. So when I say aa at position 2 for abc is X, cvb is X and bnh is T. will that be correct?

fasta

> abc 
atgcatgcgggcctttt
> cvb 
atgcatgcgggcctttt
> bnh 
atgCCcatgcgggcctttt

aligned

> abc 
atg--catgcgggcctttt
> cvb 
atg--catgcgggcctttt
> bnh 
atgCCcatgcgggcctttt
fasta python • 1.0k views
ADD COMMENT
1
Entering edit mode

I think, the position should be set according to the reference. Thank you.

ADD REPLY
1
Entering edit mode

Are you sure aligning before translating makes sense?

If you have insertions and substitutions, you should translate these first. If this leads to a frameshift for example, the result would be a radically different protein.

ADD REPLY
0
Entering edit mode

Your question is not clear

ADD REPLY

Login before adding your answer.

Traffic: 1813 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6