Entering edit mode
8.4 years ago
max1415r
▴
10
I have a batch protein FASTA file. I wan to create output file with position for each amino acid with possible mutation (remaining 19 aa) in tab-delimited file.
e.g. sequence
>sp|Q6NUK1|SCMC1_HUMAN Calcium-binding mitochondrial carrier protein
MLRWLRDFVLPTAACQDAEQPTRYETLFQALDRNGDGVVDIGELQEGLRNLGIPLGQDAE
>sp|Q6KCM7|SCMC2_HUMAN Calcium-binding mitochondrial carrier protein
MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGD
Output file; (Protein ID, position, amino acid, substitution(19). It will be given for all protein(around 4000)
Q6NUK1 1 M A
Q6NUK1 1 M R
Q6NUK1 1 M N
Q6NUK1 1 M D
Q6NUK1 1 M C
Q6NUK1 1 M Q
Q6NUK1 1 M E
Q6NUK1 1 M G
Q6NUK1 1 M H
Q6NUK1 1 M I
Q6NUK1 1 M L
Q6NUK1 1 M K
Q6NUK1 1 M F
Q6NUK1 1 M P
Q6NUK1 1 M S
Q6NUK1 1 M T
Q6NUK1 1 M W
Q6NUK1 1 M Y
Q6NUK1 1 M V
Thank you. It was very helpful.