Question: Extract mutations from fasta sequences
gravatar for dasilvauirajan
10 weeks ago by
dasilvauirajan20 wrote:

I have a large amount of align protein sequences in the .fasta forma, and a reference sequence, every of that has the same length. I would like to extract only the amino acid mutations from these sequences, so that, in the end, I want to have a list that looks something like this: I456L, W675T, etc . Is there a software or any way to do this? Thankful

mutations sequence fasta • 172 views
ADD COMMENTlink modified 10 weeks ago by Pierre Lindenbaum128k • written 10 weeks ago by dasilvauirajan20

Pierre has a complete solution but in case that does not work you could use blastp with -outfmt 3 which will identify the difference and output it so.

Subject_1  181  ..............................V.............................  240

Biopython blast parser may be able to help finish the rest.

ADD REPLYlink modified 10 weeks ago • written 10 weeks ago by genomax84k
gravatar for Pierre Lindenbaum
10 weeks ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum128k wrote:

Using bioalcidaejdk; and a fasta file where the very first sequence is the reference:

java -jar dist/bioalcidaejdk.jar -e 'FastaSequence ref=null; while(iter.hasNext()) { final FastaSequence seq =; if(ref==null) { ref=seq; } else { for(int i=0;i< seq.length() && i< ref.length();i++) { char aa1 = ref.charAt(i); char aa2 = seq.charAt(i); if(aa1!=aa2) println(seq.getName()+"\t"+aa1+(i+1)+aa2); } } }'  input.fasta
ADD COMMENTlink written 10 weeks ago by Pierre Lindenbaum128k

Pierre Lindenbaum I cannot find the fold dist, and bioalcidaejdk.jar too. There is a file named in the bioalcidae folder.

ADD REPLYlink modified 7 weeks ago • written 7 weeks ago by dasilvauirajan20

Did you follow install instructions:

Requirements / Dependencies

java compiler SDK 11. Please check that this java is in the ${PATH}. Setting JAVA_HOME is not enough : (e.g: )

Download and Compile

$ git clone ""
$ cd jvarkit
$ ./gradlew bioalcidaejdk
ADD REPLYlink modified 7 weeks ago • written 7 weeks ago by genomax84k

Thank you both genomax and Pierre Lindenbaum. I had a problem with my JDK compiler, just solved it, i get running and everything went well!!!

ADD REPLYlink written 7 weeks ago by dasilvauirajan20
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1760 users visited in the last hour