I have question about compare fasta-files.
Input: I have ~11.000 fasta-files. This files are of different length, for example, some files around 29.000 bp but some files around 28.000 bp.
Output: I would like have info about unique fasta-files.
For example: We have 7 fasta-files. 1,2,5,6 and 7 have the same SNP, so the 1 fasta-file interested me. 1, 3 and 4 samples have unique SNP. In output I have three unique fasta-files.
I was thinking about bash-script, but iles are of different length. I hope that you can help me.
Thank you so much!