I'm working on converting several historical reference genome FASTA files into SAM, BAM and VCF files for a project. (Alignment will be against hg19.) The primary issue is that many of these FASTA files lack per-base quality scores. One solution appears to be adding placeholder scores to these FASTA files so that the conversions can go through.
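To make the placeholder idea concrete, here is a minimal sketch of what I have in mind, in Python. It assumes the output would be FASTQ with Phred+33 encoding, and it gives every base the same quality character (`I`, which is Phred 40 in that encoding). The function names `read_fasta` and `fasta_to_fastq` and the choice of `I` are my own illustrative assumptions, not taken from any existing tool. Note that this only changes the file format; whether an aligner will accept very long records produced this way is a separate question.

```python
def read_fasta(handle):
    """Yield (name, sequence) pairs from an open FASTA text handle."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        elif name is not None:
            chunks.append(line)  # sequence may be wrapped over many lines
    if name is not None:
        yield name, "".join(chunks)


def fasta_to_fastq(fasta_handle, fastq_handle, placeholder="I"):
    """Write each FASTA record as FASTQ with a constant placeholder quality.

    `placeholder` is a single Phred+33 character (assumption: 'I' = Q40).
    """
    for name, seq in read_fasta(fasta_handle):
        quality = placeholder * len(seq)  # one quality character per base
        fastq_handle.write(f"@{name}\n{seq}\n+\n{quality}\n")
```

It would be called with two open text files, for example `fasta_to_fastq(open("in.fa"), open("out.fq", "w"))`, where both file names are hypothetical.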
Is there an existing script that can do this (add all the necessary scores to a FASTA file so that it can be processed into SAM and beyond)? If not, which scores are required for these conversions, and how can they be added (or made unnecessary)? If you have a better solution for FASTA2BAM and FASTA2VCF when scores are not included in the FASTA files, I would be glad to hear it. Thanks - Irene