I am attempting to clean FASTQ PE data. I have decided to perform quality control on both R1 and R2 separately. My goal is to get the editing efficiency as well as find Indels and "real" mutations. I have been using the software AliView to view the FASTA alignment thus far. I've noticed some of the sequences would align if a gap was inserted at various positions.
Additionally, I have used Usegalaxy.org for the majority of my NGS processing. I begin with FASTQC, (barcode splitter), followed by Trimmomatic, then Bowtie2. Bowtie2 outputs both the BAM files and the aligned FASTQ reads. From here, I was wondering if anyone had suggestions on how to insert gaps with respect to my reference genome fasta file.
If anyone has successfully inserted gaps to align to the reference with Bowtie2, please let me know what parameters you tweaked and how you decided what to set them to!
While I can manually do this in AliView, I am working with 100,000+ sequences and that is obviously not feasible.
Below is a screenshot of aligned sequences with an example of a sequence that I would like to insert a gap for.
Please let me know if you have any suggestions. I am comfortable working in Galaxy and with Python. Thank you so much