Hi everyone, and thanks in advance! I'm used to doing lots of trimming, substituting, etc on large FASTQ/A files, but now I need to add sequence arbitrarily at the beginning of all reads and I'm coming up short! Been searching a couple hours for a method via toolkit (fastx_toolkit, BBmap, etc.) or simple command (sed, awk, etc.).
So I'm looking to go from something like this:
>header GTCTCAGATCGGAAGAGCACACGT >header CCGGTCCTGGTTGCAGATCGGAAG >header GTATCTCCTAAGATATAACAGGTTG >header AGGTACAGGTTGGATGATAAGTCC
>header AAAAAAGTCTCAGATCGGAAGAGCACACGT >header AAAAAACCGGTCCTGGTTGCAGATCGGAAG >header AAAAAAGTATCTCCTAAGATATAACAGGTTG >header AAAAAAAGGTACAGGTTGGATGATAAGTCC
Alternatively, I can do the same with FASTQ files (also extending the quality lines to match), if there's already a tool out there for that. I'm not interested at quality at this point, as I've already merged paired-end reads with PandaSeq and filtered out anything but the highest quality reads.