I realized that my data were not trimmed and I wish to get something out of them… I was advised "trimomatic"
So, I tested, and obtained sequences like:
>M02764:119:000000000-C5R9K:1:1101:8653:1645 1:N:0:1 GTTACTAACTTCTTAGGACCCATGATCGGGGACTGAGCAAGCTGTTGCTGAAACCAGCCGACCTGCCT GGGCCGACTAACCCTGCCCTGGCCGGCTGCAAGGTGAGGACCTGCCGCAACTCGCTGTAGATCGGA AGAGCACACGTCTGAACTCCAGTCACGCTGAAGAAGCTCGTATGCCGTCTTCTGCGTGAAAAAAAAAAAAATCAGG
I still have a lot of craps. My adaptor is GGTGAGGACCTGCCGCAACTCGCTGT Therefore I would like to obtain GTTACTAACTTCTTAGGACCCATGATCGGGGACTGAGCAAGCTGTTGCTGAAACCAGCCGACCTGCCTGGGCCGACTAACCCTGCCCTGGCCGGCTGCAA as output.
I could used "sed" but it would need a perfect match, and you know sequencers… often not reliable… Any advise?