I am analyzing miRNA sequencing data derived from a typical Illumina protocol for small RNAs. The problem is that although I have trimmed the reads for the adaptors, there are a lot of reads with length longer thatn 26bp.
What are these long reads? Should I trim the reads for hairpin sequence too? What's been actually sequenced? Else, what are the steps needed before alignining the reads to the reference sequence?