I want to know the standard value of quality score or sequence length in trimming by fastx_toolkit.
In my RNA-Seq textbook, author trim fastq sequences whose base quality scores are under 20 and lengths become under 30bp (original length is 100 bp). Is that the standard value or changes case by case ?
If there is standard value of quality score or sequence length that many biologist adopt, please tell me that value. Also, if it changes case by case, please tell me how biologist judge and decide its value.
There is no standard for these things since every dataset is different. You will need to experiment and find out what your dataset looks like on QC. If you have a good reference to align to, you may be able to use data down to Q15.
If possible omit using
fastx_toolkitwhich, is old by NGS standards.bbduk.shfrom BBMap suite,trimmomaticorcutadaptare all great alternatives.