Hi all, This is my first time learning the world of NGS. I lack a lot of knowledge. Small RNA sequencing is in progress. But we left the blood sample to the company and received raw data. (fastq file) Before receiving raw data, library sequencing caused a result of 'fail' due to mostly a small number of libraries for the entire sample, but we proceeded with sequencing and obtained a fastq file.
I have a question here. So far, I know a little about checking the quality of raw data while learning about NGS. However, I was wondering if there is a way to check the read of small RNA in raw data if the quality results of the library of small RNA appear as a 'fail' before receiving raw data for the first time like this.
To make the results reliable in future analyses, is there a way to check the miRNA read present in raw data (fastq) and check the quality?
Thank you. all. There are so many things I don't know. It may be a basic part, but please comment a lot. It's a valuable answer for me.
Thank you for your answer. IIf so, can I understand that it means that you have to mapping and verify the quality of the miRNA read in the raw data (fastq) file in the end? I have one more question. I understand that 18-30nt is appropriate for read length for small RNA seq. If you want to perform a follow-up analysis by trimming on this basis Is the sequence length in the Basic Statistics of the fastqc result report between 18 and 30? I know the concepts of bp and nt theoretically, but I don't know yet if the length of the result report means nt or bp.
bp and nt are interchangeable in my experience.
There should be some good tutorials here for smallRNA with more details: https://training.galaxyproject.org/training-material/topics/transcriptomics/
Thank you, the link is helping. Previously, it was recommended to perform 2-3 trimming rounds to verify that the miRNA read is actually the correct miRNA. Does this mean we should proceed with multiple trimming on different criteria and check the data with genome mapping?
Good. No, multiple rounds, same criteria. Perhaps with trimmomatic. Save each result set in a differerent directory and check them out with fastqc. Perhaps align to the genome to "eyeball" results. It's a bit fiddly but otherwise results can be incorrect (eg 1 bp too many) which someone is bound to point out downstream.