About FastP Tool
4 months ago
AIMAR ▴ 70

Hi everyone, I would like to ask a question about using FastP together with gatk CollectAlignmentSummaryMetrics.

I used FastP for preprocessing and set the length required option (--length_required) to 30. After running gatk CollectAlignmentSummaryMetrics to get an idea of my aligned reads, it shows that the minimum length is about 19-20 bp.


Raw Quality_control Preprocessing alignment reads • 833 views
4 months ago
GenoMax 153k

After running gatk CollectAlignmentSummaryMetrics to get an idea of my aligned reads, it shows that the minimum length is about 19-20 bp.

That probably reflects the portion of those 30+ bp reads that actually aligned to the reference you are using. The remainder of each read must be "soft-clipped" because those bases did not align; you can confirm this by checking the CIGAR string for those alignments or by viewing them in a genome viewer. If you check the length of the reads going into this alignment, they should all be 30+ bp.
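To make the arithmetic concrete, here is a minimal Python sketch (not part of the original answer) that splits a CIGAR string into full read length, aligned length, and soft-clipped length. The function name `cigar_lengths` and the example CIGAR `11S19M` are made up for illustration, and counting only M/=/X as "aligned" is an assumption; a given metrics tool may define aligned length slightly differently.

```python
import re

# CIGAR operations that consume bases of the read (query), per the SAM spec.
QUERY_OPS = set("MIS=X")
# Assumption: treat only match/mismatch operations as "aligned" bases.
ALIGNED_OPS = set("M=X")


def cigar_lengths(cigar):
    """Return (read_length, aligned_length, soft_clipped) for a CIGAR string."""
    ops = re.findall(r"(\d+)([MIDNSHP=X])", cigar)
    read_len = sum(int(n) for n, op in ops if op in QUERY_OPS)
    aligned = sum(int(n) for n, op in ops if op in ALIGNED_OPS)
    clipped = sum(int(n) for n, op in ops if op == "S")
    return read_len, aligned, clipped


# A hypothetical 30 bp read with 11 soft-clipped bases and 19 aligned bases.
print(cigar_lengths("11S19M"))  # (30, 19, 11)
```

In this made-up case the read still passes a 30 bp length filter, yet only 19 bp count as aligned, which is the kind of gap described above.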


How can I check the CIGAR string? Also, can you suggest a genome browser I can use?


Look at your alignment file records (with samtools view and such) and check the 6th field. CIGAR strings are described on page 8 of the SAM file format spec ( https://samtools.github.io/hts-specs/SAMv1.pdf ). Soft-clipped alignment CIGAR strings will begin or end with S, e.g. 15S10M.

Integrative Genomics Viewer (IGV): https://igv.org/doc/desktop/
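If scanning records by eye gets tedious, the 6th-field check can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `soft_clipped_records` is hypothetical, and the two records in `example_lines` are invented stand-ins for the tab-separated text that samtools view prints.

```python
import re


def soft_clipped_records(sam_lines):
    """Yield (read_name, cigar) for SAM records whose CIGAR has a soft clip."""
    for line in sam_lines:
        if line.startswith("@"):
            continue  # skip header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete SAM record
        name, cigar = fields[0], fields[5]  # CIGAR is the 6th field
        if re.search(r"\d+S", cigar):
            yield name, cigar


# Invented example records (11 mandatory SAM fields each).
example_lines = [
    "read1\t0\tchr1\t100\t60\t30M\t*\t0\t0\t" + "A" * 30 + "\t" + "I" * 30,
    "read2\t0\tchr1\t200\t60\t11S19M\t*\t0\t0\t" + "C" * 30 + "\t" + "I" * 30,
]

for name, cigar in soft_clipped_records(example_lines):
    print(name, cigar)
```

On real data the same function could be fed the lines of samtools view output instead of the example list.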


Understood and thanks
