filtering out low quality sequences from Sanger Sequencing
1
0
Entering edit mode
6.0 years ago

Hello! I am currently facing a challenge of filtering out low quality sequences (output from Sanger sequecing, roughly I have 300 sequences, command line are very much appreciated).

Below, I explain what I have done so far, and I am looking for any hints/tips to get this working.

1- conversion ab1 to fastq " for filename in *.ab1; do seqret $filename fastq_files/$filename.fastq -osformat fastq; done"

2- trimming sequencing "for filename in *.fastq; do fastx_trimmer -f 30 -l 600 -i $filename -o t_$filename ; done"

3- trying to filter the sequences using " quality fastq_quality_filter -i input.fastq -o output/output.fastq -q 20 -v -Q 33 " and I got the following error message,

stack smashing detected : fastq_quality_filter terminated zsh: abort (core dumped) fastq_quality_filter -i PosKonR.fastq -o output/Pos.fastq -q 20 -v -Q 33

I changed to -Q 64 and got " fastq_quality_filter: Invalid quality score value (char '&' ord 38 quality value -26) on line 4 "

Many thanks in advance for any help on this

sequence sanger • 2.3k views
ADD COMMENT
0
Entering edit mode

Use Staden or Phred/Phrap. I think there are newer tools, but I am not up-to-date.

ADD REPLY
0
Entering edit mode

ClinQC which may be able to help. There is a also convertrix. It can automatically trim the untrusted regions (low quality bases) at the end of samples per its web page.

ADD REPLY
1
Entering edit mode
6.0 years ago

Thank you all, The issue was with output format from seqret. I have used instead fasta_to_fastq.py available here.

ADD COMMENT

Login before adding your answer.

Traffic: 2893 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6