SNP calling ONT sequenced files
1
0
Entering edit mode
2 hours ago
blur ▴ 280

Hi,

I have ONT files (fastq) and want to read SNPs on them. I have read some papers saying ONTs are too error-prone - is it still true?

Also, I have tried doing this on CLC workbench and couldn't - the fastq files were too big. The company support could not offer help. My sequenced isolates are bacterial - 3-6 million base genomes, so the large chromosomal contig is usually 3-6 million bases long.

Is there a good and reliable tool for ONT SNPs reading? Would it make sense to break the contigs into smaller portions or will that hurt the reliability of the results?

Thanks in advance

ONT SNPs • 68 views
ADD COMMENT
0
Entering edit mode

What is the median length of raw reads and how many do you have? Has the data been basecalled with "high" or "super" accuracy?

I have tried doing this on CLC workbench and couldn't - the fastq files were too big

If you have excessive coverage (> 50x), you could subsample the fastq data and see if CLC is able to work with that.

ADD REPLY
0
Entering edit mode
1 hour ago

The quality of (modern, SUP basecalled, but certainly not fast basecalled) ONT sequences should be fine, especially for bacterial SNP calling these days.

Have a look at deepvariant (resource-intensive) or longshot (simple) via bioconda to get started with SNP calling on long reads.

ADD COMMENT

Login before adding your answer.

Traffic: 3748 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6