PacBio CCS vs Subreads Explained?

4

10.9 years ago

curious.genome ▴ 40

Hi guys,

I've been trying to understand PacBio data for a while now, as I think it's the way forward for our lab. I'm still having trouble understanding CCS, though; my biology is quite shaky.

Could someone explain to a computer scientist how PacBio gets its CCS reads from the subreads or long reads?
Our collaborators with the PacBio data gave us a set of files: subreads.fastq, CCS.fastq and long_reads.fastq. When I ran a FastQC report on the subreads, I got base quality scores of around 10-15, whereas with CCS the quality scores varied more, with values between 30 and 40. Is this expected?
Which of these files should I use to perform assembly? I'm guessing it's CCS.fastq.

Thanks for your help!

• 16k views

ADD COMMENT • link updated 10.4 years ago by Biostar 20 • written 10.9 years ago by curious.genome ▴ 40

2

You asked the same question at SEQanswers, where it was answered nicely. See http://seqanswers.com/forums/showthread.php?t=34790

ADD REPLY • link 10.8 years ago by lexnederbragt ★ 1.3k

0

Question was helpful. Thank you.

ADD REPLY • link 10.4 years ago by Prakki Rama ★ 2.7k
