Question: Using 10X Chromium linked reads for genome gap-filling
gravatar for daren.card
12 months ago by
daren.card20 wrote:

I have about 75x coverage of 10X Genomics Chromium data for a non-model reptile species. I've used this to produce a genome assembly using Supernova and am currently using similar coverage of Hi-C data to scaffold. I expect this scaffolding process to result in assembly gaps of various lengths beyond what is already probably present in my Supernova assembly.

I know there is lots of software for filling gaps using short Illumina reads and dedicated software for long-read data like PacBio, but I wondered if there is any software that leverages the linked Illumina reads provided by 10X Genomics to perform gap-filling? Or is there a way to extract the assemblies of linked reads from Supernova (or produce de novo) that should theoretically provide contig sequences up to the length of the input molecules, which could be used with a custom mapping pipeline to fill gaps?

assembly genome • 609 views
ADD COMMENTlink modified 12 months ago by harish280 • written 12 months ago by daren.card20
gravatar for harish
12 months ago by
harish280 wrote:

I don't think there would be dedicated tool for it, but essentially all gapfillers do is just map the reads, compute the consensus and patch the gaps up assuming adequate unique flanking regions are obtained.

Since 10X reads are nothing but standard paired end libraries with 10X barcodes embedded, you can remove those barcodes from reads using scaff10X (scaff_reads) and then use gapfiller/cobbler/rails etc to fill those regions.

ADD COMMENTlink written 12 months ago by harish280

Thanks for the reply. Makes sense. I was just hoping it would be possible to somehow leverage the local assemblies from the linked reads. Seems like it would work better to map and extend using >10 kb "reads" vs. just 150 bp ones.

ADD REPLYlink written 12 months ago by daren.card20

I don't know about the fidelity of what I'm about to suggest, but maybe breaking the scaffold sequences at a specific amount of "N"s and using these pseudo-contigs might probably help.

ADD REPLYlink written 12 months ago by harish280
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1434 users visited in the last hour