Contigs unmapped to reference, Sanger sequencing assembly improvements and plasmid DNA
1
2
Entering edit mode
9.2 years ago
a.a.kechin ▴ 20

I have paired reads from sequencing bacterial genome with Illumina MiSeq. I assembled them into contigs and now I have 4022 contigs with N50=72749, length of the longest contig 229883, number of mapped contigs 41 and reference genome (1.8 Mb chromosome + 3 plasmids) coverage 91.35%. And I don't know what to do with 3981 unmapped contigs. Do they contain any useful information or they can be removed from the further analysis? And what about plasmids? How to separate plasmid DNA from a bacterial chromosome? And how can I improve this assembly by sequencing by Sanger?

Thank in advance for an answer!

contig next-gen-sequencing Assembly • 2.9k views
ADD COMMENT
1
Entering edit mode
9.2 years ago
rob234king ▴ 610

Might try to check for contamination first

Perform a blastn of ncbi then use MEGAN to identify contigs that are contamination i.e. human or mouse etc. Then delete those contigs.

You say contigs, have they not been scaffolded too? What assembler did you use and parameters?

Try using a higher k-mer value in your assembly and although you are likely to get smaller contigs they may extend your current contigs and allow you to join some together by aligning them. Or if not tried use a multi-kmer assembly approach like with soapdenovo2 usually 61-85 can do a good job. If plasmids are AT rich then high kmer value may assemble these specifically better and may have the coverage to make this more successful versus the genome.

Finally:

Align your genome against another similar using lastz to allow you to scaffold the contigs you have so can do some sangar to fill the gaps. Some use Mauve to do this automatically but not found it that good.

ADD COMMENT
0
Entering edit mode

Thank you! I'll try it.

I used a5-miseq pipeline. It produced scaffolds, too. But they were the same as contigs. How to separate contamination from horizontal gene transfer?

ADD REPLY

Login before adding your answer.

Traffic: 2154 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6