Removal of host reads from shotgun metagenomics data.
3
0
Entering edit mode
2.8 years ago

The presence of host sequence in shotgun metagenome data is considered as contamination and it is removed by mapping with a reference genome in different tools like bowtie2. Can anyone kindly through some light on how to create the reference genome when the whole genome sequence data is not available for the host organism? I am currently working on the gut microbiota of a lepidopteran species and its whole genome is not sequenced yet.

Any response is much appreciated.

metagenomics shotgun • 2.0k views
ADD COMMENT
0
Entering edit mode

Thank you for your reply GenoMax , @colindaven and @Mensur Dlakic. I will follow your suggestions and let you know.

ADD REPLY
1
Entering edit mode
2.8 years ago
GenoMax 141k

You should use a tool like bbsplit.sh to bin the reads. (LINK).

gut microbiota of a lepidopteran organism and its whole genome is not sequenced yet.

That is going to limit what you can identify and separate for host genome.

ADD COMMENT
1
Entering edit mode
2.8 years ago

You should just be able to use cat *.fa > draft_genome.fa to combine the contigs to a draft genome. Make sure you check it's usefulness using samtools faidx x.fa . Also check whole genome size is as expected.

Then as GenoMax says you can use bbsplit.sh to proceed.

ADD COMMENT
1
Entering edit mode
2.8 years ago
Mensur Dlakic ★ 27k

You can remove the host contigs after the assembly, and that way you will get to assemble some of the insect genome. It should be plenty different from whatever is in its gut, and I expect it will easily separate when you bin the contigs based on tetranucleotide frequencies. The only downside is that it may make the assembly difficult depending on the proportion of host / metagenomic reads.

ADD COMMENT

Login before adding your answer.

Traffic: 2743 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6