The presence of host sequence in shotgun metagenome data is considered as contamination and it is removed by mapping with a reference genome in different tools like bowtie2. Can anyone kindly through some light on how to create the reference genome when the whole genome sequence data is not available for the host organism? I am currently working on the gut microbiota of a lepidopteran species and its whole genome is not sequenced yet.
Any response is much appreciated.
Thank you for your reply GenoMax , @colindaven and @Mensur Dlakic. I will follow your suggestions and let you know.