When running MITE-hunter, can I split the input genome?
3
3
Entering edit mode
7.8 years ago

Hi, everyone,

I was running MITE-hunter on a 600Mb sized genome, but it take too much time when generating the TSD file. Splitting the input genome and running them parallelly might increase the speed, but I noticed that the following steps involved all-by-all blastn. Will the splitted genome affect the blastn and further affect the MITE annotation?

Thank you.

repeat annotation genome transponson MITE-hunter • 1.9k views
ADD COMMENT
2
Entering edit mode
6.0 years ago

Yes you should be able to do so.

You make one blastdb of your whole genome, then you split the input file for the blast searches in batches and run those in parallel. Depending on the requested output you will need to correctly concatenate the separate output files you will get.

This approach should not affect the blast result (so will be identical to the 'true' all-vs-all approach)

ADD COMMENT
0
Entering edit mode
6.8 years ago
Dave Carlson ★ 1.7k

Good question. I'm in the same boat. What did you end up deciding to do?

ADD COMMENT
0
Entering edit mode
6.0 years ago
VS ▴ 730

I have the same question, did you find an answer to this?

ADD COMMENT

Login before adding your answer.

Traffic: 2441 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6