Prokka cycles
1
0
Entering edit mode
16 months ago
akern337 • 0

Hey guys, can you please share with me an idea, how to annotate more than 300 genomes in a for loop, so I can see the output of each one in separate folders named same as each genome? It's hard to orientate in a such huge output... For one genome it works like that:

prokka --kingdom Bacteria --outdir Anotation --addgenes Dataset/Akkermansia_muciniphila_An78.faa for example

Thanks a lot in advance!

Prokka annotation multiple • 490 views
ADD COMMENT
2
Entering edit mode
16 months ago
Mensur Dlakic ★ 27k

Assuming that all your genomes end in .fna and that you use a bash shell:

for i in *.fna
do
prokka --kingdom Bacteria --outdir ${i%.fna} --force --prefix ${i%.fna} --locustag ${i%.fna} --addgenes --cpus # $i
done

For a (t)csh shell:

foreach i ( *.fna )
prokka --kingdom Bacteria --outdir $i:r --force --prefix $i:r --locustag $i:r --addgenes --cpus # $i
end

In both cases instead of # you should enter the actual number of CPUs allocated to this command, and prokka command may need to be customized a bit to match your needs.

ADD COMMENT
0
Entering edit mode

Thank you a lot, Mensur, that was extremely helpful!

ADD REPLY

Login before adding your answer.

Traffic: 2697 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6