Question

Prokka cycles

0

Entering edit mode

16 months ago

akern337 • 0

Hey guys, can you please share with me an idea, how to annotate more than 300 genomes in a for loop, so I can see the output of each one in separate folders named same as each genome? It's hard to orientate in a such huge output... For one genome it works like that:

prokka --kingdom Bacteria --outdir Anotation --addgenes Dataset/Akkermansia_muciniphila_An78.faa for example

Thanks a lot in advance!

Prokka annotation multiple • 490 views

ADD COMMENT • link 16 months ago by akern337 • 0

score 2 · Accepted Answer · 2022-12-06

2

Entering edit mode

16 months ago

Mensur Dlakic ★ 27k

Assuming that all your genomes end in .fna and that you use a bash shell:

for i in *.fna
do
prokka --kingdom Bacteria --outdir ${i%.fna} --force --prefix ${i%.fna} --locustag ${i%.fna} --addgenes --cpus # $i
done

For a (t)csh shell:

foreach i ( *.fna )
prokka --kingdom Bacteria --outdir $i:r --force --prefix $i:r --locustag $i:r --addgenes --cpus # $i
end

In both cases instead of # you should enter the actual number of CPUs allocated to this command, and prokka command may need to be customized a bit to match your needs.