ClustalW is extremely limited when in comes to multiple whole genome sequencing. I have recently just looked at mugsy which claims to be able to align a little over 30 whole genomes.
Is there a software that can align 400 whole genomes? This would be over a Gb of data.
Any help would be enormously appreciated.
If your purpose is to find orthologs between the genomes, Standalone blast from NCBI will help. Install via: ftp://ftp.ncbi.nih.gov/blast/ . No doubt, it`ll take huge amount of time.
I'm a little confused why you would want to do this. Assuming already assembled and annotated genomes, if whole genome alignment programs take sequences (fasta) and annotations (gff) and use both for alignment, why would you need to blast orthologs?
Progressive Mauve is a good choice. However, other helpful tools include Mummer and with a new ortholog finding tool that uses synteny - PanOCT - sourceforge.net/p/panoct/
Sybil and OrthoMCL are 10 times less memory consuming than PanOCT. http://sourceforge.net/projects/panoct/screenshots/PanOCT_memory_usage.jpg/2000/2000