I'm developing a parser for interleaved file formats, like Phylip, ClustalW and Nexus. But a challenge is to validate my parser against big samples, and do some stress testing. There are some nice short examples : PHYLIP format, Nexus DNA format. Anyone mind to share URL's to download .phy,. nxs, .nex files with many samples/sequences?
Could you be a bit more clear on what kind of data you are looking to parse? Sounds like you already have large sequence data files in FASTA, etc., format, so I'm not exactly sure what you are looking for.
Edited with links to some sample files of what I'm looking for. I just want the same files but with many more samples.