I have a metagenomics dataset and I am interested in retrieving full or nearly complete gene sequences.
I first tried assembling the data using Ray Meta to assemble the data. The average length of my contigs is between 700 and 900 bp long.
How can I further resolve these contigs?
My first thought was to use a tool like GATTACA to bin the contigs into groups of sequences that might denote species and then try and run an assembly program just on those groups.
Any feedback or suggestions for different approaches are appreciated!