**280**wrote:

I am still new to bioinformatics and I have not yet fully understood the definition of contig. I have read a few explanations and what I understand is that contigs are fragments of the genome for which we are certain that the order of the bases is correct. Then, we make scaffolds out of the contigs and the goal is to get one scaffold to represent the entire genome.

Right now, I am trying to obtain the full reference genome in FASTA format of Streptococcus pneumoniae BR1064. I found this at ENA and in the top right category under "Send Feedback" it appears "Genome Representation: full". From there, one can get over to the assembly contig and there are 245 contigs. Can I just put all this contigs together and obtain the full genome of the organism? If so, is there a particular way to do it? Should it just be in increasing numerical order?

**6.7k**• written 6.0 years ago by John Smith •

**280**