with the CD-HIT command
cd-hit-est it is possible to form sequence clusters. Per cluster, 1 "representative" sequence is generated, as stated at the CD-HIT website:
... and produces a set of 'non-redundant' (nr) representative sequences as output.
Is such a nr representative sequence the same as a consensus sequence in CD-HIT? I want to use
cd-hit-est to cluster Nanopore amplicon sequence data.
The NCBI website "https://www.ncbi.nlm.nih.gov/mesh?Db=mesh&Cmd=DetailsSearch&Term=%22Consensus+Sequence%22%5BMeSH+Terms%5D" calls a consensus sequence a representative sequence. However, I'd like to know if CD-HIT also defines a representative sequence as a consensus sequence.
Any ideas? Thank you.