Question

Transcriptome Assembly: Number Of Transcripts Expected?

0

Entering edit mode

11.1 years ago

Rm 8.3k

I just completed a transcriptome assembly (~ 50 Million paired end 100bps read sets) of Xenopus species without reference genome:

I followed velvet + oases multi-kmer approach with min coverage 5 and and merged the transcripts obtained from multiple Kmer (31 to 75 with step of 6) assembles: In the merged set I got around 200K Loci with around 700K total transcripts?

I was just wondering if these numbers in generally expected range? or high

velvet transcript • 2.4k views

ADD COMMENT • link updated 11.1 years ago by Istvan Albert 100k • written 11.1 years ago by Rm 8.3k

score 1 · Answer 1 · 2013-03-15

1

Entering edit mode

11.1 years ago

Istvan Albert 100k

The number of expected transcripts is the number of existing transcripts. You can estimate that from other related genomes because even though the genomes might vary wildly the number of transcripts will probably vary a lot less.

Of course it is unlikely that any method would ever be able to match that correctly and usually you will get a lot more.

ADD COMMENT • link 11.1 years ago by Istvan Albert 100k

0

Entering edit mode

@Istvan : Thank you; i will look over numbers from related species....

ADD REPLY • link 11.1 years ago by Rm 8.3k

0

Entering edit mode

i used the CD-hit-est on the merged assembly at -c 0.9 it reduced the number to ~250K

ADD REPLY • link 11.1 years ago by Rm 8.3k