N50 for de novo assembly result is too short
0
0
Entering edit mode
4.2 years ago
nisrinalulu ▴ 10

Dear all,

I have sequences from soil plantation sample. I already performed the assembly for my sequences using MEGAHIT but the result for N50 is too short, it's only around 500 bp. I have reed some paper that N50 is not only the parameter to say that our assembly is good, but i want to try my best to get good N50 for my data. I also already use another parameter of MEGAHIT to perform the assembly such as --kmin-1pass , --presets meta-large and --min-count 1 but the result seem not really good. Do you have an advice for me to perform MEGAHIT to get a good assembly result? Or do you have other assembler recommendation to perform de novo assembly for shotgun metagenome from soil plantation sample?

Thank you so much for your help.

assembly sequence sequencing • 2.1k views
ADD COMMENT
0
Entering edit mode

Have you performed any QC on your data before assembly?

ADD REPLY
0
Entering edit mode

Yes, I have performed QC for my data using FASTQC and the result for the QC is good. This is one of the result for my QC. Almost all my sequences have the same result as below.

https://photos.app.goo.gl/7Dduc4diXh2EGDoe7

ADD REPLY
0
Entering edit mode

How deep did you sequence? Soil microbiome is probably the most complicated environment. If's anything less than two runs of HiSeq + Mate-pair or HiC the N50 you got is as good as you can get

ADD REPLY
0
Entering edit mode

I run my samples using Illumina HiSeq with 150 paired end. Beside of the platform for run the sequence, how I could know how deep is my sequence?

ADD REPLY
0
Entering edit mode

How many reads did you get? How long?

ADD REPLY
0
Entering edit mode

This is the total sequence that i get from QC:

1. F: 41,452,346 ; R: 41,452,346
2. F: 34,140,203 ; R: 34,140,203
3. F: 48,657,900 ; R: 48,657,900
4. F: 46,637,257 ; R: 46,637,257
ADD REPLY
0
Entering edit mode

OK, You need billions of reads in order to start getting something useful. You can only use it for reference-based analysis

ADD REPLY
0
Entering edit mode

Thank you so much for your reply. I'll looking forward about referece-based analysis. Sorry, do you have a recommendation of paper to perform reference-based anaysis?

ADD REPLY
0
Entering edit mode

Sorry, if I use reference-based analysis, can I get functional annotation from the analysis?

ADD REPLY
0
Entering edit mode

Yes, you sure can. You can start with using MG-RAST , it will do the magic for you.

ADD REPLY
0
Entering edit mode

Thank you for your advice. It helps me a lot.

ADD REPLY
0
Entering edit mode

Good luck with your research, I hope you'll get useful results.

ADD REPLY
0
Entering edit mode

Thank you so much Asaf

ADD REPLY
0
Entering edit mode

Hello, I was wondering if you improve your assembly somehow. I am also dealing with soil metagenomes and small N50

ADD REPLY

Login before adding your answer.

Traffic: 1510 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6