co-assemble Merged reads using Megahit
1
0
Entering edit mode
5.7 years ago
NAVID.BAVI • 0

Hi everyone,

I have 13 RNA seq datasets (demultiplexed fastq files) and I am trying to co-assemble them into contigs that are at least 1000 kb long and eventually perform a differential transcriptome analysis.

For start, I did a quality check with "illumina-utils" package and merged the reads with zero mismatch. Now I need to co-assemble them and obtain Contig files, fro which I've been advised to use Megahit.

Based on the Megahit tutorials, the input for co-assembly is supposed to be R1.fastq and R2.fastq, i.e. paired-end fastq files from different samples. However, what I have now (after the quality filtering and merging the overlapping pairs) is a bunch of *MERGED files for each sample.

I was wondering if Megahit can still be used for co-assembling already-merged reads or does it have to be pair-end files.

I'm quite new and crawling in the complex world of bioinformatics so any help/advice would be much appreciated.And please let me know if my question is not clear enough.

Cheers, Navid

RNA-Seq Assembly sequence Megahit Merged-reads • 5.6k views
ADD COMMENT
0
Entering edit mode

I am not familiar with Megahit but perhaps you just need to use your original reads before they were merged. Megahit may do the merging itself as a part of its workflow. Have you looked at the manual/tutorial for Megahit?

ADD REPLY
0
Entering edit mode

Thank you for replying. I do need to exclude the reads with mismatch before using Megahit. h.mon comment actually worked.

Cheers Navid

ADD REPLY
1
Entering edit mode
5.7 years ago
h.mon 35k

If all you fastq are now merged, i.e., they are now single-end, use the -r parameter. You can combine -1, -2 and -r if you have a mix of merged and unmerged reads.

  Input options that can be specified for multiple times (supporting plain text and gz/bz2 extensions)
    -1                       <pe1>          comma-separated list of fasta/q paired-end #1 files, paired with files in <pe2>
    -2                       <pe2>          comma-separated list of fasta/q paired-end #2 files, paired with files in <pe1>
    --12                     <pe12>         comma-separated list of interleaved fasta/q paired-end files
    -r/--read                <se>           comma-separated list of fasta/q single-end files
ADD COMMENT
0
Entering edit mode

Thanks a lot, it worked! I actually missed this point from the manual. My very best, Navid

ADD REPLY
1
Entering edit mode

If an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one if they work.
Upvote|Bookmark|Accept

ADD REPLY

Login before adding your answer.

Traffic: 2720 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6