Question: Copy number (repeat count) of assembled contigs
1
gravatar for misaghb
5.9 years ago by
misaghb20
United States
misaghb20 wrote:

I have a set of assembled contigs (genomic sequences). These contigs were assembled using a de novo short read assembler (e.g. Velvet, ABySS, etc). I map/align a new short read library (e.g. Illumina paired end or mate pair library) against these contigs using a short read mapper like BOWTIE2. So now I have a SAM/BAM alignment file at hand.

Is there any software (or a pipeline using different tools) to take this alignment SAM/BAM file as input and outputs which contigs are repeated in the genome and estimate those integer repeat counts (copy numbers) using coverage information in the alignment input file? 

Notes:

1. No reference genome available.

2. Magnolya is a software that can be used for this purpose although it's not specifically developed for this problem. But it's limited to ace format and some depreciated scripts from tools like ABySS and Newbler.

ADD COMMENTlink modified 5.9 years ago by apelin20480 • written 5.9 years ago by misaghb20

Hi, this is what I want too. I wonder if you get a solution!!

ADD REPLYlink written 7 days ago by guangzep0
1
gravatar for apelin20
5.9 years ago by
apelin20480
Canada
apelin20480 wrote:

You can use https://github.com/lindenb/jvarkit/wiki/BamStats04 to caluculate coverage for regions specified in BED file, these regions can be your entire contigs. Go go Pierre.

ADD COMMENTlink written 5.9 years ago by apelin20480
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1931 users visited in the last hour
_