Question: Alignment to fastg assembly graph
0
gravatar for grp2009
12 months ago by
grp200940
grp200940 wrote:

Briefly, I am trying to automate the process of aligning a query sequence (from FASTA) to an assembly graph (FASTG from Spades assembly). As output I need the sequences of the paths in the assembly graph corresponding to the alignment(s).

More detail: I have used Spades to assemble the genome from a diploid yeast starting with short reads (WGS sequencing). Using the wonderful program Bandage I am then able to BLAST a certain query sequence against the assembly graph (FASTG file). Because of the diploid nature of the genome, the result looks like this:

image1-path

There are two paths corresponding to this BLAST alignment. In Bandage I can select the nodes corresponding to a path, and then export that path's sequence to a FASTA file.

Doing this manually gives me exactly what I want (essentially, haplotypes derived from the assembled genome). However, I would very much like to automate this process. What tools should I be looking into?

alignment assembly • 658 views
ADD COMMENTlink modified 12 months ago • written 12 months ago by grp200940
2
gravatar for grp2009
12 months ago by
grp200940
grp200940 wrote:

I didn't realize it when I posted the question, but Bandage has a command line mode, including a command querypaths that accomplishes precisely the task in question.

First, find out where the Bandage executable lives. On a Mac, you select the Bandage application, do "Show Package Contents", and the executable is in Contents/MacOS/Bandage. I will just call this executable Bandage.

Then you can run

Bandage assembly_graph.fastq query_sequence.fasta output_prefix

and it will produce output_prefix.tsv with the exactly the paths desired! What a wonderful program. Note that it works with both fastg and gfa formats as input.

ADD COMMENTlink modified 12 months ago • written 12 months ago by grp200940

Thank you for providing an answer/closure for the question. It will help someone in future.

ADD REPLYlink written 12 months ago by genomax83k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2370 users visited in the last hour