Question: LAST alignment: different strands for single CDS query?
gravatar for ksamuk
3.8 years ago by
United States
ksamuk0 wrote:

I am aligning CDS sequences I downloaded from Ensembl to some draft assemblies someone has published. I used LAST to align then, and it appears to have worked, see example here: 

What I can't figure out is that for most of the query CDS's ("O_niloticus" lines in the example), some of the transcript aligns to the + strand of the assembly, and others to - strand. This seems to be the case only when a single CDS aligns to multiple different contigs (e.g. L_fuelleborni.EQ177488 and L_fuelleborni.ABPK01036261.1 are different contigs...I think), i.e. all of the CDS alignments within a contig seem to be on the same strand.

So, my question is: why would this be the case? These are fairly closely related species, so I don't think chimerism/rearrangement is plausible (again, its nearly every CDS). Is it possible the orientation of the contigs assembly is not yet determined? If that's the case can I just reverse compliment the - strand alignments and tack them (with spaces) together with the + strand bit? The end application is a FASTA for each query CDS containing all the matching sequences from a number of other species (for PAML). 

Any insight would be greatly appreciated!

blast alignment • 986 views
ADD COMMENTlink written 3.8 years ago by ksamuk0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1781 users visited in the last hour