LAST alignment: different strands for single CDS query?
0
0
Entering edit mode
8.9 years ago
ksamuk • 0

I am aligning CDS sequences I downloaded from Ensembl to some draft assemblies someone has published. I used LAST to align then, and it appears to have worked, see example here:

What I can't figure out is that for most of the query CDS's ("O_niloticus" lines in the example), some of the transcript aligns to the + strand of the assembly, and others to - strand. This seems to be the case only when a single CDS aligns to multiple different contigs (e.g. L_fuelleborni.EQ177488 and L_fuelleborni.ABPK01036261.1 are different contigs...I think), i.e. all of the CDS alignments within a contig seem to be on the same strand.

So, my question is: why would this be the case? These are fairly closely related species, so I don't think chimerism/rearrangement is plausible (again, its nearly every CDS). Is it possible the orientation of the contigs assembly is not yet determined? If that's the case can I just reverse compliment the - strand alignments and tack them (with spaces) together with the + strand bit? The end application is a FASTA for each query CDS containing all the matching sequences from a number of other species (for PAML).

Any insight would be greatly appreciated!

blast alignment • 1.7k views
ADD COMMENT

Login before adding your answer.

Traffic: 1956 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6