Question: Msa Preserve Strandedness
1
gravatar for Lee Katz
8.0 years ago by
Lee Katz2.9k
Atlanta, GA
Lee Katz2.9k wrote:

I am creating a set of alignment output files from an XMFA (one per alignment), but I want to retain strandedness. Is there a format that will retain the strand? I could hack it so that it is on the FASTA defline but I want the strandedness to be preserved when running another BioPerl script.

For example, when I have a fasta defline with the strand

>sequence1 strand|+
AAAAAAAA
>sequence2 strand|-
AAAAAAAA

BioPerl will not retain the strand information. The output file will not tell me that sequence2 has been reverse-complimented. Is there a format that would retain this information?

edit Additionally, is there a format that would retain strandedness through this conversion and then through a refinement by ClustalW?

fasta format bioperl multiple • 1.4k views
ADD COMMENTlink modified 3.1 years ago by Biostar ♦♦ 20 • written 8.0 years ago by Lee Katz2.9k
1
gravatar for Hamish
7.1 years ago by
Hamish3.1k
UK
Hamish3.1k wrote:

Short of encoding the strand as part of the sequence identifier, something that used to be done for gene predictions where the strand would be encoded using a 'c' (Crick strand) or 'w' (Watson strand) suffix on the identifier, I am not aware of any formats that will preserve stand information through the type of processing you describe.

Probably the simplest method would be to write a file containing a mapping table of the sequence identifiers to their strand (and any other annotations you would like to preserve). The after your processing has completed, use this file of meta-data to add back the required information to your preferred output format. This has the advantage of allowing to cope with any other mangling of the data that occurs during processing (for example identifier truncation).

ADD COMMENTlink written 7.1 years ago by Hamish3.1k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1150 users visited in the last hour