Reverse Complement of each coding regions

0

Entering edit mode

3.8 years ago

the_cowa ▴ 40

I have a fasta file which contains coding sequences as like this

>Seq1
ATGAATGACAAAATTAA
GTTATATTTAAA 
TCCTTAATAACTTACCAGAAGAGACTAATGA
CATCTCATGCCATGAAG

I need to find Reverse Complement of each line and concatenate as like this

>Seq1
TTAATTTTGTCATTCATTTTAAATATAACTCATTAGTCTCTTCTGGTAAGTTATTAAGGACTTCATGGCATGAGATG

Normally "cat fasta |tr ACGTacgt TGCAtgca | rev" will Reverse Complement but here I need to find for each CDS

sequence fasta Reverse Complement • 647 views

ADD COMMENT • link 3.8 years ago by the_cowa ▴ 40

1

Entering edit mode

http://emboss.sourceforge.net/apps/cvs/emboss/apps/transeq.html

transeq reads one or more nucleotide sequences and writes the corresponding protein sequence translations to file. It can translate in any of the 3 forward or three reverse sense frames, or in all three forward or reverse frames, or in all six frames.

ADD REPLY • link 3.8 years ago by Pierre Lindenbaum 161k

0

Entering edit mode

How is this relevant for the original question @Pierre? OP is asking for reverse-complementing individual chunks of sequence on each line and then catting them together? It seems to be an odd/very specific request.

ADD REPLY • link 3.8 years ago by GenoMax 141k

0

Entering edit mode

Complement but here I need to find for each CDS

I understand the question as :" I want to find all the ORFs in the reverse complement."

ADD REPLY • link 3.8 years ago by Pierre Lindenbaum 161k

0

Entering edit mode

I see. You included that as final solution assuming this is done first.

I need to find Reverse Complement of each line and concatenate as like this

ADD REPLY • link 3.8 years ago by GenoMax 141k

Login before adding your answer.