Reverse Complement of each coding regions
0
0
Entering edit mode
3.8 years ago
the_cowa ▴ 40

I have a fasta file which contains coding sequences as like this

>Seq1
ATGAATGACAAAATTAA
GTTATATTTAAA 
TCCTTAATAACTTACCAGAAGAGACTAATGA
CATCTCATGCCATGAAG

I need to find Reverse Complement of each line and concatenate as like this

>Seq1
TTAATTTTGTCATTCATTTTAAATATAACTCATTAGTCTCTTCTGGTAAGTTATTAAGGACTTCATGGCATGAGATG

Normally "cat fasta |tr ACGTacgt TGCAtgca | rev" will Reverse Complement but here I need to find for each CDS

sequence fasta Reverse Complement • 647 views
ADD COMMENT
1
Entering edit mode

http://emboss.sourceforge.net/apps/cvs/emboss/apps/transeq.html

transeq reads one or more nucleotide sequences and writes the corresponding protein sequence translations to file. It can translate in any of the 3 forward or three reverse sense frames, or in all three forward or reverse frames, or in all six frames.

ADD REPLY
0
Entering edit mode

How is this relevant for the original question @Pierre? OP is asking for reverse-complementing individual chunks of sequence on each line and then catting them together? It seems to be an odd/very specific request.

ADD REPLY
0
Entering edit mode

Complement but here I need to find for each CDS

I understand the question as :" I want to find all the ORFs in the reverse complement."

ADD REPLY
0
Entering edit mode

I see. You included that as final solution assuming this is done first.

I need to find Reverse Complement of each line and concatenate as like this

ADD REPLY

Login before adding your answer.

Traffic: 2640 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6