Do I need to search for the complementary sequence when using regex?
1
0
Entering edit mode
11 months ago
jamie.pike ▴ 80

Ok, prepare yourselves, this might be a very stupid question... If I am searching using a regex in an assembled genome, e.g. "ATCG", do I also need to search for the complementary sequence "CGAT" or will the assembly read 5′→3′?

regex • 320 views
ADD COMMENT
1
Entering edit mode

If you mean you are searching in fast sequence of a contig or a consensus sequence, then yes you will need to search for both. Generally if producing an assembly using denovo tools, the program will not know what is the correct 5' to 3' etc... If you are searching in a SAM/BAm of aligned reads against a consensus or reference, then generally you will have reads mapping in both directions.

ADD REPLY
2
Entering edit mode
11 months ago
JC 12k

short answer: Yes

long answer: DNA encodes information in both chains, also you assembly is most likely not oriented as de-novo assemblers just assemble without specific direction

ADD COMMENT

Login before adding your answer.

Traffic: 2506 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6