Obtain contex sequence from a vcf file
1
0
Entering edit mode
7.4 years ago
shinken123 ▴ 150

Hi

I have a vcf file that looks like this:

4   169239903   M1  G   A   .   PASS    .   GT  0/0 1/1 1/1 1/1 1/1 1/1 1/1 1/1 ./. 1/1 1/1 1/1 1/1 1/1 ./. 1/1 1/0 1/1 1/1 ./.
4   169832919   M2  A   C   .   PASS    .   GT  0/0 ./. 1/1 1/1 ./. 1/1 1/1 1/1 1/1 1/1 1/1 1/1 1/1 ./. 1/1 ./. ./. 1/1 1/1 ./.
4   169989920   M3  C   T   .   PASS    .   GT  0/0 ./. ./. 1/1 1/1 1/1 1/1 1/1 1/1 1/1 1/1 1/1 1/1 1/1 ./. 1/1 ./. 1/1 ./. ./.

Now I will like to obtain a contex sequence with around 15 nucleotides upstream the SNP and 15 nucleotides downstream the SNP from the reference genome to have something like this:

>4_169239903
ATCCCGGTTCCCTCTgTTTCCTTTCCTTGCT
>4_169832919
ATCGTAGTTAGCTAAaTTTCGTACTGACTGA
>4_169989920
CGTGCTGCATGATCCcTTGGCAGTACGTACC

Do you known how to obtain it?

Thank you

snp sequence • 1.5k views
ADD COMMENT
2
Entering edit mode
7.4 years ago
ADD COMMENT
0
Entering edit mode

Thank you very much Pierre,

This is exactly what I want.

ADD REPLY

Login before adding your answer.

Traffic: 2917 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6