How to search for specific data in a GFF file
11.1 years ago
Affan ▴ 310

I am currently reading a paper that deals with binding sites of the transcription factor Sp1 on chromosome 1. In the methods section, the authors say that the genome-wide positions of Sp1 binding sites were obtained from FANTOM 4.

I downloaded this file and added it as a custom track to the UCSC Genome Browser. It shows me (I think) all the transcription factors and their binding sites, but there are a lot of them and I can't find Sp1. The features have names like "TF_binding_site_cage_181208AR-110574".

My question is: what kind of software would I need to extract only the Sp1 data? I'd like to load just those entries as a custom track on the genome browser.

sequencing genome • 2.8k views
11.1 years ago

Try this command in a terminal. It should copy every line containing "Sp1" (in any letter case) to a new GFF file:

grep -i 'sp1' your_GFF_file > sp1.gff

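If that doesn't work, try grepping for other names for your transcription factor. Keep in mind that this is a substring match, so it would also pick up names such as "SP100"; adding -w to the grep options restricts it to whole-word matches.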
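
If grep on the whole line turns out to be too loose, a slightly stricter filter can look only at the attributes column. The following is a rough sketch, not a tested recipe for the FANTOM 4 file: extract_tf is a hypothetical helper name, and it assumes the file is tab-separated with the factor name somewhere in column 9, which may not hold for every GFF.

```shell
# Sketch only: extract_tf is a hypothetical helper, not part of any existing tool.
# Assumption: the GFF is tab-separated and the factor name appears in column 9.
# Usage: extract_tf NAME FILE
extract_tf() {
    tf="$1"
    gff="$2"
    awk -F'\t' -v tf="$tf" '
        /^#/ { print; next }    # keep comment/header lines as they are
        {
            # Break the lower-cased attributes into alphanumeric tokens so that
            # "sp1" matches "Name=Sp1" but not "Name=SP100".
            n = split(tolower($9), tok, "[^a-z0-9]+")
            for (i = 1; i <= n; i++)
                if (tok[i] == tolower(tf)) { print; next }
        }
    ' "$gff"
}
```

Used the same way as the grep command, for example: extract_tf Sp1 your_GFF_file > sp1.gff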
