Find the transcripts of genomic coordinates
1
0
Entering edit mode
2.8 years ago

Hello everyone

I have a bed file with genomic coordinates, how can I identify the corresponding transcripts?

Python • 791 views
ADD COMMENT
0
Entering edit mode

Do you mean identify transcripts that overlap with the coordinates in your BED file? It looks like you want to use Python to accomplish this. You could use PyRanges to read in your BED coordinates, and a GTF file of transcripts, and ask for overlaps. Did you want to use some other language? Or command-line tools? What form are your transcripts in?

ADD REPLY
2
Entering edit mode
2.8 years ago
Sukjun Kim ▴ 90

If you mean the way of fetching the sequence of the transcripts from the bed file, how about using getfasta in bedtools? You can use as follows:

$ cat test.fa
>chr1
AAAAAAAACCCCCCCCCCCCCGCTACTGGGGGGGGGGGGGGGGGG

$ cat test.bed
chr1 5 10

$ bedtools getfasta -fi test.fa -bed test.bed
>chr1:5-10
AAACC

(ref: https://bedtools.readthedocs.io/en/latest/content/tools/getfasta.html)

ADD COMMENT

Login before adding your answer.

Traffic: 2512 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6