retrieve flanking sequences from fasta file
1
0
Entering edit mode
7.3 years ago
sgmiriuka ▴ 10

Hi,

I am really new to the bioworld... looking for help in trying to get the flanking sequence from NCBI or Ensemble from a list of microRNA sequences in a fasta file. I have fasta file with the human microRNA sequences, and would like to map it and fetch the upstream and downstream flanks (i.e. about 100 bp). Tried in Biomart, successful with the hairpin sequence, but not with the mature ones. Guess that should go from a script. Great if done in biopython, but any help is welcome. thanks in advance! sgm

sequence biopython • 1.9k views
ADD COMMENT
1
Entering edit mode
7.3 years ago
mforde84 ★ 1.4k

http://bedtools.readthedocs.io/en/latest/content/tools/getfasta.html

You'll need the coordinate intervals (i.e., bed format file - https://genome.ucsc.edu/FAQ/FAQformat#format1). Make sure that the coordinates for the feature are consistent with the assembly you are using. If not then you can use liftOver to convert them (https://genome.ucsc.edu/cgi-bin/hgLiftOver).

ADD COMMENT

Login before adding your answer.

Traffic: 2632 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6