Extract 3' UTR from annotation file
8.8 years ago
najibveto ▴ 120

Hello,

I am looking to extract the 3' UTR sequences of the fathead minnow using the annotation file and the genome: http://www.ncbi.nlm.nih.gov/genome/?term=fathead%20minnow

Can someone help me or advise me on a method to do this?

genome • 3.4k views

If UTR features are annotated in your annotation file, you can use the bedtools getfasta utility to extract the sequences. You must provide the genome FASTA file and a BED/GFF/VCF file of the ranges to extract (the 3' UTRs in your case).
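As a rough sketch of the "file of ranges" step, the Python below pulls 3' UTR records out of a GFF3 file and writes them as BED6 lines. It assumes the UTRs are labelled `three_prime_UTR` in the type column, which may not hold for every annotation release, so check your file first. The function name `gff_utr_to_bed` is made up for illustration.

```python
def gff_utr_to_bed(gff_lines, feature_type="three_prime_UTR"):
    """Yield BED6 lines for GFF3 records whose type column matches feature_type.

    Assumption: UTRs are annotated with the type `three_prime_UTR`;
    adjust feature_type if your file uses a different label.
    """
    for line in gff_lines:
        # Skip blank lines and GFF comments/directives.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != feature_type:
            continue
        chrom, strand = fields[0], fields[6]
        start, end = int(fields[3]), int(fields[4])
        # Parse "key=value;key=value" attributes; name the range after
        # its parent transcript if available, else its own ID.
        tags = dict(kv.split("=", 1) for kv in fields[8].split(";") if "=" in kv)
        name = tags.get("Parent", tags.get("ID", "."))
        # GFF is 1-based inclusive, BED is 0-based half-open.
        yield f"{chrom}\t{start - 1}\t{end}\t{name}\t0\t{strand}"


if __name__ == "__main__":
    example = [
        "##gff-version 3\n",
        "chr1\tsrc\texon\t1\t200\t.\t-\t.\tID=e1;Parent=tx1\n",
        "chr1\tsrc\tthree_prime_UTR\t101\t200\t.\t-\t.\tID=utr1;Parent=tx1\n",
    ]
    for bed_line in gff_utr_to_bed(example):
        print(bed_line)
```

Writing the output to a file (say `utr.bed`, a placeholder name) lets you run something like `bedtools getfasta -fi genome.fa -bed utr.bed -s -fo utr.fa`; the `-s` option makes bedtools reverse-complement ranges on the minus strand.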

If the UTRs are not annotated yet, you'd need to annotate them using some ORF prediction tool ...

8.8 years ago

If the UTRs are unannotated, you might have luck with the GenomicFeatures package in R. The UTRs would be the difference between the exon and CDS entries for each transcript (there are some functions that enable doing that; read through the vignette). If you're really lucky, the TxDb object that you create will already do this, but I've never tried.

Then you can either write those to a file and use bedtools getfasta, or use an equivalent method in R.
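To make the "exon minus CDS" idea concrete, here is a small Python sketch for a single transcript. It is not the GenomicFeatures API, just the interval logic written out. The function name `three_prime_utr` and the 0-based half-open coordinates are assumptions made for illustration. Whether the stop codon counts as part of the CDS depends on your annotation format, so check that before trusting the boundary.

```python
def three_prime_utr(exons, cds, strand):
    """Return the exon portions lying 3' of the coding region.

    exons, cds: lists of (start, end) tuples for one transcript,
                0-based half-open (an assumption of this sketch).
    strand:     "+" or "-".
    """
    if not cds:
        # Non-coding transcript: no CDS, so no UTR by this definition.
        return []
    cds_start = min(start for start, _ in cds)
    cds_end = max(end for _, end in cds)
    utr = []
    for start, end in sorted(exons):
        if strand == "+":
            # On the plus strand the 3' UTR lies to the right of the CDS.
            if end > cds_end:
                utr.append((max(start, cds_end), end))
        else:
            # On the minus strand the 3' UTR lies to the left of the CDS.
            if start < cds_start:
                utr.append((start, min(end, cds_start)))
    return utr


if __name__ == "__main__":
    exons = [(0, 100), (200, 300), (400, 500)]
    cds = [(50, 100), (200, 250)]
    print(three_prime_utr(exons, cds, "+"))
    print(three_prime_utr(exons, cds, "-"))
```

Applied per transcript and written out as BED with the strand column filled in, the resulting ranges can go straight into bedtools getfasta.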
