hello
i am looking to extract the 3'utr sequences of fathead minnow fish using the annotation file and the genome. http://www.ncbi.nlm.nih.gov/genome/?term=fathead%20minnow
can someone help me or advice me on the method to do it?
hello
i am looking to extract the 3'utr sequences of fathead minnow fish using the annotation file and the genome. http://www.ncbi.nlm.nih.gov/genome/?term=fathead%20minnow
can someone help me or advice me on the method to do it?
If the UTRs are unannotated, you might have luck with the GenomicFeatures package in R. UTRs would be the difference between exon and CDS entries per transcript (there are some functions that enable doing that, read through the vignette). If you're really luck then the txdb object that you create will already do this, but I've never tried.
The you can either write those to a file and use bedtools getfasta or use an equivalent method in R.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
If UTR features are annotated in your annotation file, you can use
bedtools getfastautility to extract the sequences. You must provide the genome file and a BED/GFF/VCF file of ranges to extract (3'UTR in your case).If UTR are not annotated yet, you'd need to annotate them using some ORF prediction tool ...