sequence retrieval in R using fasta file
1
0
Entering edit mode
7.1 years ago
rsabrina93 ▴ 10

I have a Hg19.fasta file and I have a csv file which contains the genomic coordinates of all 23 chromosomes which the sequence is to be retrieved. How can I do it using R?

R sequence retrieval • 3.0k views
ADD COMMENT
1
Entering edit mode

Not what you ask for, but bedtools getfasta would be the way to do this if you wouldn't constrain to R.

ADD REPLY
0
Entering edit mode
7.1 years ago

You could use the R package seqinr to read the FASTA file with read.fasta(), read the csv file with read.csv() then subset the sequence using the coordinates. There may also be something for you in the Bioconductor GenomicRanges package.

ADD COMMENT

Login before adding your answer.

Traffic: 2691 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6