extract first and last n bp from fasta file from multiple fasta file in R
0
0
Entering edit mode
6 months ago
praasu ▴ 40

Hi,

I would like to extract first and last n bp from multiple fasta file in R. Let me know if you guys have any suggestion.

Many thank for your time.

comparative-genomics • 409 views
ADD COMMENT
1
Entering edit mode

Its not an answer but suggestion. Seqkit's subseq command does this. e.g. https://bioinf.shenwei.me/seqkit/tutorial/#play-with-mirna-hairpins

If you really want to do it in R you would want to loop over as you stream the file or else it would hog your memory really bad. Or you can call seqkit with system :).

  1. https://stackoverflow.com/questions/42492351/stream-processing-large-csv-file-in-r

  2. https://stackoverflow.com/questions/12626637/read-a-text-file-in-r-line-by-line

ADD REPLY
0
Entering edit mode

OP asked the exact same question 2 years ago and got the seqkit subseq answer from shenwei himself. Really odd behavior.

ADD REPLY

Login before adding your answer.

Traffic: 2266 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6