Question: Csplit Multifasta File Into A *Smaller Multifasta File
2
gravatar for neal.platt
7.7 years ago by
neal.platt220
United States
neal.platt220 wrote:

I am trying to split a large multifasta file into several smaller mutlifasta files. I have seen several examples of csplit being used to split multifasta files into single fasta, but for our purposes this is less than ideal.

I have been using the following command:

  csplit -kf fastafile '%^>%' '/^>/' "{10}"

How would I modify this to generate a file with 10 fasta sequences rather than splitting at individual fasta headers (>).

Would awk be more useful?

Thanks for the help in advance.

fasta split • 3.9k views
ADD COMMENTlink written 7.7 years ago by neal.platt220
0
gravatar for Istvan Albert
7.7 years ago by
Istvan Albert ♦♦ 82k
University Park, USA
Istvan Albert ♦♦ 82k wrote:

From your description I am not exactly sure how you want to split your file but there are a number of posts on splitting fasta files, perhaps those help

http://www.biostars.org/post/show/2226/how-to-split-a-multiple-fasta/

and

http://www.biostars.org/post/show/13270/how-to-split-one-big-sequence-file-into-multiple-files-with-less-than-1000-sequences-in-a-single-file/

ADD COMMENTlink written 7.7 years ago by Istvan Albert ♦♦ 82k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1035 users visited in the last hour