Question: How to remove part of FASTA sequences?
0
gravatar for bioinfo
3.5 years ago by
bioinfo650
EU
bioinfo650 wrote:

I have 10 bacterial genomes FASTA sequences (few of them might have chromosome and plasmids together and rest only chromosome) . And I have another FASTA file with 20 different plasmids. My plan is to remove the plasmid part from those 10 bacterial genomes and keep only chromosomal part as FASTA form if any section of bacterial genome match 100% to any of my 20 plasmids list. What are the best ways to do it? 

 

sequence fasta • 1.4k views
ADD COMMENTlink modified 3.5 years ago by Matt Shirley8.0k • written 3.5 years ago by bioinfo650
3
gravatar for Matt Shirley
3.5 years ago by
Matt Shirley8.0k
Cambridge, MA
Matt Shirley8.0k wrote:

I would take a look at the Biostrings R package for this. You can read your FASTA sequences in, then simply use the `maskMotif` method to mask your plasmid sequences and then you can use `injectHardMask` to replace your masked sequences with a letter of your choice. This way you can still identify where there were sequences, and the total length of your chromosome doesn't change.

ADD COMMENTlink written 3.5 years ago by Matt Shirley8.0k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 776 users visited in the last hour