Removing headlines in fasta file in python
0
0
Entering edit mode
14 months ago
Maliha • 0

I want to remove the headlines of a fasta file that contains multiple protein sequences. I need to count the amino acid numbers.

>Os12t0641500-03 Similar to RecF/RecN/SMC N terminal domain containing protein, expressed.
MAAAAAGKGGGGQGRIHRLEVENFKSYKGTQTIGPFFDFTAIIGPNGAGKSNLMDAISFV
LIKVPLL*
>Os12t0597800-01 Similar to Helix-loop-helix DNA-binding domain containing protein, expressed.
MMSFPYSSGDLGEATTAAAAAVDMITLDQMFRDYDASTGDDLFELVWESCGGGEIDSGAG
LGRQ*
>Os12t0598600-00 Similar to H0315A08.1 protein.
MKRSMNYSGIECFTFGDDNKLRIFPPNSYKFKPKDHIILDEVQECILDNFWYQYNNKREE
FSDLDTMDLGGHGQPDE*
Bioinformatics • 496 views
ADD COMMENT
0
Entering edit mode

Hello,

it's not clear what you are trying to achieve. Please add an example of your desired output and explain how it is related to the input.

fin swimmer

ADD REPLY
0
Entering edit mode

Check BioPython for that

ADD REPLY

Login before adding your answer.

Traffic: 1304 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6