I have word file of whole genome sequence around 1709 pages, each gene is separated by ">". I need to blast whole genome sequence against a protein sequence from other organism for homology. Is there anyway to remove this information line ">gm_orf648 67_127_d_D 579383 580123 + 741_nt 246_aa" at once. instead of manually deleting one by one.
Do not do not do not do not do not keep your sequences in Office formats. Ever.
I’m actually amazed it’s even opened that many pages without crashing.
You don't have to delete these.
Thank you for meaningful help, I managed to copy it in oligo 7 thereby no need to remove > lines.