Hello, I am new here.
I have two sequences:
>NM_020536.5 Homo sapiens lysine acetyltransferase 14 (KAT14), transcript variant 1, mRNA
GAGCCTGGGCAGTACAGGCGGCGGTGCGCACTCTGCGGCGGCCTCTGCGCCTCGGGCGGGCGGGAGAGAG
AGGCCGCGGCCGCCAGCGTGGGGATGTCTAGGAGCTCGAAGGTGGTGCTGGGCCTCTCGGTGCTGCTGAC (..)
>XP_024305840.1 transmembrane protein 62 isoform X5 [Homo sapiens]
MAAVLALRVVAGLAAAALVAMLLEHYGLAGQPSPLPRPAPPRRPHPAPGPGDSNIFWGLQISDIHLSRFR
DPGRAVDLEKFCSETIDIIQPALVLATGDLTDAKTKEQLGSRQHEVEWQTYQGILKKTRVMEKTKWLDIK
GNHDAFNIPSLDSIKNYYRKYSAVRRDGSFHYVHSTPFGNYSFICVDATVNPGPKRPYNFFGILDKKKME (..)
I wanted to remove the newlines inside the sequences and used this command: line = line.rstrip("\n"), but it does not work.
Can anyone help?
There are many tools that can do this in *nix environments, but I see you are trying to use Python. You may also need to strip other whitespace and discard empty lines.
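As a rough sketch of that in Python (the function name linearize_fasta and the sample records are made up for illustration): strip each line, skip the empty ones, and join the sequence lines that belong to the same header. One possible reason rstrip("\n") seems to "not work" is that it only returns a new string for that single line, and writing each line out with print() adds a newline back, so the pieces are never actually joined. That is a guess, since the rest of your script is not shown.

```python
def linearize_fasta(lines):
    """Yield (header, sequence) pairs, with each sequence joined onto one line.

    `lines` can be any iterable of strings, e.g. an open file handle.
    """
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()  # drops "\n", "\r\n" and surrounding spaces
        if not line:
            continue  # discard empty lines
        if line.startswith(">"):
            # A new record starts: emit the previous one first.
            if header is not None:
                yield header, "".join(chunks)
            header = line
            chunks = []
        elif header is not None:
            chunks.append(line)
    # Emit the last record in the input.
    if header is not None:
        yield header, "".join(chunks)


# Small in-memory example (hypothetical records, not the ones from the question).
example = [">seq1 demo\n", "ACGT\r\n", "\n", "  TTGA \n", ">seq2\n", "MAAV\n"]
for header, sequence in linearize_fasta(example):
    print(header)
    print(sequence)
```

With a real file you would pass the open file handle instead of the list, for example: with open("input.fasta") as handle: ... linearize_fasta(handle), where "input.fasta" is a placeholder name.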
Yes, I am trying to use Python.
Please explain.
Shorten the header of this file first.
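A minimal sketch of one way to shorten a header, assuming "shorten" means keeping only the identifier (the text between ">" and the first space); the thread does not say which part should be kept, and shorten_header is a made-up helper name.

```python
def shorten_header(header):
    """Return the header reduced to '>' plus its first whitespace-separated field."""
    fields = header.lstrip(">").split()
    # An empty header (just ">") stays as ">".
    return ">" + (fields[0] if fields else "")


print(shorten_header(
    ">NM_020536.5 Homo sapiens lysine acetyltransferase 14 (KAT14), "
    "transcript variant 1, mRNA"
))
```

For the first record in the question this keeps only the accession, NM_020536.5, after the ">".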
Sounds like you want to linearize the FASTA files (code by @Pierre):