So I'm using plain python (I'm not using BioPython) to search for a string in E. Coli genome. How I do it is that I read each line of a fasta sequence, and I'll do an if sequence return thing on it, a pseudocode like this:
ecoli_sequence = open('ecolik12.fasta', 'r') a = ecoli_sequence.readlines()[1:] for y in a: if "TATAAA" in y: print ("it's here"+y) else: print("it wasn't in the genome, stupid code albeit") ecoli_sequence.close()
however, there is one big problem. if my sequence is at the interface of each line, it can't recognize it. What do you guys suggest?
Please help, I really will appreciate it.