i want to create a fasta file using the mouse (mm10) genome and my first restriction enzyme AAGCTT. The new fasta file will start with AAGCTT and end with that. i already tries to create fragments with 2 existing functions of Basic4Cseq and FourCseq but without success. I want something that will list all of the AAGCTT sites in the genome in the format chr:start-stop. After that i can use this file to extract fasta sequence which will start and end with my restriction enzyme
Any idea ? any script ?