My goal is to retrieve the promoter regions of a few hundred genes. I have the primary transcripts of these genes in a FASTA file and the organism's genome in another FASTA file.
How can I align the transcripts to the reference genome and then retrieve 200nt of genomic sequence upstream of the transcript?
Thanks for the help!