Hi
I have about 284 gene sequence in bacteria_A, and a whole genome sequence in bacteria_B, and bacteria_A and bacteria_B are highly homologous, they are different because of the gene arrangement.
I want to find the position of this 284 sequence in bacteria strainB.
Therefore, I use bowtie2 to align those 284 gene sequence to the complete sequence of bacteria_B by bowties:
./bowtie2-build Bacteria_B_referece.txt index_B
./bowtie2 -f -x index_B -U 284_gene_sequence.txt > output.txt
However, I got this result:
284 (100.00%) were unpaired; of these:
0 (0.00%) aligned 0 times
271 (95.42%) aligned exactly 1 time
13 (4.58%) aligned >1 times
I also tries this command but it didn't help me
./bowtie2 --local -f -x index_B -U 284_gene_sequence.txt > output.txt
The gene sequence are about 1000 to 3000 bp, and the complete genome of bacteria_B is 3.5Mbp.
Does anybody know how to fix it?
Ho-Wen Yang
Hi
Thank you for your reply, I use blast to find the location.
And I have 284 set like this:
How do I do to retrieve the position number from Bacteria_B genome (ex: beginning(2570132) and end(2569671))? I have 284 genes so I couldn't copy and past all the number I need.
Here are different ways to extract sequence you need from genome_B:
how to quickly extract sequence from genome positions
Sequence extract from multifasta using blastall coordinates
I use blast_formatter to retrieve the data I need. Thank you!