Aligning an Ion Torrent-sequenced genome to a reference sequence
6.0 years ago
Paul ▴ 80

I am trying to align a bacterial genome to a reference genome using bowtie2 on Ubuntu.

The bacterial genome was sequenced on an Ion Torrent instrument. The paired-end reads are in FASTQ format in a single file (bacterial_genome.fastq).

The commands I used to align the reads to the reference genome (genome.fasta) with bowtie2 are

bowtie2 -x genome_index -U bacterial_genome.fastq

bowtie2 -x genome_index --interleaved bacterial_genome.fastq

But neither command gives me a proper alignment output in BAM format. Please suggest a way to align Ion Torrent data using bowtie2.

Update:

When I used bowtie2 to build genome_index, the build finished with the following line

Total time for backward call to driver() for mirror index: 00:00:02

When I used bowtie2 -x genome_index --interleaved bacterial_genome.fastq

it gave this error:

Command: /usr/bin/bowtie2-align-s --wrapper basic-0 -x genome_index --interleaved bacterial_genome.fastq 
(ERR): bowtie2-align exited with value 1

When I used bowtie2 -x genome_index -U bacterial_genome.fastq, it generated a BAM file, and when I tried to sort that file with samtools, it showed the following error

[bam_header_read] EOF marker is absent. The input is probably truncated.
[bam_header_read] invalid BAM binary header (this is not a BAM file).
Segmentation fault (core dumped)

The command I used for samtools is samtools sort bacterial_genome.bam sorted_bacterial_genome

NGS SNP Bowtie sequencing • 3.0k views

I'm just thinking about it and your update helps me.

You used bowtie2 -x genome_index -U bacterial_genome.fastq

The default output of bowtie2 is not a .BAM file; it's a .SAM file.

Try samtools view -bS bacterial_genome.sam > bacterial_genome.bam

Then samtools sort bacterial_genome.bam sorted_bacterial_genome
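
The steps above can be sketched as one small script. This is only a minimal sketch, not a definitive pipeline: the helper names `run` and `align_single_end` and the `DRY_RUN` switch are hypothetical, the file names are taken from the question, and the `samtools sort -o FILE` form assumes samtools 1.3 or newer. The older 0.1.x series, which prints `[bam_header_read]` messages, takes an output prefix as in the command above.

```shell
#!/bin/sh
# Hypothetical helper: print the command when DRY_RUN=1, otherwise run it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical helper: align single-end reads, convert SAM to BAM, then sort.
align_single_end() {
    index=$1; reads=$2; prefix=$3
    # -S names the SAM output file; without it bowtie2 writes SAM to stdout.
    run bowtie2 -x "$index" -U "$reads" -S "$prefix.sam" || return 1
    # Convert SAM text to binary BAM (-b = BAM output, -o = output file).
    run samtools view -b -o "$prefix.bam" "$prefix.sam" || return 1
    # Sort by coordinate; assumes samtools >= 1.3 for the -o FILE form.
    run samtools sort -o "$prefix.sorted.bam" "$prefix.bam"
}
```

Calling `DRY_RUN=1 align_single_end genome_index bacterial_genome.fastq bacterial_genome` prints the three commands without running them, which is a cheap way to check the file names first.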


It's not very common to use Ion Torrent technology to generate paired-end data. Did you do mate-pair sequencing?

Could you share the command you used to generate the index, please?


To generate the index I used bowtie2-build genome.fasta genome_index. Yes, I think it's mate-pair sequencing data.


What was the output of bowtie2-build genome.fasta genome_index? Did it finish successfully?

What was the output (in your terminal) of your command bowtie2 -x genome_index --interleaved bacterial_genome.fastq?

What do you mean by:

none of the commands give me a proper alignment output

Do you mean that you don't have a BAM file, or that the BAM file looks wrong to you?


@Bastein, I have updated the post to make it clearer. Do I need a specific Ion Torrent mapper to map this genome?


You didn't specify the output file:

[-S <sam>]
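
Without `-S`, bowtie2 writes SAM text to standard output, so if that output was redirected into a file named `.bam`, the file is still plain SAM. One rough way to see which format a file really is: BAM is BGZF (gzip-compatible) compressed and so starts with the bytes `1f 8b`, whereas SAM written by bowtie2 is text that normally begins with an `@` header line. The helper name `guess_alignment_format` is hypothetical, and this is a heuristic sketch rather than a real validator.

```shell
#!/bin/sh
# Hypothetical helper: guess SAM vs BAM from the first two bytes of a file.
guess_alignment_format() {
    # Read the first two bytes as hex, e.g. "1f8b" for gzip/BGZF data.
    magic=$(head -c 2 "$1" | od -An -tx1 | tr -d ' \n')
    case "$magic" in
        1f8b) echo "BAM" ;;      # gzip/BGZF magic number
        40*)  echo "SAM" ;;      # 0x40 is '@', the start of a SAM header line
        *)    echo "unknown" ;;
    esac
}
```

For example, `guess_alignment_format bacterial_genome.bam` reporting `SAM` would suggest the file needs converting with `samtools view -b` before it can be sorted.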


Can you post the output of the following?

head -12 your_sequence_file


This is bacterial_genome.fastq

@3NPEK:00664:05761
GGACACCATGTCCTCCACGCCGTCCGCCGACGAAGGCTCGCCCAGCAACCTGATGCTGCGCGGCCGGCTCATCGACGCGTTGCGCTCCGACGTCATCCGCGGCCGGTACCCACCCGGGTACCGGCTGGTGGAGCGGGATGTGGCTGGACGCTTACGGGGTTCACGGCTCCCGGCACGCGAGGCCCTGCAGGCGCTGAAGACCGAGGGCTTTCTCGAGGTGCGCAAAACCCGTGGCCTGGTGGTCCGCGGTTGGACCGAGCGGGACGTCACCGAGCTTTTCGACATCCGGCAGGCCCTGGAGGCCATGGCATGCCGCGAGGCCGCCGAGAACCGGACCGATGAGG
+
C?BA;@?@CCCC@CC@D?A@>BAB@BG@DDC@?>@?BBCCEE>B@@@@A?BBBBCBBA@A@C>B>C?AAAAAAAEBBBA@=????>C?DCABBCA<;)000*05;;@ABB=BEE>CB=???@C?BAA>AB?AAAAA:?@@:0*000*0==8;596666&/(/8/3,3333,34:<888937/8>,89@9706<<ABB<>8=4888=7>CA9>;7737279999998+77,56>;>;>D899<;;:BBB?C?B?9978<???A;@997?=?<@AAAAAA6@@<>>CC?B@>>?>BB;@?<89198<;=8;;=<<8=9933.328;7====7B=@?=>;>C989@:
@3NPEK:00664:05769
CCACCGCACTGCGCGCCGAGGTCCGCGCCTTTCTCGATGAGCAGCGCGCCGCGGGTACGTTCACCCCCCGCTGTGGACGCGTGGCTGTGCGGATGGGACGAGAAGTTCACCACCGCGCTGGCCGCCCGCGGCTGGCTGGGTATGACCGTCCCGAAAGAATATGGCGGCCAGGGCCGTTTCGTTTTTTGCATCGATTCGTGGTCACCGAGGAACTTTTTGGCCGCGGGGGTGCCCGTGGCCG
+
B@@@455==;:;AB<;89AC?@A?@CBC5;;/;?@@AA@??@999=@9:399>>)7/://)//::;;;&78333959:??AAB<>>A?A??<?ABB=BBBB>=:=B7;;@;>>8BA;;;B@B8;;B8;99;?@@@CCCC=CCB?==000>>8<>=900+9@::8:B?BACCC=@;999+9<=@ADD(99@A>>>?<?>@@?<>>@,0/:59)77<==(8/75;77777'377;3<??;@=@
@3NPEK:00664:05791
CAGGTGGTGTCCGATAAGGGCGAGCGGTTGTTCTACGATTACGACTTCGGCGACGGATGGGACCACGTGCTCGTGGTCGAAGACGTTTTCGATGATCCGCCCCCGGCTGCGGTGTGTCTGACGGGAAAGATGGCCTGTCCGCCGGAGGACTGTGGTGGCCTGGGCGGCTATGAGGAGTTGGCTGCGTGGGTTCGCGGCGGGTACGACCCGCGGGAAACGCCGATGGGACTCGGTGCGCAGGAGATGAGGGACTGGCTGCCCCGGGGCTGGCACCCCGACCGTTTCTCGGTGGCCGAGACCAATGACGCTCTGGCCGCGTTGAACACGCGTTGAGGATTCTTCA
+
CBB@?@=@CBB<AAAA=AC=CC@@AC@B@CCADE9::CC>AABGBB@BB?B;;;C?B@AA<BA>AAAA@AABBBC@CCBB?BDCBAAA6@AAABBBC?BCCCC3C?CBBBD?AAABBCCABBCD>BB>BCBB>B>BBBB?BB<@89<6<<@???;??;B?BCE>BC?CBCBCCB?AAC?B>BBBBBBBB<?;>>//00@@8@AAABBB=B99929@8@@B?BBBBB=?BBAA=AAA@@@>8<<>558??9?C@@<@B????3>@@3>>@8=99991<AA;>?D:A???;??:?:>>CAC<A=;;;======@:>588BF=BC<AA>AAAA8>>>9>>77729<

That looks like normal Ion Torrent sequence to me. Are you sure it is paired-end? There is only one file, correct?
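
One rough way to check is to compare neighbouring read names: in an interleaved paired-end FASTQ, records 1 and 2, 3 and 4, and so on normally share a name (sometimes with `/1` and `/2` suffixes). The sketch below assumes plain 4-line FASTQ records (no wrapped sequence lines), and the function name `count_adjacent_name_pairs` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: count reads and how many odd/even neighbours share a name.
count_adjacent_name_pairs() {
    awk '
        NR % 4 == 1 {                 # header line of each 4-line record
            name = $1
            sub(/\/[12]$/, "", name)  # drop a trailing /1 or /2 if present
            n++
            if (n % 2 == 0 && name == prev) pairs++
            prev = name
        }
        END { printf "reads=%d adjacent_pairs=%d\n", n, pairs }
    ' "$1"
}
```

If `adjacent_pairs` is zero, or far below half the read count, the file is probably not interleaved and treating it as single-end with `-U` is the safer choice. The three records posted above all have different names, which points the same way.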


Yes, but I am not sure whether the data is paired-end. If it is normal Ion Torrent sequence, how do I use bowtie2 to map it to my reference genome?


I would just treat it as single-end if you are not sure.

Instead of bowtie2 you could try minimap2. This data is bound to have indels etc., and you should use an aligner that can tolerate errors.
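
A minimal sketch of what that could look like for single-end reads, using the file names from the question. The `sr` (short-read) preset is an assumption here, not something recommended in this thread, and the helper `minimap2_cmd` is hypothetical: it only prints the command so it can be inspected before running.

```shell
#!/bin/sh
# Hypothetical helper: print a minimap2 command line for single-end reads.
minimap2_cmd() {
    ref=$1; reads=$2; out=$3
    # -a: write SAM; -x sr: short-read preset (an assumption for this data).
    # minimap2 writes to stdout, so the output is redirected to a file.
    printf 'minimap2 -ax sr %s %s > %s\n' "$ref" "$reads" "$out"
}

minimap2_cmd genome.fasta bacterial_genome.fastq bacterial_genome.sam
```

The resulting SAM file can then be converted and sorted with samtools in the same way as bowtie2 output.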


Thanks, this is helpful.
