Question: How to view and analyze BAM files from SOLiD Bioscope alignment
0
gravatar for Aura
4.2 years ago by
Aura0
Aura0 wrote:

Hello. I have some .bam, bam.bai and .sam files corresponding to a Bioscope mapping of genome reads from SOLiD to a reference genome. I have had several issues even openig the files. As far as I a go I was able to open the alignment in Samtools, but not in IGV. I have to say that I am new at this so probably I am doing this wrong. When I try to open the .bam file in the graphic interface by clicking Tools>Run igvtools...>Count, etc, I got a .tdf null file and in the terminal I got these warnings:

"Chromosome: gi|448814763|ref|NC_000962.3| not found in .genome file.  Skipping."

"No features were found that matched chromosomes in genome..."

What is happening here? How else can I visualize this alignment?

Thaks for the help.

 

bioscope mapping solid genome • 1.5k views
ADD COMMENTlink written 4.2 years ago by Aura0

most likely the chromosome names do not match.

ADD REPLYlink written 4.2 years ago by Istvan Albert ♦♦ 80k
2
gravatar for Martombo
4.2 years ago by
Martombo2.4k
Seville, ES
Martombo2.4k wrote:
apparently the "chromosome" to which at least some of those reads were aligned to is called gi|448814763|ref|NC_000962.3|. if you only google NC_000962.3 you'll see that it is Mycobacterium tuberculosis H37Rv, complete genome. what you need to do is to get the fasta sequence of this genome (from ncbi). then create an index of it and use it as your reference.
ADD COMMENTlink written 4.2 years ago by Martombo2.4k

Thanks so much. That is right. The H37Rv strain is my reference genome. So, I created the index form the fasta file with samtools:

samtools faidx H37Rv.fasta

And then used the H37Rv.fasta.fai to create a genome file in IGV. Yet when agregate the H37Rv.genome file as reference I dont see the grafic representation of it in IGV. Am I doing it right?

ADD REPLYlink written 4.2 years ago by Aura0

did you check that the header of the fasta sequence is exactly gi|448814763|ref|NC_000962.3| ?

ADD REPLYlink written 4.2 years ago by Martombo2.4k

Yes I have, and It is. Also, I would like to get the annotation file for this genome (gff). It is a way to generate it from any of the ncbi formats?

ADD REPLYlink written 4.2 years ago by Aura0

mmmhh... maybe something went wrong while producing the .genome files? I would simply load the indexed fasta sequence into IGV, with Genomes -> load genome from file. If that doesn't work I'm all out of ideas...

ADD REPLYlink written 4.2 years ago by Martombo2.4k

That worked! Thanks.

ADD REPLYlink written 4.2 years ago by Aura0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2150 users visited in the last hour