Question: Reads that map to two different genomes, but with different map coordinates
gravatar for Susmita Mandal
22 months ago by
Susmita Mandal60 wrote:

Hello everybody,

I have samples that I have mapped onto two custom pseudo genomes using STAR, thus creating two SAM/BAM files per sample. A small number of reads may be mapped with numerous or abnormal splice sites, or soft clipped reads mapped to different locations due to the presence of homologous sequence. I want to locate the small number of reads that map to both genomes, but with different map coordinates. For that I used bamUtil diff function to do it. I cannot seem to understand what parameters to give and also the results are very confusing. Has anyone faced similarly with bamUtil? or does anyone know something better than bamUtil for this purpose?

EDIT: I did ran bamUtil diff using ./bam diff --in1 1.bam --in2 2.bam --noCigar --baseQual --onlyDiffs --out output_file


I am getting this empty base Qual in pairs means that the reads that mapped onto diferent locations have identical quality. I want to find out if there any reads with different mapped locations and different base Qual too. Any ways? Any grep/sed/awk or shell script?


ADD COMMENTlink modified 22 months ago • written 22 months ago by Susmita Mandal60

Are the two pseudo-genomes generally the same, with some differences, or are they completely different?

ADD REPLYlink written 22 months ago by h.mon30k

The reference genome is same acting as a template and the SNPs are being incorporated into it from two mouse strains to make two pseudo genomes.

ADD REPLYlink written 22 months ago by Susmita Mandal60
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1423 users visited in the last hour