Question: Reads that map to two different genomes, but with different map coordinates
0
gravatar for Susmita Mandal
6 months ago by
Bangalore
Susmita Mandal30 wrote:

Hello everybody,

I have samples that I have mapped onto two custom pseudo genomes using STAR, thus creating two SAM/BAM files per sample. A small number of reads may be mapped with numerous or abnormal splice sites, or soft clipped reads mapped to different locations due to the presence of homologous sequence. I want to locate the small number of reads that map to both genomes, but with different map coordinates. For that I used bamUtil diff function to do it. I cannot seem to understand what parameters to give and also the results are very confusing. Has anyone faced similarly with bamUtil? or does anyone know something better than bamUtil for this purpose?

EDIT: I did ran bamUtil diff using ./bam diff --in1 1.bam --in2 2.bam --noCigar --baseQual --onlyDiffs --out output_file

bamUtil_diff_results

I am getting this empty base Qual in pairs means that the reads that mapped onto diferent locations have identical quality. I want to find out if there any reads with different mapped locations and different base Qual too. Any ways? Any grep/sed/awk or shell script?

Susmita

ADD COMMENTlink modified 6 months ago • written 6 months ago by Susmita Mandal30

Are the two pseudo-genomes generally the same, with some differences, or are they completely different?

ADD REPLYlink written 6 months ago by h.mon24k

The reference genome is same acting as a template and the SNPs are being incorporated into it from two mouse strains to make two pseudo genomes.

ADD REPLYlink written 6 months ago by Susmita Mandal30
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2033 users visited in the last hour