Question

Off topic:knowing the exact position where each read is mapped

0

Entering edit mode

9.8 years ago

zizigolu ★ 4.4k

Hi biostars,

By tophat2 (tophat2 -p 8 -G genes.gtf genome file.fastq) command (gtf I think is the annotation file and genome is the whole genome fasta), I produced a file named accepted_hits.bam which using

bam2bed < accepted_hits.bam | bedmap --echo --count genes.bed - > answer4.bed

command, by bedops tool or bedtools multicov -bams yourBAMfile.bam -bed yourGenes.bed command by bedtools, I have a bed file now, like below

I    334    646    "YAL069W    .    +    protein_coding    CDS    0    exon_number "1"; gene_id "YAL069W"; gene_name "YAL069W"; p_id "P3633"; protein_id "YAL069W"; transcript_id "YAL069W"; transcript_name "YAL069W"; tss_id "TSS1128";    2**

For example 2 reads have been mapped on gene YAL069W in 334-646 distance

but how I can know where each of these reads individually have been mapped? I mean how I can have a file in which I could see where each read individually has been mapped. For example these two reads in protein_coding (CDS) part, where have been mapped individually not just knowing that in 334 - 646, two reads have been mapped on gene YAL069W (not a distance in where a number of reads have been mapped).

Please if you have any idea, let me know

Thank you so much

RNA-Seq tophat2 ribo-seq • 1.6k views

ADD COMMENT • link updated 2.8 years ago by Ram 45k • written 9.8 years ago by zizigolu ★ 4.4k