Off topic:knowing the exact position where each read is mapped
1
0
Entering edit mode
8.6 years ago
zizigolu ★ 4.3k

Hi biostars,

By tophat2 (tophat2 -p 8 -G genes.gtf genome file.fastq) command (gtf I think is the annotation file and genome is the whole genome fasta), I produced a file named accepted_hits.bam which using

bam2bed < accepted_hits.bam | bedmap --echo --count genes.bed - > answer4.bed

command, by bedops tool or bedtools multicov -bams yourBAMfile.bam -bed yourGenes.bed command by bedtools, I have a bed file now, like below

I    334    646    "YAL069W    .    +    protein_coding    CDS    0    exon_number "1"; gene_id "YAL069W"; gene_name "YAL069W"; p_id "P3633"; protein_id "YAL069W"; transcript_id "YAL069W"; transcript_name "YAL069W"; tss_id "TSS1128";    2**

For example 2 reads have been mapped on gene YAL069W in 334-646 distance

but how I can know where each of these reads individually have been mapped? I mean how I can have a file in which I could see where each read individually has been mapped. For example these two reads in protein_coding (CDS) part, where have been mapped individually not just knowing that in 334 - 646, two reads have been mapped on gene YAL069W (not a distance in where a number of reads have been mapped).

Please if you have any idea, let me know

Thank you so much

RNA-Seq tophat2 ribo-seq • 1.5k views
ADD COMMENT
This thread is not open. No new answers may be added
Traffic: 1719 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6