Retrieve a % coverage for each transcript
0
0
Entering edit mode
13 days ago

Hi,

I have performed a standard RNAseq workflow using from paired-end reads (main steps summarised below)

extract_splice_sites.py Genome_Annos.gff3 >Genome.ss
extract_exons.py Genome_Annos.gff3 >Genome.exon
hisat2-build -f -p 4  --ss Genome.ss  --exon Srat_Geno.exon Genome.fa Genome_Index
hisat2 -p 4 -q -S ${sample}_aligned.sam -x ./Genomes/Genome -1 $forward_read -2 $reverse_read
samtools sort -@ 4 -o aligned.bam aligned.sam
stringtie -e -B -G ./Genomes/Genome_Annos.gff3 -o ./Count_gtfs/Count.gtf -p 4 -A ./Count_gtfs/Counts aligned.bam
../prepDE.py -i ./sample_list.txt -g ./Gene_count_matrix.csv -t ./Transcript_count_matrix.csv -l 50

My question is how would I be able to determine a % of each transcript that is completely covered by reads? We are surprised by the presence of any mRNAs in this sample, so we would ideally like a % value to indicate which of these detected mRNAs are fully intact within the sample, and which are only present as fragments.

Thanks

RNA-seq • 235 views
ADD COMMENT

Login before adding your answer.

Traffic: 2473 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6