running MISO with ENSEMBL GTF if reads were aligned against UCSC genome
2
0
Entering edit mode
7.0 years ago

I aligned reads against UCSC genome using STAR, where UCSC gtf was used as reference annotation. As a result, I get the chromosomes in the format "chr1", .... When using pe-utils function in MISO, it is identifying 0 read pairs. I get this error when I tried using both Homo_sapiens.GRCh37.65.gff and hg19_ensGene.gff3 annotation files and also tried the --no-bam-filter option of the pe-utils function. I notice that my bam files also have "chrM". Will removing reads from "chrM" help?

RNA-Seq MISO • 2.0k views
ADD COMMENT
1
Entering edit mode
7.0 years ago

Why not just fix the chromosome names?

  1. samtools view -H foo.bam > header
  2. Edit the header (mostly, remove "chr" and convert "chrM" -> "MT")
  3. samtools reheader
ADD COMMENT
0
Entering edit mode

Devon, worked now. Thanks for the suggestion.

ADD REPLY
0
Entering edit mode
7.0 years ago
Emily 23k

You can use the Ensembl File Chameleon to export an Ensembl annotation file with "chr" style names.

ADD COMMENT
0
Entering edit mode

That's an interesting tool. Does it have the name conversions stored somewhere or does it just tack on chr everywhere?

ADD REPLY

Login before adding your answer.

Traffic: 3211 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6