Trying to understand knownGene annotation file from UCSC
1
1
Entering edit mode
9.4 years ago

I have Annovar output that gives me a position for a variant from the start of a transcript CDS. The rna sequences in "knownGeneMrna" include the UTRs. I am trying to find the start of the CDS (exclude the first UTR) by using "knownGene.txt" but I don't know what the columns are. I thought it was the case that column 4 was the start of the transcript and column 6 was the end of the UTR but this does not make sense for certain transcripts (such as uc001abv.1) where the UTR seems to be longer than the transcript itself?

I wonder if somebody could help me with a description of or a link to a description of "knownGene"?

Thanks,
Jeremy

SNP RNA-Seq • 2.2k views
ADD COMMENT
0
Entering edit mode
9.4 years ago

Found file schemas here

ADD COMMENT

Login before adding your answer.

Traffic: 2644 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6