Phastcons Files
1
3
Entering edit mode
13.7 years ago

I'm currently looking for the 'probability/score' of a base to be conserved threw the evolution. So I looked at the phastCons files at the UCSC.

I found a set of WIG files here:

But there are also some (SQL) files here:

what's the difference between those files ? Which source should I use ? Would you suggest another source of data ?

Many thanks

Pierre

ucsc conservation • 6.8k views
ADD COMMENT
0
Entering edit mode

Hi ,

I am amso working with PhastCons files. Does the high scores means more conserved than the low score?

thanks

ADD REPLY
6
Entering edit mode
13.7 years ago
Ning-Yi Shao ▴ 390

If I am not wrong, they are different subsets of species that the PhastCon score method used to calculate the score. For the vertebrate.mod, even the coding regions may not be so conserved, while for the primates.mod, most regions--even non-coding regions are high conserved.

The txt.gz files contain information about lod score in bed formation (LOD=phastCons row log odds score). The further information you may check phastCon score program.

The program produces two main types of output. The primary
output, sent to stdout in fixed-step WIG format
(http://genome.ucsc.edu/goldenPath/help/wiggle.html), is a set of
base-by-base conservation scores. The score at each base is equal
to the posterior probability that that base was "generated" by the
conserved state of the phylo-HMM. The scores are reported in the
coordinate frame of a designated reference sequence (see
--refidx), which is by default the first sequence in the
alignment. They can be suppressed with the --no-post-probs
option. The secondary type of output, activated with the
--most-conserved (aka --viterbi) option, is a set of discrete
conserved elements. These elements are output in either BED or GFF
format, also in the coordinate system of the reference sequence
(see --most-conserved). They can be assigned log-odds scores
using the --score option.

ADD COMMENT
0
Entering edit mode

but what is the difference of information between the wig and the txt.gz ?

ADD REPLY
0
Entering edit mode

This problem I am not sure. The txt.gz files contain information about lod score in bed formation. I think those are the intermediate results of phastCon score program, for in case if somebody want to customize his own phastCon score. But I am not sure. Please check this webpage.

ADD REPLY
0
Entering edit mode

This problem I am not sure. The txt.gz files contain information about lod score in bed formation (LOD=phastCons row log odds score). The further information you may check phastCon score program.

ADD REPLY
0
Entering edit mode

This problem I am not sure. The txt.gz files contain information about lod score in bed formation (LOD=phastCons row log odds score). The further information you may check phastCon score program. It looks like the intermediate results of phastcon score program, and also can be used as measure of conservation. Please refer to this webpage.

ADD REPLY

Login before adding your answer.

Traffic: 1817 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6