7.3 years ago
Bjoux
I want to calculate the conservation of a segment of RNA sequence (about 50 nt). I tried phastCons but failed, because I do not know how to prepare the input file. I would like to use the precomputed phastCons conservation scores that UCSC offers for download, but I do not know the location of my sequence in the genome. Does anyone know another method to calculate a conservation score?
Thanks
Bjou
Thanks. I tried the approach you mentioned. The procedure so far:
1. Download the phastCons7way data from UCSC. It contains hg38.phastCons7way.wigFix (17.8 GB), which holds the conservation score of each nucleotide.
2. My mRNA sequence is 1707 nt long. Mapping it to hg38 gives:
   query    score  start  end   qsize  identity  chrom  strand  start     end       span
   YourSeq  1704   1      1707  1707   100.0%    1      -       41847189  42035925  188737
3. Use bedmap to map this position to the phastCons score file. This is where I am stuck: I do not know how to use bedmap to extract the corresponding scores from the phastCons7way.wigFix file.
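Before any mapping, the BLAT hit in step 2 has to be written as a BED interval. Here is a minimal sketch in Python. The helper name `blat_hit_to_bed` is hypothetical, and it rests on two assumptions worth checking: that the start shown in the BLAT web table is 1-based (BED starts are 0-based), and that the score file names the chromosome `chr1` rather than `1`. Note also that the 188,737 bp span is far longer than the 1,707 nt query, so the hit presumably spans introns. A single interval like this covers intronic positions too, and per-exon blocks would have to come from the PSL output.

```python
def blat_hit_to_bed(chrom, start, end, name, score, strand, one_based=True):
    """Return one tab-separated BED6 line for a single BLAT hit.

    Assumption: `start` is 1-based, as displayed in the BLAT web table.
    BED uses 0-based, half-open intervals, so only the start is shifted.
    """
    # Assumption: the score file uses UCSC-style names such as "chr1".
    if not chrom.startswith("chr"):
        chrom = "chr" + chrom
    bed_start = start - 1 if one_based else start
    return "\t".join([chrom, str(bed_start), str(end), name, str(score), strand])


# Values copied from the BLAT result in step 2.
bed_line = blat_hit_to_bed("1", 41847189, 42035925, "YourSeq", 1704, "-")
print(bed_line)
```

Redirecting the printed line into a file gives a one-line BED file of the kind that BED-based tools take as the region input.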
Convert the WIG-formatted file to BED format with wig2bed (also in the BEDOPS kit). If BLAT gives you data in PSL format, you can use psl2bed to convert that to BED. Then map the regions to the signal with bedmap; the simplest case may or may not work well for phastCons signal. The documentation for bedmap is probably a good place to start, as it explains the tool and various scenarios for its use to map genomic intervals to score data.
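To make the mapping step concrete, here is a rough pure-Python stand-in for what the conversion and mapping achieve: it streams a fixedStep WIG file and averages the scores that fall inside one BED interval. This is not bedmap itself. The function name `fixedstep_scores` and the toy data are hypothetical, and the sketch assumes the file contains only fixedStep blocks with one value per line and a span of 1.

```python
import io


def fixedstep_scores(handle, chrom, start, end):
    """Yield (position, score) for 0-based positions in [start, end) on chrom.

    Assumes fixedStep WIG input with one value per line and span=1.
    fixedStep declaration lines use 1-based starts, so they are shifted by one.
    """
    current_chrom, pos, step = None, 0, 1
    for line in handle:
        line = line.strip()
        if not line or line.startswith(("track", "#")):
            continue
        if line.startswith("fixedStep"):
            fields = dict(item.split("=", 1) for item in line.split()[1:])
            current_chrom = fields["chrom"]
            pos = int(fields["start"]) - 1  # convert to 0-based
            step = int(fields.get("step", 1))
            continue
        if current_chrom == chrom and start <= pos < end:
            yield pos, float(line)
        pos += step


# Toy data standing in for a (much larger) wigFix file.
toy = """fixedStep chrom=chr1 start=11 step=1
0.1
0.2
0.3
0.4
fixedStep chrom=chr1 start=101 step=1
0.9
"""

# BED-style interval chr1:11-13 (0-based, half-open).
scores = [s for _, s in fixedstep_scores(io.StringIO(toy), "chr1", 11, 13)]
mean_score = sum(scores) / len(scores) if scores else None
print(scores, mean_score)
```

For a real file, pass an open file handle instead of the `io.StringIO` object. The function reads line by line, so a 17.8 GB file is never held in memory, although a full scan will be slow.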