Question: Conversion Of Pfb Files From Hg18 To Hg19
1
gravatar for User 0559
7.9 years ago by
User 055910
User 055910 wrote:

I am using pennCNV for copy number to generate genotype clusters, using data from Affy6.0. The function generateaffygeno_cluster.pl takes input affygw6.hg18.pfb (provided by pennCNV package) as reference. However, this pfb file is based on hg18, as indicated by its title.

How would I convert it into a file based on hg19? I found this discussion http://biostar.stackexchange.com/questions/1622/pfb-file-for-cnv-analysis-using-illumina-610-quad-v1-arrays on the forum. However, my problem is different from this one as I am trying to convert between different genome versions. Can anyone give a hint on how to do this? Can I use liftover to do this convertion? And how would that be done? Thank you very much!

conversion hg • 3.9k views
ADD COMMENTlink modified 7.9 years ago by Mark0 • written 7.9 years ago by User 055910

You should not use the "code" tag for your questions.

ADD REPLYlink written 7.9 years ago by Ryan D3.3k

For reference, indenting a line with 4 spaces results in code formatting.

ADD REPLYlink written 7.9 years ago by Neilfws48k
5
gravatar for Ryan D
7.9 years ago by
Ryan D3.3k
USA
Ryan D3.3k wrote:

Take your existing pfb and convert the file to something that can be read by LiftOver. Convert it to hg19. Convert back to pfb.

Convert your pfb like so:

awk '{print "chr"$2"\t"$3"\t"$3+1"\t"$1" "$4}' old.pfb > to_usc.txt

Upload the new text file to LiftOver available here: http://genome.ucsc.edu/cgi-bin/hgLiftOver.

Convert to hg19 and download your file back to your Linux machine. We'll say you saved the file as from_usc.txt.

sed 's/chr//g' from_usc.txt | awk '{print $4"\t"$1"\t"$2"\t"$5}' > new.pfb

So long as you don't have any markers with the letters "chr" in them and your sex chromomsomes, X and Y, aren't named as chr23/chr24, the LiftOver should be smooth. Check your file when you are done with:

cat new.pfb | cut -f1 | sort | uniq

Then you can make sure all chromosomes came across properly. Also, don't forget to check the "failure file" on the UCSC website to see if there were any LiftOver failures. Hope this helps.

Rx

ADD COMMENTlink modified 11 weeks ago by RamRS25k • written 7.9 years ago by Ryan D3.3k

Thank you very much.. This is my first post and I am just starting high resolution genomics recently. Thank you for taking your time to answer this.

ADD REPLYlink written 7.8 years ago by User 055910
0
gravatar for Mark
7.9 years ago by
Mark0
Mark0 wrote:

You should upload a file to the web for this, would be useful, so people don't have to go through these tedious steps in the future!

Best, Mark

ADD COMMENTlink written 7.9 years ago by Mark0

Thanks. This was my first post and I am not very sure about the details of my own question. Thank you for your remark. I will try to do what you suggested any time later on.

Thanks.

ADD REPLYlink written 7.8 years ago by User 055910
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1162 users visited in the last hour