Hi All,
I would like to identify SNPs that help distinguish Japanese and Chinese populations using 1000genomes data. Currently I am a little lost with all the potential programs and software packages to download and was wondering if anyone can direct me to a straight forward pipeline. I need to be able to "rank" in some way the possible set of SNPs and therefore prioritise the level of discrimination power of a SNP in separating two populations.
Thanks in advance!
Thank you very much Kevin,
I was able to download plink and set up a directory fine but when trying to download the VCF.gz files I see this error bash: wget: command not found -bash: wget: command not found -bash: wget: command not found -bash: wget: command not found -bash: wget: command not found -bash: wget: command not found -bash: wget: command not found -bash: wget: command not found
I have tried copy pasting each line of text and writing manually but nothing seems to change the result.
Thanks again,
Andrew
It's possible that
wget
is not installed on your computer. You can trysudo apt install wget
(you will require administrator rights, and will have to provide a password).Otherwise, try
curl -O
instead ofwget
Thanks Kevin you have been an incredible help!
I was able to get wget on my computer and moved a bit further along. I have managed to download the PED file and the reference files but struggling when converting 1000 Genomes files to BCF. my input code is (Volumes/Seagate is the directory on my external hard drive):
or chr in {1..22} X; do
done
but plink returns with :
[main] Unrecognized command.
Just wondering if you knew what the most likely cause was..
Thanks again!
Hey Andrew, are you specifying the
--bcf
command line parameter when trying to read into PLINK?