Are Plink MDS plots supposed to complete so quickly?
0
0
Entering edit mode
8.8 years ago
devenvyas ▴ 740

I ran MDS plots on a data set of >620,000 SNPs and 934 individuals using plink.

I followed the advice of a former lab member as well as this tutorial (https://www.staff.ncl.ac.uk/heather.cordell/pak2010MDS.html - mainly just the stuff after "To perform MDS analysis in PLINK, we first calculate a file").

plink --file dataset --genome --out dataset
plink --file dataset --read-genome dataset --mds-plot $k --out "dataset_"$k

($k is looped in and is the number of MDS dimensions ranging from 1 to 20).

The first line (genome file creation) does take a while, but I have noticed that the actual mds generation takes about 10 minutes per run and uses about half a gigabyte of RAM, which seems suspiciously quick and non-intensive.

Anyone have any experience with this? (i.e., am I doing something wrong, or is this normal)

plink SNP mds • 3.0k views
ADD COMMENT
0
Entering edit mode

Actually, once your data has been converted to binary, genome file creation should take less than two minutes with your dataset with PLINK 1.9 on most systems, and --mds-plot should take less than one.

ADD REPLY
0
Entering edit mode

This was running on Plink 1.07, single threaded. HPC just got Plink 1.9 installed this morning, so I haven't tried it out yet.

ADD REPLY

Login before adding your answer.

Traffic: 1853 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6