Question: Beagle imputation run time
joshuaehill wrote (4 months ago):

Hello, I am running Beagle 5.1 and am wondering about runtime and input limits. I successfully ran a VCF with 24 samples and 1M markers, and it took 1m20s. I am now running a VCF with 507 samples and 250k markers, and Beagle has been running for 24 hours. No error output has been generated, and it has not run out of memory yet. I have it running separately on 2 machines, each with 48 cores and either 250G or 500G of RAM.
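To put the two runs side by side, here is a rough back-of-the-envelope comparison using only the numbers quoted above. It assumes, purely for illustration, that cost scales with samples × markers; Beagle's phasing and imputation need not scale that way, so treat it as a sanity check rather than a runtime model.

```python
def genotype_count(samples: int, markers: int) -> int:
    """Number of genotype calls in a VCF with the given dimensions."""
    return samples * markers


# Figures taken from the question.
small_run = genotype_count(samples=24, markers=1_000_000)
large_run = genotype_count(samples=507, markers=250_000)

# How much bigger is the second input, measured in genotype calls?
size_ratio = large_run / small_run

# How much longer has the second run taken so far (24 h versus 1m20s)?
small_runtime_s = 80
large_runtime_s = 24 * 3600
time_ratio = large_runtime_s / small_runtime_s

if __name__ == "__main__":
    print(f"small run genotypes: {small_run:,}")
    print(f"large run genotypes: {large_run:,}")
    print(f"input size ratio:    {size_ratio:.2f}x")
    print(f"elapsed time ratio:  {time_ratio:.0f}x (and still running)")
```

The second input holds only about 5 times as many genotype calls, yet it has already run more than 1000 times longer, so raw input size alone does not seem to account for the difference.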

I am only running 1 chromosome through at this time.

the command I am running:

java -Xmx200g -jar ~/.conda/envs/beagle/share/beagle-5.1_24Aug19.3e8-0/beagle.jar gt=/data1/users/joshua.hill/19255Swo-total-gvcf-joint/total-gvcf-joint-only-ScWZqcR_8017_12737-mmiss05-maf005.recode.vcf out=/staging/total-gvcf-joint-only-ScWZqcR_8017_12737-mmiss05-maf005-imputation

On the 500G machine I have allocated 450g to it (java -Xmx450g -jar). I can see that all the cores are being used and something is happening, but with no output it is hard to tell what. From what I can tell looking at the process, it is using all the memory allocated to it. It is stuck at the "Window 1 , Study Markers" stage and has not started the iterations yet.

Thanks, Josh


Hello,

Adding to my question: the run on the machine where I allocated 200g of memory has now failed with an "OutOfMemory" error. So now I know that 200g of memory is not enough for 507 samples and 250k markers. I am hoping that the run on the other machine, with 450g allocated, will work.

Bioinformatics_NewComer wrote (4 months ago):

Are you doing this per chromosome? Also, are you running the imputation chromosome-wide? I think chromosome-wide imputation is computationally very hard; it is usually suggested to perform imputation in chunks instead (3, 5, or 7 Mb, whichever works).
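As a minimal sketch of the chunked approach, the script below splits one chromosome into overlapping intervals and prints one Beagle command per chunk, restricting each run with Beagle's `chrom=[chrom]:[start]-[end]` argument. The chromosome name, its length, the chunk size, the overlap, the heap size, and all file paths are hypothetical placeholders, not values from the question.

```python
import shlex
from typing import Iterator, List, Tuple


def chunk_intervals(chrom_length: int, chunk_size: int,
                    overlap: int) -> Iterator[Tuple[int, int]]:
    """Yield 1-based inclusive (start, end) intervals covering 1..chrom_length.

    Consecutive intervals share `overlap` bases, so markers near a chunk
    edge also appear in the neighbouring chunk.
    """
    if chrom_length < 1:
        raise ValueError("chrom_length must be at least 1")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    start = 1
    while True:
        end = min(start + chunk_size - 1, chrom_length)
        yield start, end
        if end == chrom_length:
            break
        start = end - overlap + 1


def beagle_command(jar: str, gt: str, out_prefix: str, chrom: str,
                   start: int, end: int, xmx: str) -> List[str]:
    """Build the argument list for one chunk; only gt=, chrom= and out= are set."""
    return [
        "java", f"-Xmx{xmx}", "-jar", jar,
        f"gt={gt}",
        f"chrom={chrom}:{start}-{end}",
        f"out={out_prefix}.{chrom}_{start}_{end}",
    ]


if __name__ == "__main__":
    # All values below are illustrative assumptions.
    for start, end in chunk_intervals(chrom_length=12_000_000,
                                      chunk_size=5_000_000,
                                      overlap=500_000):
        cmd = beagle_command(jar="beagle.jar", gt="input.vcf",
                             out_prefix="imputed", chrom="chr1",
                             start=start, end=end, xmx="32g")
        print(shlex.join(cmd))
```

Each chunk produces its own output file, so the overlapping regions still need to be trimmed and the chunks concatenated afterwards; that step is not shown here.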
