Question: NGSadmix program returns -nan as a result for the admixture proportions of each individual
gravatar for bekahlee54
3 months ago by
bekahlee540 wrote:

I have been trying to run the program NGSadmix ( using nine whole genome sequences. The program will run, however, the output in my .fopt.gz and .qopt files only contain -nan in both columns (running a k=2).

I made a beagle input file using the angsd program (v0.918) and this code: angsd -GL 1 -out genolike -doGlf 2 -doMajorMinor 1 -SNP_pval 1e-5 -doMaf 1 -bam bam.filelist

where the bam.filelist is a file listing the location of each of the sorted and indexed .bam files on a new line.

I am running NGSadmix with this code: ./NGSadmix -likes genolike.beagle.gz -K 2 -o outputadmix

This is the log file I receive: Input: lname=genolike.beagle.gz nPop=2, fname=(null) qname=(null) outfiles=outputadmix Setup: seed=1520867306 nThreads=1 method=1 Convergence: maxIter=2000 tol=0.000010 tolLike50=0.100000 dymBound=0 Filters: misTol=0.050000 minMaf=0.050000 minLrt=0.000000 minInd=0 Input file has dim: nsites=25438921 nind=9 Input file has dim (AFTER filtering): nsites=25438921 nind=9 [ALL done] cpu-time used = 40718.93 sec [ALL done] walltime used = 41370.00 sec best like = -nan after 2000 iterations

The .qopt output file contains this information: $head outputadmix.qopt -nan -nan -nan -nan -nan -nan ...

I've tried re-making the input file, but am consistently getting the same results. I have had a hard time trouble-shooting this issue because I am unclear as to why I'm seeing -nan as a result.

Any suggestions for how to solve this problem would be much appreciated.

ADD COMMENTlink modified 3 months ago by andersdetermig20 • written 3 months ago by bekahlee540
gravatar for andersdetermig
3 months ago by
andersdetermig20 wrote:

You are using NGSadmix using 25 Million sites which seems like a very large number of SNPs - and you only have 9 individuals. Also it appears like you don't have a minimum minor allele frequency threshold.

Given the very low number of individuals I would try a more stringent filter on SNPs for example angsd -GL 1 -out genolike -doGlf 2 -doMajorMinor 1 -SNP_pval 1e-6 -doMaf 1 -bam bam.filelist -minMaf 0.1

also I would only use autosomes. This should reduce the number of sites.

ADD COMMENTlink modified 3 months ago • written 3 months ago by andersdetermig20
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 619 users visited in the last hour