June 21, 2011

Enhancements to ADMIXTURE algorithm

It's good to have a new paper on ADMIXTURE which documents some of the features of the newest version of the software. I don't see much in this that you could not already find in the user manual; I was hoping for some more discussion on penalized likelihood, but not much can be found on the topic.

BMC Bioinformatics 2011, 12:246doi:10.1186/1471-2105-12-246

Enhancements to the ADMIXTURE Algorithm for Individual Ancestry Estimation

David H Alexander and Kenneth Lange

BMC Bioinformatics 2011, 12:246doi:10.1186/1471-2105-12-246

Abstract (provisional)

Background
The estimation of individual ancestry from genetic data has become essential to applied population genetics and genetic epidemiology. Software programs for calculating ancestry estimates have become essential tools in the geneticist's analytic arsenal.

Results
Here we describe four enhancements to ADMIXTURE, a high-performance tool for estimating individual ancestries and population allele frequencies from SNP (single nucleotide polymorphism) data. First, ADMIXTURE can be used to estimate the number of underlying populations through cross-validation. Second, individuals of known ancestry can be exploited in supervised learning to yield more precise ancestry estimates. Third, by penalizing small admixture coefficients for each individual, one can encourage model parsimony, often yielding more interpretable results for small datasets or datasets with large numbers of ancestral populations. Finally, by exploiting multiple processors, large datasets can be analyzed even more rapidly.

Conclusions
The enhancements we have described make ADMIXTURE a more accurate, efficient, and versatile tool for ancestry estimation.

No comments: