{"id":262953,"date":"2016-07-21T08:51:16","date_gmt":"2016-07-21T15:51:16","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=262953"},"modified":"2020-04-15T15:45:39","modified_gmt":"2020-04-15T22:45:39","slug":"fastlmm","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/fastlmm\/","title":{"rendered":"FaST-LMM"},"content":{"rendered":"

NEW: Ludicrous speed LMM can run 1 million samples.<\/h2>\n

A version of FaST-LMM has now been optimized for use in the cloud and cloud sized data.\u00a0 If you are interested in reading about it, click here (opens in new tab)<\/span><\/a>.\u00a0 If you are interested in using this, please click here (opens in new tab)<\/span><\/a> to\u00a0send an\u00a0email to genomics@microsoft.com with “GWAS use request” as the\u00a0subject.<\/p>\n

Click here to download standard version of\u00a0FaST-LMM (opens in new tab)<\/span><\/a><\/h3>\n

FaST-LMM, (Factored Spectrally Transformed Linear Mixed Models) is a set of tools for efficiently performing genome-wide association studies (GWAS), prediction, and heritability estimation on large data sets. FaST-LMM runs on both Windows and Linux, and has been tested on data sets with over one million samples.<\/p>\n

The most up-to-date version of FaST-LMM is written in python and available on GitHub (opens in new tab)<\/span><\/a>.\u00a0 It supports univariate GWAS [1, 4], tests for epistasis, corrections for cellular heterogeneity via the inclusion of\u00a0principal components [2], set association\u00a0tests [3], and heritability estimation [5].\u00a0 A C++ version, including Windows binary (opens in new tab)<\/span><\/a>, Linux binary (opens in new tab)<\/span><\/a>, and source (opens in new tab)<\/span><\/a>, supports univariate GWAS and limited epistatic testing. Another version supporting corrections for cellular heterogeneity is available in python <\/u>and R<\/u>.\u00a0 An example of\u00a0FaST-LMM\u00a0with cloud computing is\u00a0here (opens in new tab)<\/span><\/a>.<\/p>\n

[1] Lippert, J. Listgarten, Y. Liu, C.M. Kadie, R.I. Davidson, D. Heckerman. FaST linear mixed models for genome-wide association studies. (opens in new tab)<\/span><\/a> Nature Methods<\/em>, 8: 833-835, Oct 2011 (doi:10.1038\/nmeth.1681).<\/p>\n

[2] Zou, C. Lippert, D. Heckerman, M. Aryee, J. Listgarten. Epigenome-wide association studies without the need for cell-type composition. (opens in new tab)<\/span><\/a> Nature Methods<\/em>, 11: 309\u2013311, Jan 2014 (doi:10.1038\/nmeth.2815).<\/p>\n

[3] Lippert, Jing Xiang, Danilo Horta, Christian Widmer, Carl M. Kadie, D. Heckerman, J. Listgarten. Greater power and computational efficiency for kernel-based association testing of sets of genetic variants. (opens in new tab)<\/span><\/a> Bioinformatics<\/em>, 30, July 2014 (doi: 10.1093\/bioinformatics\/btu504).<\/p>\n

[4] Widmer, C. Lippert, O. Weissbrod, N. Fusi, C.M. Kadie, R.I. Davidson, J. Listgarten, and D. Heckerman. Further Improvements to Linear Mixed Models for Genome-Wide Association Studies. (opens in new tab)<\/span><\/a> Scientific Reports<\/em>, 4, 6874, Nov 2014 (doi:10.1038\/srep06874).<\/p>\n

[5] Heckerman, D. Gurdasani, C. Kadie, C. Pomilla, T. Carstensen, H. Martin, K. Ekoru, R.N. Nsubuga, G. Ssenyomo A. Kamali, P. Kaleebu, C. Widmer, and M.S. Sandhu. Linear mixed model for heritability estimation that explicitly addresses environmental variation (opens in new tab)<\/span><\/a>. PNAS<\/em>, 113: 7377\u20137382, July 2016 (doi: 10.1073\/pnas.1510497113).<\/p>\n\t\t\t

\n\t\t\t\t\t\t\t
\n\t\t\t\t\t\n\t\t\t\t\t\t