References<\/strong><\/h3>\n[1] A. Fabijan, J. Gupchup, S. Gupta, J. Omhover, W. Qin, L. Vermeer and P. Dmitriev, “Diagnosing Sample Ratio Mismatch in Online Controlled Experiments: A Taxonomy and Rules of Thumb for Practitioners,” in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019.<\/p>\n
[2] H. Hamel, “A\/B Testing fast & secure, or how to improve ads iteratively, quickly, safely,” https:\/\/medium.com\/criteo-engineering\/a-b-testing-fast-secure-or-how-to-improve-ads-iteratively-quickly-safely-ab614e0d83fc, 2019.<\/p>\n
[3] R. Esfandani, “Monitoring and alerting for A\/B testing: Detecting problems in real time,” https:\/\/medium.com\/walmartglobaltech\/monitoring-and-alerting-for-a-b-testing-detecting-problems-in-real-time-4fe4f9b459b6, 2018.<\/p>\n
[4] A. Fabijan, T. Blanarik, M. Caughron, K. Chen, R. Zhang, A. Gustafson, V. K. Budumuri and S. Hunt, “Diagnosing Sample Ratio Mismatch in A\/B Testing,” https:\/\/www.microsoft.com\/en-us\/research\/group\/experimentation-platform-exp\/articles\/diagnosing-sample-ratio-mismatch-in-a-b-testing\/, 2020.<\/p>\n
[5] R. Kohavi, A. Deng, B. Frasca, T. Walker, Y. Xu and N. Pohlmann, “Online Controlled Experiments at Large Scale,” in Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, 2013.<\/p>\n
[6] D. Schuirmann, “A comparison of the Two One-Sided Tests Procedure and the Power Approach for assessing the equivalence of average bioavailability,” Journal of Pharmacokinetics and Biopharmaceutics, pp. 657-680, 1987.<\/p>\n
[7] A. Farcomeni, “A review of modern multiple hypothesis testing, with particular attention to the false discovery proportion,” Statistical Methods in Medical Research, vol. 17, pp. 347-388, 2008.<\/p>\n
[8] P. O’Brien and T. Fleming, “A Multiple Testing Procedure for Clinical Trials,” Biometrics, vol. 35, no. 3, pp. 549-556, 1979.<\/p>\n
[9] Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate: A practical and powerful approach to multiple testing,” Journal of the Royal Statistical Society, Series B, vol. 57, pp. 289-300, 1995.<\/p>\n
[10] Y. Benjamini and D. Yekutieli, “The control of the false discovery rate in multiple testing under dependency,” Annals of Statistics, vol. 29, pp. 1165-1188, 2001.<\/p>\n","protected":false},"excerpt":{"rendered":"
At Microsoft, we continuously improve products by developing new features for them. To facilitate data-driven decision-making in software development, product teams across Microsoft run tens of thousands of A\/B tests each year. While the primary purpose of A\/B testing is to rigorously evaluate customer satisfaction with new features and experiences, it also helps uncover anomalies, bugs, performance degradation, and user dissatisfaction. To catch these issues early, we rely on alerts. Alerts are proactive notifications to experimenters when something unexpected has occurred in an A\/B test.<\/p>\n","protected":false},"author":39973,"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":651963,"footnotes":""},"research-area":[],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-729247","msr-blog-post","type-msr-blog-post","status-publish","hentry","msr-locale-en_us"],"msr_assoc_parent":{"id":651963,"type":"group"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/729247"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/39973"}],"version-history":[{"count":212,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/729247\/revisions"}],"predecessor-version":[{"id":730801,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/729247\/revisions\/730801"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=729247"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=729247"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=729247"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=729247"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}