{"id":868917,"date":"2022-11-15T07:22:30","date_gmt":"2022-11-15T15:22:30","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&p=868917"},"modified":"2022-11-15T12:33:24","modified_gmt":"2022-11-15T20:33:24","slug":"deep-dive-into-variance-reduction","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/deep-dive-into-variance-reduction\/","title":{"rendered":"Deep Dive Into Variance Reduction"},"content":{"rendered":"\n
Variance Reduction (VR) is a popular topic that is frequently discussed in the context of A\/B testing. However, it requires a deeper understanding to maximize its value in an A\/B test.\u202f In this blog post, we will answer questions including: What does the \u201cvariance\u201d in VR refer to? \u202fWill VR make A\/B tests more trustworthy?\u202f How will VR impact the ability to detect true change in A\/B metrics? <\/p>\n\n\n\n
This blog post provides an overview of ExP\u2019s implementation of VR, a technique called CUPED (Controlled experiment Using Pre-Experiment Data). Other authors have contributed excellent explainers of CUPED\u2019s performance and its ubiquity as an industry-standard variance reduction technique [1][2]. We have covered in previous blog posts how ExP uses CUPED in the experiment lifecycle [3]. <\/p>\n\n\n\n
In this post, we share the foundations of VR in statistical theory and how it amplifies the power of an A\/B testing program without increasing the likelihood of making a wrong decision. [a]<\/a>[4]<\/p>\n\n\n\n [a]<\/a> Many of the elements covered quickly in this blog are covered in excellent detail in Causal Inference and Its Applications in Online Industry [4].<\/p>\n\n\n\n To understand where variance reduction fits in, let\u2019s start with a more fundamental question: What\u2019s our ideal case for analyzing an A\/B test? <\/em>We want to estimate the difference in two potential outcomes for a user: the outcome in a world where the treatment was applied, and the outcome in a world where the treatment was not applied \u2013 the counterfactual. <\/p>\n\n\n\n The fundamental challenge of causal inference is that we cannot observe those two worlds simultaneously, and so we must come up with a process for estimating the counterfactual difference. In A\/B testing, that process relies on applying treatments to different users. Different users are never perfect substitutes for one another because their outcomes are functions not only of the treatment assignment but also of many other factors that influence user behavior.<\/p>\n\n\n\n Causal inference is a set of scientific methods to estimate the counterfactual difference in potential outcomes between our two imagined worlds. Any process of estimating this counterfactual difference introduces uncertainty. <\/p>\n\n\n\n Statistical inference is the process of proposing and refining estimators of an average counterfactual difference to improve the estimators\u2019 core statistical properties: bias (whether the estimator\u2019s expected value equals the true effect), variance (how much its estimates fluctuate across repeated samples), and consistency (whether its estimates converge to the true effect as the sample grows). <\/p>\n\n\n\n In fact, that\u2019s what the \u201cvariance\u201d in variance reduction refers to: the variance of the estimator of the average treatment effect. 
Variance reduction (as in CUPED-VR) is not a reduction in variance of the underlying data<\/em>, such as when sample data is modified through outlier removal, capping, or log-transformation. \u202fInstead, variance reduction refers to a change in the estimator <\/em>that produces estimates of the\u202ftreatment effect with lower standard error. <\/p>\n\n\n\n Random assignment ensures that the difference between treatment and control populations is an unbiased estimator of the average treatment effect. However, we need to consider how much uncertainty our estimation process has introduced. <\/p>\n\n\n\n To do so, we use the known rate of convergence to the true population difference \u2013 called consistency <\/em>\u2013 to estimate the true variance of the average treatment effect using our sample. With the delta estimate from difference-in-means (\( \delta_{DiM}\)) and the sample variance estimate, we report an interval of estimates that is likely to contain the true population difference, called a confidence interval<\/em>:<\/p>\n\n\n\n \( \begin{aligned} Var(\delta_{DiM}) &=\frac{ \sigma_{Y^T}^2}{n^T} + \frac{ \sigma_{Y^C}^2}{n^C} \\ CI_{lb,ub}&= \delta_{DiM} \pm z_{\alpha\/2}\sqrt{Var(\delta_{DiM})} \\ \end{aligned} \) [b]<\/a><\/p>\n\n\n\n The difference-in-means estimator for the average treatment effect is unbiased, and the variance of the estimator shrinks at a known rate as the sample size grows. When we propose VR estimators, we\u2019ll need to describe their relationship to the bias, variance, and the consistent variance estimate of the difference-in-means estimator to understand if we\u2019re improving.<\/p>\n\n\n\n [b]<\/a> \( z_{\alpha\/2} \) is the standard normal quantile at your acceptable \( \alpha \), or false positive rate. 
For example, a 95% confidence interval uses 1.96 for \( z_{0.05\/2} \).<\/p>\n\n\n\n Statistical tests that use variance reduction rely on an additional strategy to reduce the variance of an estimator of the average treatment effect, which yields a power benefit similar to increasing the A\/B test sample size.<\/p>\n\n\n\n This is rooted in the insight that even with a single user in treatment and a single user in control, if the users are good substitutes for one another, we expect to obtain a treatment effect estimate that\u2019s closer to the true treatment effect than if the users are very different from one another. The assignment procedure can be modified to try to ensure \u201cbalanced\u201d treatment and control assignments. Re-randomization of assignments with checks to ensure baseline balance uses this idea [5].<\/p>\n\n\n\n In many online A\/B tests, we don\u2019t modify our assignment procedure. Instead, we perform a correction in the analysis phase with VR estimators. VR combines large-sample asymptotic properties of A\/B tests with the optimization of comparing similar users through statistical adjustment. Similarity is modeled using user characteristics known to be independent of the treatment assignment.<\/p>\n\n\n\n CUPED is one method of VR, with the following steps: (1) choose a pre-experiment covariate \( X \) that is correlated with the metric \( Y \) \u2013 typically the same metric computed over a pre-experiment period; (2) estimate the adjustment coefficient \( \theta = Cov(X,Y)\/Var(X) \); (3) construct the adjusted metric \( Y_{CUPED} = Y - \theta(X - \bar{X}) \); and (4) apply the difference-in-means estimator to the adjusted metric.<\/p>\n\n\n\n From simulating CUPED-VR\u2019s performance versus difference-in-means on repeated samples of the same data, we can observe the extent of variance reduction for the estimator (plot<\/em> below<\/em>). 
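A simulation along these lines can be sketched in a few lines of Python. This is our own minimal illustration, not ExP code: the data-generating process, the effect size of 2.5, and names like `theta` are invented for the example, and `theta` is estimated from the pooled sample as in the CUPED procedure above.

```python
import numpy as np

rng = np.random.default_rng(7)
true_effect = 2.5
n, n_trials = 500, 1_000

dim_estimates, cuped_estimates = [], []
for _ in range(n_trials):
    # Pre-experiment covariate X and a correlated in-experiment metric Y.
    x = rng.normal(10, 3, size=2 * n)
    assign = rng.permutation(np.repeat([0, 1], n))  # random assignment
    y = 5 + 0.8 * x + true_effect * assign + rng.normal(0, 2, size=2 * n)

    # Difference-in-means estimate on the raw metric.
    dim_estimates.append(y[assign == 1].mean() - y[assign == 0].mean())

    # CUPED: adjust Y using the pooled regression of Y on X, then take
    # the difference-in-means of the adjusted metric.
    theta = np.cov(x, y)[0, 1] / x.var(ddof=1)
    y_adj = y - theta * (x - x.mean())
    cuped_estimates.append(y_adj[assign == 1].mean() - y_adj[assign == 0].mean())

# Both sets of estimates center on the true effect, but the CUPED
# estimates have a visibly smaller spread across trials.
print("DiM   std across trials:", np.std(dim_estimates))
print("CUPED std across trials:", np.std(cuped_estimates))
```

Plotting the two lists of estimates as overlaid histograms reproduces the kind of comparison described next.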
In this plot of estimates, the CUPED-VR estimates that land closer to the true effect of 2.5 than the difference-in-means estimates from the same trial are shifted because, in those trials, CUPED-adjusted metrics accounted for chance imbalance in the pre-A\/B test period.<\/p>\n\n\n\n When the estimated coefficients are weighted by assignment probability, the CUPED-VR estimator is equivalent to another popular regression adjustment estimator for A\/B tests: ANCOVA2, or Lin\u2019s estimator [6][7] [Table 1]. <\/p>\n\n\n\n The CUPED-VR estimator has known analytic results [7] describing how its variance compares to the variance of the difference-in-means estimator:<\/p>\n\n\n\n\(\begin{aligned} Var(\delta_{VR}) &=\left(\frac{ \sigma_{Y^T}^2}{n^T} + \frac{ \sigma_{Y^C}^2}{n^C}\right) (1 - R^2) \\ Var(\delta_{DiM}) &=\frac{ \sigma_{Y^T}^2}{n^T} + \frac{ \sigma_{Y^C}^2}{n^C} \\ \end{aligned} \)\n\n\n\n The variance is reduced in proportion to the amount of variance explained by the linear model in treatment and control, or the total \( R^2 \). And, importantly, the estimator is still consistent: we don\u2019t sacrifice unbiasedness in favor of lower variance. This means that when we estimate the variance of our \( \delta_{VR} \), we can build narrower confidence intervals, with values that are closer to the \( \delta_{VR} \) but reflect the same level of confidence about the range. It also means that if the true treatment effect is non-zero, we are more likely to detect a statistically significant effect. Indeed, the ratio of raw variance to VR variance, \( \frac{1}{1-R^2} \), represents the factor by which traffic would need to be multiplied for the simple difference-in-means estimator to achieve the same precision as VR. The effectiveness of CUPED-VR is influenced by various attributes of the product, telemetry, experiment, and metric. At Microsoft, we see substantial differences in efficacy across different product surfaces and metric types. 
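The analytic relationship between \( R^2 \) and the variance of the two estimators can be made concrete with a short sketch. This is our own illustration of the formulas above; the variances, sample sizes, and \( R^2 \) value are invented:

```python
def var_dim(var_t: float, var_c: float, n_t: int, n_c: int) -> float:
    """Variance of the difference-in-means estimator."""
    return var_t / n_t + var_c / n_c

def var_vr(var_t: float, var_c: float, n_t: int, n_c: int, r2: float) -> float:
    """Variance of the CUPED-VR estimator: Var(DiM) * (1 - R^2)."""
    return var_dim(var_t, var_c, n_t, n_c) * (1.0 - r2)

# With R^2 = 0.5, the variance halves, so matching that precision without
# VR would require 1 / (1 - R^2) = 2x the traffic.
v_dim = var_dim(9.0, 9.0, 10_000, 10_000)
v_vr = var_vr(9.0, 9.0, 10_000, 10_000, 0.5)
print(v_dim, v_vr, v_dim / v_vr)  # -> 0.0018 0.0009 2.0
```

The ratio printed last is exactly the effective traffic multiplier discussed below the headings.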
Variance reduction is the use of alternative estimators, like CUPED, to improve on the difference-in-means estimator and effectively multiply the observed traffic in an A\/B test. <\/strong>Its variance-reducing properties are rooted in the foundations of design-based statistical inference, which makes it a trustworthy estimator at scale.<\/p>\n\n\n\n \u2013 Laura Cosgrove, Jen Townsend, and Jonathan Litz, Microsoft Experimentation Platform<\/em><\/p>\n\n\n\n
\n\n\n\nVariance is a Statistical Property of Estimators<\/h3>\n\n\n\n
The Difference-in-Means Estimator Provides Consistency in A\/B tests<\/h3>\n\n\n\n
CUPED-VR Outperforms the Difference-in-Means Estimator <\/h3>\n\n\n\n
CUPED-VR Procedure<\/h4>\n\n\n\n
\n\n\n\nMeasuring CUPED-VR Performance with Effective Traffic Multiplier<\/h4>\n\n\n\n
Decision-makers understand that having more traffic in an A\/B test for a given time period helps decrease time-to-decision or increase confidence in a decision if evaluating at a fixed time. And at ExP, we have found this to be an easy-to-interpret representation of VR’s efficacy for Microsoft experimenters. We surface it for each variance-reduced metric and refer to it as the \u201ceffective traffic multiplier\u201d.<\/p>\n\n\n\n
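In practice, the multiplier can be recovered from two standard errors that a scorecard already reports: the standard error of the unadjusted difference-in-means and the standard error of the variance-reduced estimate. A minimal sketch (the function name and the sample numbers are our own, not ExP's implementation):

```python
def effective_traffic_multiplier(se_dim: float, se_vr: float) -> float:
    """Effective traffic multiplier implied by two standard errors.

    Equals Var(DiM) / Var(VR) = 1 / (1 - R^2). Values near 1.0 mean
    variance reduction contributed little for this metric.
    """
    return (se_dim / se_vr) ** 2

# Hypothetical metric: VR shrank the standard error from 0.044 to 0.028,
# equivalent to roughly 2.5x as much traffic for the unadjusted estimator.
print(round(effective_traffic_multiplier(0.044, 0.028), 2))  # -> 2.47
```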
Based on a recent 12-week sample of week-long experiments, groups of VR metrics from two different surfaces for the same product have very different average performance. In one Microsoft product surface, VR is not effective for most metrics: a majority of metrics (>68%<\/strong>) <\/strong>have effective traffic multiplier <=1.05x<\/strong>. In contrast, another product surface sees substantial gain from VR methods: a majority of metrics (>55%<\/strong>) <\/strong>have effective traffic multiplier >1.2x. <\/strong><\/p>\n\n\n\nSummary<\/h2>\n\n\n\n
\n\n\n\nCUPED-VR and ANCOVA2 Comparison Table<\/a><\/h3>\n\n\n\n