Control Group $(Y_0)$ | Treatment Group $(Y_1)$ | |
---|---|---|
Mean $(\mu)$ | ||
Standard Deviation $(\sigma)$ | ||
Number of Observations $(n)$ | ||
Significance Level $(\alpha)$ |
|
|
Critical t = {{ vm.graph.info.xcrit[1] | number : 3 }} | Noncentrality Parameter = {{ vm.graph.info.ncp | number : 3 }} | Degrees of Freedom = {{ vm.graph.info.dof | number : 3 }} |
Power = {{ vm.graph.info.power | number : 3 }} |
Imagine a scientist planning to run an experiment. A power analysis can help answer questions like:
This visualization illustrates how assumptions about the data generating process affect the likelihood of detecting a significant effect.
This power analysis assumes each outcome for the control group is distributed normally with mean {{ vm.graph.params.mu0 | number : 2 }} and standard deviation {{ vm.graph.params.sigma0 | number : 1 }}, and each outcome for the treatment group is distributed normally with mean {{ vm.graph.params.mu1 | number : 2 }} and standard deviation {{ vm.graph.params.sigma1 | number : 1 }}. If {{ vm.graph.params.n0 | number : 0 }} observations are collected from the control group and {{ vm.graph.params.n1 | number : 0 }} observations are collected from the treatment group, what is the probability of rejecting the null hypothesis, allowing for false rejections at a rate of {{ vm.graph.params.alpha | number : 3 }}? The probability of rejecting the null hypothesis that the expected outcomes for the two groups are equal is {{ vm.graph.info.power | number : 3 }}.
After running the experiment, the researcher would perform a t-test. The t-statistic is the difference in the mean outcomes for the two groups, divided by its standard error. Under the null hypothesis, the t-statistic follows the Student’s t-distribution with {{ vm.graph.info.dof | number : 3 }} degrees of freedom, this is the black distribution above. Under the alternative hypothesis, the t-statistic follows a noncentral t-distribution with the same degrees of freedom and noncentrality parameter of {{ vm.graph.info.ncp | number : 3 }}, this is the red distribution above. The null hypothesis is rejected if the experiment generates a t-statistic with magnitude greater than {{ vm.graph.info.xcrit[1] | number : 3 }}. The power of this hypothesis test equals the area under the curve of the alternative hypothesis with t-values of magnitude greater than the critical value, this area is highlighted in red.
Interested in how to perform this power calculation? Check out Harrison and Brady (2004). Note, I used Satterthwaite’s formula to approximate the degrees of freedom for the t-distributions.
Interested in building a similar visualization? Check out jStat, D3.js, AngularJS, and AngularJS Slider.