You signed in with another tab or window.Reload to refresh your session.You signed out in another tab or window.Reload to refresh your session.You switched accounts on another tab or window.Reload to refresh your session.Dismiss alert
This repo is a simulation study of statistical properties for hypothesistests in LRTesteR. Each row is an experiment where data are generatedfrom random number generators and hypothesis test are done. The firstfive rows look like
#> # A tibble: 450,000 × 8#> test mu variance stat pvalue alt CI_LB CI_UB#> <chr> <dbl> <dbl> <dbl> <dbl> <chr> <dbl> <dbl>#> 1 gaussian_mu_one_sample -4 1 0.251 0.617 two.sided -4.07 -3.89#> 2 gaussian_mu_one_sample -4 1 1.79 0.181 two.sided -4.03 -3.85#> 3 gaussian_mu_one_sample -4 1 1.30 0.255 two.sided -4.04 -3.86#> 4 gaussian_mu_one_sample -4 1 0.453 0.501 two.sided -4.11 -3.94#> 5 gaussian_mu_one_sample -4 1 0.0143 0.905 two.sided -4.09 -3.92#> # … with 449,995 more rows#> # ℹ Use `print(n = ...)` to see more rows
For each simulated experiment, both the true hypothesis and the outcomeof the test are known. Multiple experiments are aggregated to calculatetype I error rates.
#> # A tibble: 30 × 4#> # Groups: test, mu [10]#> test mu variance Type_I_Error#> <chr> <dbl> <dbl> <dbl>#> 1 gaussian_mu_one_sample -4 1 0.05#> 2 gaussian_mu_one_sample -4 3 0.05#> 3 gaussian_mu_one_sample -4 5 0.05#> 4 gaussian_mu_one_sample -2 1 0.05#> 5 gaussian_mu_one_sample -2 3 0.05#> # … with 25 more rows#> # ℹ Use `print(n = ...)` to see more rows
Each simulated experiment is based on a sample size of 500. Eachcombination of true hypothesis, parameter value, and test are repeated5,000 times.
Detailed statistical analysis can be found in the type I and type IIfolders. Code to run the simulation is in the R Code folder.
One Sample Type I Error Rate
Most tests have a type I error rate of 5%. Likelihood ratio tests havesimilar error rates to exact tests.
One Way Type I Error Rate
Compared to the one sample tests, type I error rates are further fromthe .05 target.
For the one sample tests, all 500 data points are associated with onegroup. The one parameter is estimated using all 500 data points. For theone way tests, there are two groups and therefore two parametersestimated. Each parameter is based on only 250 data points. The cost ofless data per parameter is a type I error further from the target 5%.
One Sample Type II Error Rate
All tests achieve near 0% type II error for a large enough effect size.
One Way Type II Error Rate
Similar to above, all one way tests have near 0% type II error rate forlarge effect sizes.
Confidence Interval Coverage
In addition to error rates, confidence interval coverage rates areestimated. Most functions have a coverage rate of 95%. The worstperforming confidence intervals are within one percentage point.
About
A simulation of type I error rates, type II error rates, and coverage rates of functions in LRTesteR.