| Test Statistic | Type of Test |
|---|---|
| t-statistic | t-test Regression test |
| F-statistic | ANOVA MANOVA ANCOVA |
| z-statistic | z-test |
| χ²-statistic | Chi-square test |
Teststatistic is a quantity derived from thesample forstatistical hypothesis testing.[1] A hypothesis test is typically specified in terms of a test statistic, considered as a numerical summary of a data-set that reduces the data to one value that can be used to perform the hypothesis test. In general, a test statistic is selected or defined in such a way as to quantify, within observed data, behaviours that would distinguish thenull from thealternative hypothesis, where such an alternative is prescribed, or that would characterize the null hypothesis if there is no explicitly stated alternative hypothesis.
An important property of a test statistic is that itssampling distribution under the null hypothesis must be calculable, either exactly or approximately, which allowsp-values to be calculated. Atest statistic shares some of the same qualities of adescriptive statistic, and many statistics can be used as both test statistics and descriptive statistics. However, a test statistic is specifically intended for use in statistical testing, whereas the main quality of a descriptive statistic is that it is easily interpretable. Some informative descriptive statistics, such as thesample range, do not make good test statistics since it is difficult to determine their sampling distribution.
Two widely used test statistics are thet-statistic and theF-statistic.
Suppose the task is to test whether a coin isfair (i.e. has equal probabilities of producing a head or a tail). If the coin is flipped 100 times and the results are recorded, the raw data can be represented as a sequence of 100 heads and tails. If there is interest in themarginal probability of obtaining a tail, only the numberT out of the 100 flips that produced a tail needs to be recorded. ButT can also be used as a test statistic in one of two ways:
Using one of these sampling distributions, it is possible to compute either aone-tailed or two-tailed p-value for the null hypothesis that the coin is fair. The test statistic in this case reduces a set of 100 numbers to a single numerical summary that can be used for testing.
One-sample tests are appropriate when a sample is being compared to thepopulation from a hypothesis. The population characteristics are known from theory or are calculated from the population.
Two-sample tests are appropriate for comparing two samples, typically experimental and control samples from ascientifically controlled experiment.
Paired tests are appropriate for comparing two samples where it is impossible to control important variables. Rather than comparing two sets, members are paired between samples so the difference between the members becomes the sample. Typically the mean of the differences is then compared to zero. The common example scenario for when apaired difference test is appropriate is when a single set of test subjects has something applied to them and the test is intended to check for an effect.
Z-tests are appropriate for comparing means under stringent conditions regarding normality and a known standard deviation.
At-test is appropriate for comparing means under relaxed conditions (less is assumed).
Tests of proportions are analogous to tests of means (the 50% proportion).
Chi-squared tests use the same calculations and the same probability distribution for different applications:
F-tests (analysis of variance, ANOVA) are commonly used when deciding whether groupings of data by category are meaningful. If the variance of test scores of the left-handed in a class is much smaller than the variance of the whole class, then it may be useful to study lefties as a group. The null hypothesis is that two variances are the same – so the proposed grouping is not meaningful.
In the table below, the symbols used are defined at the bottom of the table. Many other tests can be found inother articles. Proofs exist that the test statistics are appropriate.[2]
| Name | Formula | Assumptions or notes | |||
|---|---|---|---|---|---|
| One-sample -test | (Normal populationorn large)and σ known. (z is the distance from the mean in relation to thestandard deviation of the mean). For non-normal distributions it is possible to calculate a minimum proportion of a population that falls withink standard deviations for anyk (see:Chebyshev's inequality). | ||||
| Two-sample z-test | Normal populationand independent observationsand σ1 and σ2 are known where is the value of under the null hypothesis | ||||
| One-samplet-test | (Normal populationorn large)and unknown | ||||
| Pairedt-test | (Normal population of differencesorn large)and unknown | ||||
| Two-sample pooledt-test, equal variances | (Normal populationsorn1 + n2 > 40)and independent observationsand σ1 = σ2 unknown | ||||
| Two-sample unpooledt-test, unequal variances (Welch'st-test) | (Normal populationsorn1 + n2 > 40)and independent observationsand σ1 ≠ σ2 both unknown | ||||
| One-proportion z-test | n · p0 > 10andn (1 − p0) > 10and it is a SRS (Simple Random Sample), seenotes. | ||||
| Two-proportion z-test, pooled for | n1p1 > 5andn1(1 − p1) > 5andn2p2 > 5andn2(1 − p2) > 5and independent observations, seenotes. | ||||
| Two-proportion z-test, unpooled for | n1p1 > 5andn1(1 − p1) > 5andn2p2 > 5andn2(1 − p2) > 5and independent observations, seenotes. | ||||
| Chi-squared test for variance | df = n-1 • Normal population | ||||
| Chi-squared test forgoodness of fit | df = k − 1 − # parameters estimated, and one of these must hold. • All expected counts are at least 5.[4]: 350 • All expected counts are > 1 and no more than 20% of expected counts are less than 5[5] | ||||
| Two-sample F test for equality of variances | Normal populations Arrange so and reject H0 for[6] | ||||
| Regressiont-test of | RejectH0 for[4]: 288 *Subtract 1 for intercept;k terms contain independent variables. | ||||
In general, the subscript 0 indicates a value taken from thenull hypothesis, H0, which should be used as much as possible in constructing its test statistic.... Definitions of other symbols:
| |||||