In Bayesian statistics, a credible interval is an interval used to characterize a probability distribution. It is defined such that an unobserved parameter value has a particular probability γ to fall within it. For example, in an experiment that determines the distribution of possible values of the parameter μ, if the probability that μ lies between 35 and 45 is 0.95, then 35 ≤ μ ≤ 45 is a 95% credible interval.
Credible intervals are typically used to characterize posterior probability distributions or predictive probability distributions.[1] Their generalization to disconnected or multivariate sets is called a credible set or credible region.
Credible intervals are a Bayesian analog to confidence intervals in frequentist statistics.[2] The two concepts arise from different philosophies:[3] Bayesian intervals treat their bounds as fixed and the estimated parameter as a random variable, whereas frequentist confidence intervals treat their bounds as random variables and the parameter as a fixed value. Also, Bayesian credible intervals use (and indeed, require) knowledge of the situation-specific prior distribution, while frequentist confidence intervals do not.
Credible sets are not unique, as any given probability distribution has an infinite number of γ-credible sets, i.e. sets of probability γ. For example, in the univariate case, there are multiple definitions for a suitable interval or set:
The smallest credible interval (SCI), sometimes also called the highest density interval. This interval necessarily contains the median whenever γ ≥ 0.5. When the distribution is unimodal, this interval also contains the mode.
The smallest credible set (SCS), sometimes also called the highest density region. For a multimodal distribution, this is not necessarily an interval, as it can be disconnected. This set always contains the mode.
A quantile-based credible interval, which is computed by taking an inter-quantile interval for some predefined γ. For instance, the median credible interval (MCI) of probability γ is the interval for which the probability of being below the interval is as likely as being above it, that is to say the interval [q_{(1−γ)/2}, q_{(1+γ)/2}]. It is sometimes also called the equal-tailed interval, and it always contains the median. Other quantile-based credible intervals can be defined, such as the lowest credible interval (LCI), which is [q_0, q_γ], or the highest credible interval (HCI), which is [q_{1−γ}, q_1]. These intervals may be better suited for bounded variables.
One may also define an interval for which the mean is the central point, assuming that the mean exists.
γ-smallest credible sets (γ-SCS) can easily be generalized to the multivariate case, where they are bounded by probability density contour lines.[4] They always contain the mode, but not necessarily the mean, the coordinate-wise median, or the geometric median.
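As an illustrative sketch (not from the source, and with a hypothetical posterior), the smallest credible interval can be approximated from posterior draws by scanning every interval that contains a γ fraction of the sorted samples and keeping the narrowest one:

```python
import random

random.seed(0)
# Hypothetical skewed posterior: 50,000 draws from an Exponential(1)
# distribution, where the SCI differs visibly from the equal-tailed interval.
draws = sorted(random.expovariate(1.0) for _ in range(50_000))

def smallest_credible_interval(sorted_draws, gamma=0.95):
    """Among all intervals covering a gamma fraction of the sorted draws,
    return the narrowest one (an empirical highest density interval)."""
    n = len(sorted_draws)
    k = int(gamma * n)  # number of draws the interval must cover
    start = min(range(n - k), key=lambda i: sorted_draws[i + k] - sorted_draws[i])
    return sorted_draws[start], sorted_draws[start + k]

lo, hi = smallest_credible_interval(draws)
print(f"95% smallest credible interval: [{lo:.3f}, {hi:.3f}]")
```

Because the exponential density is strictly decreasing, the interval hugs zero on the left, illustrating that the SCI always contains the mode.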
Credible intervals can also be estimated through the use of simulation techniques such as Markov chain Monte Carlo.[5]
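As a hedged sketch of this simulation approach (the Normal(40, 2.5) posterior and all numbers are hypothetical stand-ins for real MCMC output), an equal-tailed credible interval can be read directly off sorted posterior draws:

```python
import random

random.seed(42)
# Pretend an MCMC sampler returned 100,000 draws from a Normal(40, 2.5)
# posterior for the parameter of interest (hypothetical values).
draws = sorted(random.gauss(40.0, 2.5) for _ in range(100_000))

def equal_tailed_interval(sorted_draws, gamma=0.95):
    """Equal-tailed (median) credible interval from sorted posterior draws:
    cut off (1 - gamma)/2 probability mass in each tail."""
    n = len(sorted_draws)
    lo = sorted_draws[int(n * (1 - gamma) / 2)]
    hi = sorted_draws[int(n * (1 + gamma) / 2) - 1]
    return lo, hi

lo, hi = equal_tailed_interval(draws)
print(f"95% equal-tailed credible interval: [{lo:.2f}, {hi:.2f}]")
```

With enough draws this converges to the analytic interval 40 ± 1.96 × 2.5, i.e. roughly [35.1, 44.9].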
A frequentist 95% confidence interval means that with a large number of repeated samples, 95% of such calculated confidence intervals would include the true value of the parameter. In frequentist terms, the parameter is fixed (it cannot be considered to have a distribution of possible values) and the confidence interval is random (as it depends on the random sample).
Bayesian credible intervals differ from frequentist confidence intervals by two major aspects:
Credible intervals are intervals whose values have a (posterior) probability density, representing the plausibility that the parameter has those values, whereas confidence intervals regard the population parameter as fixed and therefore not the object of probability. Within confidence intervals, "confidence" refers to the randomness of the confidence interval itself under repeated trials, whereas credible intervals quantify the uncertainty of the target parameter given the data at hand.
Credible intervals and confidence intervals treat nuisance parameters in radically different ways.
For the case of a single parameter and data that can be summarised in a single sufficient statistic, it can be shown that the credible interval and the confidence interval coincide if the unknown parameter is a location parameter (i.e. the forward probability function has the form p(x|μ) = f(x − μ)), with a prior that is a uniform flat distribution;[6] and also if the unknown parameter is a scale parameter (i.e. the forward probability function has the form p(x|s) = f(x/s)/s), with a Jeffreys' prior p(s) ∝ 1/s[6], the latter following because taking the logarithm of such a scale parameter turns it into a location parameter with a uniform distribution. But these are distinctly special (albeit important) cases; in general no such equivalence can be made.
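The location-parameter case can be checked numerically. In this sketch (all numbers hypothetical), the flat-prior posterior for the mean μ of a Normal with known σ is Normal(x̄, σ/√n), so the 95% credible interval x̄ ± 1.96 σ/√n is algebraically identical to the frequentist confidence interval, and repeated sampling shows it attains ~95% frequentist coverage:

```python
import math
import random

random.seed(1)

mu_true, sigma, n = 40.0, 2.5, 25  # hypothetical true location and known scale
z = 1.959963984540054              # 97.5% standard-normal quantile
trials = 2_000
covered = 0

for _ in range(trials):
    xbar = sum(random.gauss(mu_true, sigma) for _ in range(n)) / n
    half = z * sigma / math.sqrt(n)
    # Under a flat prior the posterior is Normal(xbar, sigma/sqrt(n)), so the
    # 95% equal-tailed credible interval is xbar +/- half, which is exactly
    # the frequentist 95% confidence interval for this model.
    if xbar - half <= mu_true <= xbar + half:
        covered += 1

coverage = covered / trials
print(f"Empirical frequentist coverage of the credible interval: {coverage:.3f}")
```

The observed coverage lands near 0.95, illustrating the coincidence; for models without this location/scale structure no such agreement is guaranteed.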
^ O'Hagan, A. (1994). Kendall's Advanced Theory of Statistics, Vol. 2B: Bayesian Inference, Section 2.51. Arnold. ISBN 0-340-52922-9.
^ Chen, Ming-Hui; Shao, Qi-Man (1 March 1999). "Monte Carlo Estimation of Bayesian Credible and HPD Intervals". Journal of Computational and Graphical Statistics. 8 (1): 69–92. doi:10.1080/10618600.1999.10474802.
^ a b Jaynes, E. T. (1976). "Confidence Intervals vs Bayesian Intervals", in Foundations of Probability Theory, Statistical Inference, and Statistical Theories of Science (W. L. Harper and C. A. Hooker, eds.). Dordrecht: D. Reidel. pp. 175 et seq.
Bolstad, William M.; Curran, James M. (2016). "Comparing Bayesian and Frequentist Inferences for Mean". Introduction to Bayesian Statistics (Third ed.). John Wiley & Sons. pp. 237–253. ISBN 978-1-118-09156-2.