hidden pixel

Credible Interval Information

In Bayesian statistics, a credible interval is a posterior probability interval[1] which is used for interval estimation in contrast to point estimation. Credible intervals are used for purposes similar to those of confidence intervals in frequentist statistics and an alternative terminology is to use Bayesian confidence interval instead of "credible interval".[2] When dealing with more than one unknown quantity simultaneously, the term credible region is used.

For example, a statement such as "following the experiment, a 90% credible interval for the parameter t is 35-45" means that the posterior probability that t lies in the interval from 35 to 45 is 0.9.

There are several ways of defining a credible interval from a given probability distribution for the parameter. Examples include:

  • choosing the narrowest interval, which for a unimodal distribution will involve choosing those values of highest probability density including the mode.
  • choosing the interval where the probability of being below the interval is as likely as being above it; the interval will include the median.
  • choosing the interval which has the mean as its central point, assuming the mean exists.

It is possible to frame the choice of a credible interval within decision theory and, in that context, an optimal interval will always be a highest probability density set.[3]

Distinction between a Bayesian credible interval and a frequentist confidence interval

A frequentist 90% confidence interval of 35–45 means that with a large number of repeated samples, 90% of the calculated confidence intervals would include the true value of the parameter. The probability that the parameter is inside the given interval (say, 35–45) is either 0 or 1 (the non-random unknown parameter is either there or not). In frequentist terms, the parameter is fixed (cannot be considered to have a distribution of possible values) and the confidence interval is random (as it depends on the random sample). Antelman (1997, p. 375) summarizes a confidence interval as "... one interval generated by a procedure that will give correct intervals 95 % [resp. 90 %] of the time". [4]

In general, Bayesian credible intervals do not coincide with frequentist confidence intervals for two reasons:

  • credible intervals incorporate problem-specific contextual information from the prior distribution whereas confidence intervals are based only on the data;
  • credible intervals and confidence intervals treat nuisance parameters in radically different ways.

For the case of a single parameter and data that can be summarised in a single sufficient statistic, it can be shown that the credible interval and the confidence interval will coincide if the unknown parameter is a location parameter (i.e. the forward probability function has the form Pr(x | μ) = f(x − μ) ), with a prior that is a uniform flat distribution;[5] and also if the unknown parameter is a scale parameter (i.e. the forward probability function has the form Pr(x | s) = f(x / s) ), with a Jeffreys' prior [5] -- the latter following because taking the logarithm of such a scale parameter turns it into a location parameter with a uniform distribution. But these are distinctly special (albeit important) cases; in general no such equivalence can be made.

References

  1. ^ Edwards, W., Lindman, H., Savage, L.J. (1963) "Bayesian statistical inference in statistical research". Psychological Research, 70, 193-242
  2. ^ Lee, P.M. (1997) Bayesian Statistics: An Introduction, Arnold. ISBN 0-340-67785-6
  3. ^ O'Hagan, A. (1994) Kendall's Advance Theory of Statistics, Vol 2B, Bayesian Inference, Section 2.51. Arnold, ISBN 0-340-52922-9
  4. ^ Antelman, G. (1997) Elementary Bayesian Statistics (Madansky, A. & McCulloch, R. eds.). Cheltenham, UK: Edward Elgar ISBN 978-1-85898-504-6
  5. ^ a b Jaynes, E. T. (1976). "Confidence Intervals vs Bayesian Intervals", in Foundations of Probability Theory, Statistical Inference, and Statistical Theories of Science, (W. L. Harper and C. A. Hooker, eds.), Dordrecht: D. Reidel, pp. 175 et seq
Statistics
Descriptive statistics
Continuous data
Location Mean (Arithmetic, Geometric, Harmonic) · Median · Mode
Dispersion Range · Standard deviation · Coefficient of variation · Percentile · Interquartile range
Shape Variance · Skewness · Kurtosis · Moments · L-moments
Count data Index of dispersion
Summary tables Grouped data · Frequency distribution · Contingency table
Dependence Pearson product-moment correlation · Rank correlation (Spearman's rho, Kendall's tau) · Partial correlation · Scatter plot
Statistical graphics Bar chart · Biplot · Box plot · Control chart · Correlogram · Forest plot · Histogram · Q-Q plot · Run chart · Scatter plot · Stemplot · Radar chart
Data collection
Designing studies Effect size · Standard error · Statistical power · Sample size determination
Survey methodology Sampling · Stratified sampling · Opinion poll · Questionnaire
Controlled experiment Design of experiments · Randomized experiment · Random assignment · Replication · Blocking · Regression discontinuity · Optimal design
Uncontrolled studies Natural experiment · Quasi-experiment · Observational study
Statistical inference
Bayesian inference Bayesian probability · Prior · Posterior · Credible interval · Bayes factor · Bayesian estimator · Maximum posterior estimator
Frequentist inference Confidence interval · Hypothesis testing · Sampling distribution · Meta-analysis
Specific tests Z-test (normal) · Student's t-test · F-test · Chi-square test · Pearson's chi-square · Wald test · Mann–Whitney U · Shapiro–Wilk · Signed-rank · Likelihood-ratio
General estimation Mean-unbiased · Median-unbiased · Maximum likelihood · Method of moments · Minimum distance · Maximum spacing · Density estimation
Correlation and regression analysis
Correlation Pearson product-moment correlation · Partial correlation · Confounding variable · Coefficient of determination
Regression analysis Errors and residuals · Regression model validation · Mixed effects models · Simultaneous equations models
Linear regression Simple linear regression · Ordinary least squares · General linear model · Bayesian regression
Non-standard predictors Nonlinear regression · Nonparametric · Semiparametric · Isotonic · Robust
Generalized linear model Exponential families · Logistic (Bernoulli) · Binomial · Poisson
Formal analyses Analysis of variance (ANOVA) · Analysis of covariance · Multivariate ANOVA
Data analyses and models for other specific data types
Multivariate statistics Multivariate regression · Principal components · Factor analysis · Cluster analysis · Copulas
Time series analysis Decomposition · Trend estimation · Box–Jenkins · ARMA models · Spectral density estimation
Survival analysis Survival function · Kaplan–Meier · Logrank test · Failure rate · Proportional hazards models · Accelerated failure time model
Categorical data McNemar's test · Cohen's kappa
Applications
Engineering statistics Methods engineering · Probabilistic design · Process & Quality control · Reliability · System identification
Environmental statistics Geostatistics · Climatology
Medical statistics Epidemiology · Clinical trial · Clinical study design
Social statistics Actuarial science · Population · Demography · Census · Psychometrics · Official statistics · Crime statistics
Category · Portal · Outline · Index

Categories: Bayesian statistics | Statistical inference | Intervals

 

The above information uses material from Wikipedia and is licensed under the GNU Free Documentation License.
Some facts may not have been fully verified for accuracy. [Disclaimers]
This page was last archived by our server on Thu Aug 4 20:41:02 2011.
Displaying this page or its contents does not use any Wikimedia Foundation's resources.
The owners of this site proudly support the Wikimedia Foundation.