Bootstrap/Bootstrapping (Statistics)
Bootstrap/Bootstrapping (Statistics)
Bootstrap/Bootstrapping
- is used to estimate a parameter η of the distribution of a sample statistic 𝜃ˆ, via Monte Carlo simulations when it is too difficult to do it analytically
- is a statistical procedure that resamples a single dataset to create many simulated samples. This process allows for the calculation of standard errors, confidence intervals, and hypothesis testing
Why Use Bootstrap
Bootstrap Terminology
- bootstrap sample - is a random sample drawn with replacement from the observed sample 𝑆 of the same size as 𝑆
- bootstrap distribution - is the distribution of a statistic across a set of bootstrap samples
- bootstrap estimator - is an estimator that is computed on basis of bootstrap samples
Bootstrap Algorithm
to estimate parameter η of the distribution of 𝜃ˆ:
- first obtain a set of 𝑏 bootstrap samples {𝐵1, ..., 𝐵𝑏}. There are 2 ways, either:
- consider all possible bootstrap samples drawn with replacement from the given sample 𝑆 (often intractable)
- generate a large number 𝑏 of random bootstrap samples drawn with replacement from the given sample 𝑆
- for each bootstrap sample 𝐵𝑖 compute statistic 𝜃ˆ*𝑖 the same way 𝜃ˆ was computed from the original sample 𝑆
- estimate the parameter η of this bootstrap distribution {𝜃ˆ*𝑖, ..., 𝜃ˆ*𝑏}
Parametric Bootstrap vs Nonparametric Bootstrap
Subpages
Resources
, multiple selections available,