Bootstrap Confidence Intervals - ST552 Lecture 13 · Motivation...

Bootstrap Confidence IntervalsST552 Lecture 13

Charlotte Wickham2019-02-11

Motivation

The inferences we’ve covered so far relied on our assumption ofNormal errors:

ε ∼ N(0, σ2In×n)

For example, we’ve seen under this assumption, the least squaresestimates are also Normally distributed:

β ∼ N(β, σ2

)−1)

If the errors aren’t truly Normally distributed, whatdistribution do the estimates have?

Warm-up: Your Turn

Imagine the errors are in fact t3 distributed?

With your neighbour: design a simulation to understand thedistribution of the least squares estimates.

Example: 1. Fix n, fix X

5 10 15 20

Example: 1. Fix β, find y

5 10 15 20

Example: 2. Simulate errors, find y

5 10 15 20

Example: 3. Find least squares line

5 10 15 20

Example: 4. Repeat #2. and #3. many times

5 10 15 20

Example: Examine distribution of estimates

−5 0 5 10

Estimate of intercept

0.00 0.25 0.50 0.75

Estimate of slope

Example: Compared to theory

−5 0 5 10

Estimate of intercept

0.00 0.25 0.50 0.75

Estimate of slope

When the errors aren’t Normal: CLT

Think of our estimates like linear combinations of the errors. I.e. asort of average of i.i.d random variables.Some version of the Central Limit Theorem will apply.

For large samples, even when the errors aren’t Normal,

β∼N(β, σ2(XT X )−1)

Summary so far

If we knew the error distribution and true parameters we could usesimulation to understand the sampling distribution the leastsquares estimates.

Simulation can also be used to demonstrate the CLT at work inregression.

Bootstrap confidence intervals

In practice, with data in front of us, we don’t know the distributionof the errors (nor the true parameter values).

The bootstrap is one approach to estimate the samplingdistribution of β, by using the simulation idea, and substituting inour best guesses for the things we don’t know.

Bootstrapping regression

(Model based resampling)

0. Fit model and find estimates, β, and residuals, ei

1. Fix X ,2. For k = 1, . . . ,B

i. Generate errors, ε∗i sampled with replacement from ei

ii. Construct y , using the model, y = y + ε∗

iii. Use least squares to find β∗(k)

3. Examine the distribution of β∗ and compare to β

One confidence interval for βj is the 2.5% and 97.5% quantiles ofthe distribution of β∗

j .(Known as the Percentile method, there are other (better?)methods).

Example: Faraway Galapagos Islands

(I’ll illustrate with simple linear regression, Faraway does multiplecase in 3.6)

−200

0 500 1000 1500

Elevation

Observed data

Bootstrap: 1. Find β, y , and ei .

−200

0 500 1000 1500

Elevation

Observed data

Bootstrap: Using fixed X , ˆbeta from observed data

−200

0 500 1000 1500

Elevation

Observed data

Bootstrap: 2. Resample residuals to construct bootstrappedresponse

−200

0 500 1000 1500

Elevation

Bootstrapped data

Bootstrap: 3. Fit regression model to bootstrapped response

−200

0 500 1000 1500

Elevation

Bootstrapped data

Bootstrap: 3. Repeat #2. and #3. many times

−200

0 500 1000 1500

Elevation

Examine distribution of estimates

−50 0 50

Bootstrap estimates of intercept

0.10 0.15 0.20 0.25 0.30 0.35

Bootstrap estimates of slope

High level: bootstrap idea

We don’t know the distribution of the errors, but our best guess isprobably the empirical c.d.f on the residuals.

Sampling from a random variable with a c.d.f. defined as theempirical c.d.f. of the residuals, boils down to sampling withreplacement from residuals.

Limitations

We might rely on bootstrap confidence intervals when we areworried about the assumption of Normal errors. But, there arelimitations.

• We still rely on the assumption that the errors areindependent and identically distributed.

• Generally scaled residuals are used (residuals don’t have thesame variance, more later)

• An alternative bootstrap resamples the (yi , xi1, . . . , xip)vectors, i.e. resamples the rows of the data, a.k.a resamplingcases bootstrap.

Bootstrap Confidence Intervals - ST552 Lecture 13 · Motivation...

Documents

Transcript of Bootstrap Confidence Intervals - ST552 Lecture 13 · Motivation...

ΑΝΑΔΙΑΜΟΡΦΩΣΗ ΔΕΞΙΑΣ ΚΟΙΛΙΑΣstatic.livemedia.gr/livemedia/documents/al18173_us75_20160514090539_avgeropoulou.pdfhazard with bootstrap Cl adjustment model

Stephen Mildenhall Bayesian-Bootstrap Loss Development CAS DFA Seminar Chicago, July 1999.

Better Bootstrap Con dence Intervals · 2012. 6. 1. · Edgeworth and Cornish-Fisher expansions will come in handy. Consider the quantity S n = p n(^ ) ˙: Under a certain class of

Module 14: Confidence Intervals

More on Confidence Intervalswlandau.github.io/stat305/lectures/ch6part2/ch6part2.pdf · More on Con dence Intervals Will Landau n 25, ˙ unknown n

Bootstrap and Resampling Methodsluke/classes/STAT7400/...Bootstrap and Resampling Methods Often we have an estimator T of a parameter q and want to know its sampling properties –

Two Geometric Representations of Confidence Intervals … · Two Geometric Representations of Confidence Intervals for Ratios of Linear Combinations of Regression Parameters: ...

Inference Based on the Wild Bootstrap - Carleton University€¦ · Bootstrap methods are sometimes used to estimate standard errors and covariance matrices. ... Might be useful for

STAT 135 Lab 2 Confidence Intervals, MLE and the Delta Method · STAT 135 Lab 2 Con dence Intervals, MLE and the Delta Method Rebecca Barter February 2, 2015. Con dence Intervals.

Joe Felsenstein Department of Genome Sciences and Department …evolution.gs.washington.edu/gs541/2004/lecture27.pdf · 2007. 2. 26. · Bootstrap sample #1 Bootstrap sample #2 Estimate

Inference and Confidence Intervals. Outline Inferring a population mean: Constructing confidence intervals Examining the difference between two means.

Métodos de reamostragem - Unicampcnaber/aula_Boot.pdf · 2012. 6. 8. · Bootstrap n~ao-param etrico: n~ao conhecemos a distribui˘c~ao dos dados F e simulamos valores a partir da

4.3 Confidence Intervals -Using our CLM assumptions, we can construct CONFIDENCE INTERVALS or CONFIDENCE INTERVAL ESTIMATES of the form: -Given a significance.

Confidence Intervals for Proportions - My Math Mantra

Confidence Intervals - Kasetsart University

Computational Geometry Complexity Notionsovid.cs.depaul.edu/Classes/CSC543-S09/ComputationalGeometry.pdf · 5/12/2009 6 Interval Intersection Input: a set of n intervals, an interval

07 - Bootstrap and Splines › SYS6018 › lectures › 07-bootstrap.pdf · 07 - Bootstrap and Splines SYS 6018 | Fall 2019 6/17 1 #-- Bootstrap Distribution 2 M =2000 # number of

Chapter 9 Conﬁdence Intervals - lagrange.math.siu.edu

Bootstrap of residual processes in regression: to smooth ... › pdf › 1712.02685.pdf · estimator, Koul and Lahiri (1994) and Neumeyer (2008, 2009) proposed bootstrap procedures,

Bootstrap Methods for Time Series: A Selective Overview ...politis/TALKS/slidesCyprusBOOT2004.pdf · Bootstrap distribution is automatically centered correctly. • Circular Block