
Lecture 18

Statistics---the linear model, multivariate style

Let’s consider a more general linear model:

Yi = β0 + β1X1i + β2X2i + . . . + βkXki + εi,   i = 1, . . . , n

This is a job for matrix notation!

Let Xi = (X0i, . . . , Xki)   a 1x(k+1) (row) vector (X0i == 1)
Let β = (β0, β1, . . . , βk)T   a (k+1)x1 (column) vector

Statistics---the linear model, multivariate style

So we have:

Yi = Xiβ + εi,   i = 1, . . . , n

But we can go further:

Let Y = (Y1, . . . , Yn)T   an nx1 (column) vector
Let ε = (ε1, . . . , εn)T   an nx1 (column) vector
Let X =   X01  .  .  .  Xk1
          X02  .  .  .  Xk2
           .            .
          X0n  .  .  .  Xkn     an nx(k+1) matrix (X0i == 1)
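To make the stacked notation concrete, here is a small R sketch with made-up data that builds the design matrix X by hand, with a leading column of 1s, and checks that it matches what model.matrix() produces from a regression formula.

```r
# Simulated data: n observations and k = 2 regressors (all names illustrative)
set.seed(1)
n  <- 100
x1 <- rnorm(n)
x2 <- rnorm(n)
y  <- 1 + 2 * x1 - 0.5 * x2 + rnorm(n)

# Design matrix X: the first column is all 1s (the X0i == 1 column)
X <- cbind(1, x1, x2)
dim(X)                              # n x (k+1), i.e., 100 x 3

# model.matrix() builds the same matrix from a regression formula
X2 <- model.matrix(y ~ x1 + x2)
all(X == X2)                        # TRUE
```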

Statistics---the linear model, multivariate style

So we have:

Y = Xβ + ε, where Y is nx1, X is nx(k+1), β is (k+1)x1, and ε is nx1

Assumptions:
i) identification: n > k+1, and X has full column rank k+1 (i.e., the regressors are linearly independent, i.e., XTX is invertible)
ii) error behavior: E(ε) = 0, E(εεT) (= Cov(ε)) = σ2In (stronger version: ε ~ N(0, σ2In)), where In is the nxn identity matrix

Let’s take a closer look at these assumptions.

Statistics---the linear model, multivariate style

i) n > k+1, X has full column rank k+1 (i.e., the regressors are linearly independent, i.e., XTX is invertible)---what does this mean?

---need to have more observations than regressors
---can’t have any regressors that do not have positive sample variation
---can’t have any regressors that are linear functions of one or more other regressors
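A quick, hedged R check of the identification condition on a simulated design matrix (names made up): more rows than columns, and full column rank, so that XTX can be inverted.

```r
set.seed(2)
n <- 50
X <- cbind(1, rnorm(n), rnorm(n))    # an n x (k+1) design matrix with k = 2

n > ncol(X)                          # TRUE: more observations than regressors (plus intercept)
qr(X)$rank == ncol(X)                # TRUE: full column rank, so t(X) %*% X is invertible
```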

Statistics---the linear model, multivariate style

---can’t have any regressors that are linear functions of one or more other regressors

Imagine a case where we want to estimate the effect of schooling, work experience, and age on salary, so we estimate the following model on individual-level data:

Yi = β0 + β1X1i + β2X2i + . . . + βkXki + εi

Yi    salary
X1i   years of schooling
X2i   years of work experience
X3i   age

It could be that X1i + X2i + 6 = X3i for all i in our sample.
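Here is a hedged R sketch of exactly this situation on made-up data: once age is an exact linear function of schooling and experience, X loses a column of rank and XTX is singular, so the usual OLS formula breaks down.

```r
set.seed(3)
n      <- 200
school <- sample(8:20, n, replace = TRUE)
exper  <- sample(0:30, n, replace = TRUE)
age    <- school + exper + 6            # exact linear dependence among the regressors

X <- cbind(1, school, exper, age)
qr(X)$rank                              # 3, not 4: X does not have full column rank
qr(t(X) %*% X)$rank                     # also 3: XTX is singular, so it has no inverse
# solve(t(X) %*% X) would throw an error here
```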

Statistics---the linear model, multivariate style

---can’t have any regressors that are linear functions of one or more other regressors

Actually, researchers most often run afoul of this assumption when using dummy variables to indicate, say, observations falling into an exhaustive and mutually exclusive set of classes. Suppose each observation in your data set of pets is either a dog, cat, or fish. You cannot create and include a dummy variable for each type of pet, because together they add up to a column of 1s, which is perfectly collinear with the first column of the X matrix. You need to omit one of them. R will tell you if you make this mistake.
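A hedged R sketch of this dummy-variable trap with made-up pet data: hand lm() all three dummies plus an intercept and it flags the perfect collinearity by reporting NA for one of the coefficients; dropping one dummy fixes it.

```r
set.seed(4)
n   <- 90
pet <- sample(c("dog", "cat", "fish"), n, replace = TRUE)
y   <- 2 + 1 * (pet == "dog") - 1 * (pet == "fish") + rnorm(n)

# One dummy per category -- together they sum to a column of 1s
dog  <- as.numeric(pet == "dog")
cat_ <- as.numeric(pet == "cat")    # "cat_" just avoids clashing with the cat() function
fish <- as.numeric(pet == "fish")

coef(lm(y ~ dog + cat_ + fish))     # one coefficient comes back NA
coef(lm(y ~ dog + cat_))            # the fix: omit one of the dummies
```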

Statistics---the linear model, multivariate style

ii) E(ε) = 0, E(εεT) = σ2In---what does this mean?

ε = (ε1, . . . , εn)T is nx1, so εεT is nxn.

E(εεT) =   E(ε1ε1)   .  .   E(ε1εn)         Var(ε1)      .  .   Cov(ε1,εn)
              .              .          =       .                   .
           E(εnε1)   .  .   E(εnεn)         Cov(εn,ε1)   .  .   Var(εn)

        =   σ2   0   . . .   0
            0    σ2  . . .   0     =   σ2In
            0    0   . . .   σ2

(The first equality holds because E(ε) = 0: with zero means, E(εiεj) = Cov(εi, εj).)

This is called the variance-covariance matrix of ε---we denote it Cov(ε).
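As a sanity check, this hedged R sketch simulates many draws of an error vector with ε ~ N(0, σ2In) and verifies that the sample variance-covariance matrix of the draws is close to σ2 times the identity.

```r
set.seed(5)
n      <- 4            # small n so the matrix is easy to read
sigma2 <- 2
draws  <- 10000

# Each row of E is one draw of the n x 1 error vector
E <- matrix(rnorm(draws * n, mean = 0, sd = sqrt(sigma2)), nrow = draws)

round(cov(E), 2)       # roughly sigma2 * diag(n): ~2 on the diagonal, ~0 elsewhere
```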

Statistics---the linear model, multivariate style

Linear model:

Y = Xβ + ε, where Y is nx1, X is nx(k+1), β is (k+1)x1, and ε is nx1

What is β̂? Well, it is the vector that minimizes the sum of squared errors, i.e., it minimizes (Y - Xβ)T(Y - Xβ).

So, take the derivative w.r.t. β and set it equal to zero to obtain the normal equations XTXβ̂ = XTY. Then solve for β̂:

β̂ = (XTX)-1XTY   if (XTX) is invertible.

Wow, beautiful.
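A hedged R sketch on simulated data that computes β̂ = (XTX)-1XTY directly from the formula and confirms it matches the coefficients reported by lm().

```r
set.seed(6)
n  <- 200
x1 <- rnorm(n)
x2 <- rnorm(n)
y  <- 1 + 2 * x1 - 0.5 * x2 + rnorm(n)

X <- cbind(1, x1, x2)                        # n x (k+1) design matrix
beta_hat <- solve(t(X) %*% X, t(X) %*% y)    # (XTX)^{-1} XTY, solved without forming the inverse explicitly
cbind(beta_hat, coef(lm(y ~ x1 + x2)))       # the two columns agree
```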

Statistics---the linear model, multivariate style

Linear model:

Y = Xβ + ε, where Y is nx1, X is nx(k+1), β is (k+1)x1, and ε is nx1

What do we want to know about β̂? Its distribution!

E(β̂) = β (Treat the Xs as fixed; they come outside of the expectation operator, and it’s easy to show.)

Cov(β̂) = σ2(XTX)-1 (Again, not too hard to show if you treat the Xs as fixed---details in the notes on the website.)

And, by the central limit theorem, the β̂s are approximately normally distributed in large samples. And if the errors are normally distributed, they’re also (exactly) normal.
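A hedged R sketch on simulated data: build Cov(β̂) = σ̂2(XTX)-1 by hand, using the usual unbiased estimate of σ2, and check it against vcov(), which returns the estimated variance-covariance matrix of the coefficients from a fitted lm object.

```r
set.seed(6)
n  <- 200
x1 <- rnorm(n)
x2 <- rnorm(n)
y  <- 1 + 2 * x1 - 0.5 * x2 + rnorm(n)

fit <- lm(y ~ x1 + x2)
X   <- model.matrix(fit)                            # n x (k+1)
k   <- ncol(X) - 1
sigma2_hat <- sum(residuals(fit)^2) / (n - (k + 1)) # usual estimate of sigma^2

V_hand <- sigma2_hat * solve(t(X) %*% X)            # estimated Cov(beta-hat) by the formula
all.equal(unname(V_hand), unname(vcov(fit)))        # TRUE
```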

Statistics---inference in the linear model

Now, finally, we get to inference. Typically, we will want to test hypotheses involving the βs. (The βs are the parameters in our conditional mean function of our outcome variable Y, and the questions we want to answer are usually about the nature of this conditional mean function.)

Sometimes we are only interested in one of the βs. Other times we might want to simultaneously test hypotheses about several of them.

As we saw in the output I showed you earlier, statistical packages typically perform some standard tests for us, but there may be other ones we need to do ourselves.

Statistics---inference in the linear model

Let’s start with a pretty general framework for testing hypotheses about β. It’s not only quite general and flexible, it’s also super intuitive.

Statistics---inference in the linear model

Let’s consider hypotheses of the following form:

H0: Rβ = c
HA: Rβ ≠ c

R is an rx(k+1) matrix of restrictions. (If r = 1, then we are just testing one restriction, such as β1 = 0.)

Almost any hypothesis involving β you can dream up in the context of the linear model can be captured in this framework. You can test whether individual parameters are equal to zero. You can test whether individual parameters are equal to something other than zero. You can test multiple hypotheses simultaneously. You can test hypotheses about linear combinations of parameters. The world is your oyster.

If, for instance, R = (0  1  0  .  .  0) and c = 0, that corresponds to H0: β1 = 0.

If, for instance,

R =   0  1  0  .  .  0        and   c =   0
      0  0  1  .  .  0                    0
      .  .  .  .  .  .                    .
      0  0  0  .  .  1                    0

that corresponds to H0: β1 = β2 = . . . = βk = 0. Here we’re testing k hypotheses simultaneously.

Or if

R =   0  1  -1  .  .  0        and   c =    0
      0  0   0  1  .  0                     5
      0  0   0  .  .  1                    -2

that corresponds to H0: β1 = β2, β3 = 5, and βk = -2.
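As a concrete, made-up illustration in the same spirit, here is how one might write an R matrix and c vector in R code for H0: β1 = β2 and β3 = 5 in a model with k = 3 regressors, and evaluate the discrepancy Rβ̂ - c for a fitted model; under the null it should be close to zero.

```r
set.seed(7)
n  <- 300
x1 <- rnorm(n); x2 <- rnorm(n); x3 <- rnorm(n)
y  <- 1 + 2 * x1 + 2 * x2 + 5 * x3 + rnorm(n)   # true coefficients satisfy the null

beta_hat <- coef(lm(y ~ x1 + x2 + x3))          # (beta0, beta1, beta2, beta3)

# H0: beta1 = beta2 and beta3 = 5, written as R beta = c with r = 2 restrictions
R     <- rbind(c(0, 1, -1, 0),
               c(0, 0,  0, 1))
c_vec <- c(0, 5)

R %*% beta_hat - c_vec                          # both entries close to 0 under H0
```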

Statistics---inference in the linear model

One thing you cannot do in this framework is test one-sided hypotheses. (We’ll get back to those.)

Statistics---inference in the linear model

We have a super intuitive and cool way to test these hypotheses. (First, think of the null as describing a set of restrictions on the model.)

1. We estimate the unrestricted model.
2. We impose the restrictions of the null and estimate that model.
3. We compare the goodness-of-fit of the two models. If the restrictions don’t really affect the fit of the model much, then the null is probably true or close to true, so we do not want to reject it. If the restrictions really bind, then we do want to reject the null.

Statistics---inference in the linear model

Estimating the unrestricted model should be simple---just run the regression. But how do we estimate the restricted model?

If the restriction is that certain βs = 0, then leave the regressors corresponding to those βs out of the restricted model.

If the restriction is that, say, two βs are equal, create a new regressor, which is the sum of the regressors corresponding to those βs, and include that sum in the restricted model in place of the original regressors.

What if the restriction is that some β = c?
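A hedged R sketch (simulated data) of the first two cases just described: the restriction that a β equals zero is imposed by dropping the regressor, and the restriction that two βs are equal is imposed by replacing the two regressors with their sum.

```r
set.seed(8)
n  <- 300
x1 <- rnorm(n); x2 <- rnorm(n); x3 <- rnorm(n)
y  <- 1 + 2 * x1 + 2 * x2 - 1 * x3 + rnorm(n)

unrestricted <- lm(y ~ x1 + x2 + x3)

# Restricted model under H0: beta3 = 0 -- just drop x3
restricted_zero  <- lm(y ~ x1 + x2)

# Restricted model under H0: beta1 = beta2 -- replace x1 and x2 with their sum
restricted_equal <- lm(y ~ I(x1 + x2) + x3)
```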

Statistics---inference in the linear model

This is an F-test. (We’ve mentioned a special case of the F-test before. This is a more general formulation.)

T = ((SSRR - SSRU)/r) / (SSRU/(n - (k+1)))

where SSRR and SSRU are the sums of squared residuals from the restricted and unrestricted models. T ~ Fr,n-(k+1) under the null, and we reject the null for large values of the test statistic.

(Why an F distribution? Well, the reason goes back to one of the facts I told you about special distributions a couple of weeks ago. The ratio of two independent χ2 random variables, each divided by its degrees of freedom, is distributed F.)
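A hedged R sketch tying the pieces together on simulated data: fit the unrestricted and restricted models, form the F statistic from the two sums of squared residuals, and check it against anova(), which performs the same nested-model F-test.

```r
set.seed(9)
n  <- 300
x1 <- rnorm(n); x2 <- rnorm(n); x3 <- rnorm(n)
y  <- 1 + 2 * x1 + 0 * x2 + 0 * x3 + rnorm(n)

unrestricted <- lm(y ~ x1 + x2 + x3)
restricted   <- lm(y ~ x1)               # imposes H0: beta2 = beta3 = 0, so r = 2

SSR_U <- sum(residuals(unrestricted)^2)
SSR_R <- sum(residuals(restricted)^2)
r <- 2
k <- 3

F_stat <- ((SSR_R - SSR_U) / r) / (SSR_U / (n - (k + 1)))
F_stat
pf(F_stat, df1 = r, df2 = n - (k + 1), lower.tail = FALSE)   # p-value for the F-test

anova(restricted, unrestricted)          # reports the same F statistic and p-value
```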