C22: The Method of Least Squares

CIS 2033 based onDekking et al. A Modern Introduction to Probability and Statistics. 2007

Instructor Longin Jan Latecki

22.1 – Least Squares Given is a bivariate dataset (x1, y1), …, (xn, yn), where x1, …, xn are nonrandom and Yi = α + βxi + Ui are random variables for i = 1, 2, . . ., n. The random variables U1, U2, …, Un have zero expectation and variance σ 2

Method of Least Squares: Choose a value for α and β such that

S(α,β)=( ) is minimal.

( yi−α−β x i)2

22.1 – RegressionThe observed value yi corresponding to xi and the value α+βxi on the

regression line y = α + βx.

22.1– Estimation

After some calculus magic, we get two equations to estimate α and β:

Method of Least Squares: Choose a value for α and β such that

S(α,β)=( ) is minimal.∑1

To find the least squares estimates, we differentiate S(α, β) with respect to α and β, and we set the derivatives equal to 0:

22.1– Estimation

After some simple algebraic rearranging, we obtain:

(slope)

(intercept)

Regression line y = 0.25 x –2.35 for points

22 ][E][E)(Var XXX

22.1– Least Square Estimators are Unbiased

The estimators for α and β are unbiased.

For the simple linear regression model, the random variable

is an unbiased estimator for σ2.

σ̂ 2= 1n−2∑i=1

(Y i−α̂−β̂ xi)2

22.2– ResidualsA way to explore whether the linear regression model is appropriate to model a given bivariate dataset is to inspect a scatter plot of the so-called residuals ri against the xi. The ith residual ri is defined as the vertical distance between the ith point and the estimated regression line:

We always have

22.2– Heteroscedasticity Homoscedasticity: The assumption of equal variance of

the Ui (and therefore Yi).In case the variance of Yi depends on the value of xi, wespeak of heteroscedasticity. For instance, heteroscedasticity occurs when Yi with a large expected value have a larger variance than those with small expected values. This produces a “fanning out” effect, which can be observed in the figure:

22.3– Relation with Maximum LikelihoodWhat are the maximum likelihood estimates for α and β?To apply the method of least squares no assumption is needed about the type of distribution of the Ui. In case the type of distribution of the Ui is known, the maximum likelihood principle can be applied. In particular, when the Ui are independent with an N(0, σ2) distribution.

Then Yi has an N (α + βxi, σ2) distribution, making the probability density function

When Yi are independent, and eachYi has an N(α+βxi, σ2) distribution, and assuming that the linear model is appropriate to model a given bivariate dataset, the residuals ri should look like the realization of a random sample from a normal distribution. An example is shown in the figure below:

22.3– Maximum Likelihood

For fixed σ >0 the loglikelihood l (α, β, σ) obtains the maximum when

is minimal. Hence, when random variables independent with a N(0,σ 2) distribution, the maximum likelihood principle and the least squares method return the same estimators.

The maximum likelihood estimator for σ 2 is:

σ̂ 2= 1n∑i=1

(Y i−α̂−β̂ xi)2

C22: The Method of Least Squares

Documents

Transcript of C22: The Method of Least Squares

Regression Estimation - Least Squares and Maximum … · 3.How to derive tests ... 1.Think of variance as con dence and bias as correctness. 1.1Intuitions (largely) apply 2.Sometimes

Vector Spaces, Orthogonality, and Linear Least Squares · 2020. 9. 4. · Week 10. Vector Spaces, Orthogonality, and Linear Least Squares 354 Homework 10.1.1.4 We notice that it would

3. Regression & Exponential Smoothinghpeng/Math4826/Chapter3.pdf · Discounted least squares/general exponential smoothing Xn t=1 w t[z t −f(t,β)]2 • Ordinary least squares:

C17+ C22+ - BiCefmedia.webshop.bicef.se/2019/06/Castellini_C17-C28plus_Snabbguide.pdf15 C17+ C22+ 10 11 12 9 3 IT Ref. Descrição do aparelho 3 Porta 9 Depósito de descarga 10 Depósito

Translation Synchronization via Truncated Least Squaresxrhuang/slides/TranSyncSpotlight_NIPS17.pdf · Translation Synchronization via Truncated Least Squares Xiangru Huang1* Zhenxiao

The Parametric Self-Dual Simplex Method...Parametric Self-Dual Simplex Method m+n number of pivots Data Least Squares Least Absolute Deviation A log{log plot of Tvs. m+ nand the L1

Least-squares finite element methods for the Poisson - People

4 squares questions

Nonlinear Regression, Nonlinear Least Squares, and ...change a nonlinear relationship into a linear one (in these data, replacing population by its cube-root linearizes the plot, as

Yuting Duan , Antoine Guitton, and Paul Sava Center for Wave …newton.mines.edu/paul/talks/2017_CWP_ElasticLSRTM.pdf · Elastic least-squares migration Yuting Duan , Antoine Guitton,

Wednesday, 08/04/2020 Antonis Argyrosusers.ics.forth.gr/~argyros/cs472_spring20/18_CV... · Least squares line fitting Data: (x1, y1), …, (xn, yn)Line equation: yi= mxi+ b Find

A Linear Least-Squares Solution to ... - cv-foundation.org · A Linear Least-Squares Solution to Elastic Shape-from-Template Abed Malti1 Adrien Bartoli2 Richard Hartley3 1 Fluminance/INRIA,

ADJUSTMENT COMPUTATIONS STATISTICS AND LEAST SQUARES IN SURVEYING AND GIS PAUL WOLF

A Linear Least-Squares Solution to Elastic Shape-From … · (6) where fx fy fz > are the components of the exter-nal body forces per unit volume. ...

Available online at · and then the solution was refined by the full matrix least-squares method using SHELXL-97.35 Non-hydrogen atoms were refined with anisotropic displacement

Greene, Econometric Analysis (7th ed, 2012)fm · 2012. 2. 15. · EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 9, 20: Generalized Least Squares,

Rosemary Renaut, Jodi Mead - Arizona State Universityrosie/mypresentations/cfgpres.pdf · 2008. 1. 2. · Regularization Parameter Estimation for Least Squares: A Newton method using

ADJUSTMENT COMPUTATIONS STATISTICS AND LEAST SQUARES IN SURVEYING AND GIS PAUL WOLF CHARLES D. GHILANI.

EE263 homework 4 solutions - web.stanford.edu homework 4 solutions 5.2 Complex linear algebra and least-squares. Most of the linear algebra you have seen is unchanged when the scalars,

Alternatives to Least-Squares - UBCcourses.ece.ubc.ca/574/ident2.pdfAlternatives to Least-Squares • Need a method that gives consistent estimates in presence of coloured noise •