REML Estimation of Variance Components - ISU Public …dnett/S511/26REML.pdf · 27 library(nlme)...

REML Estimation ofVariance Components

Consider the General Linear Model

y = Xβ + ε, where ε ∼ N(0,Σ)

and Σ is an n× n positive definite variance matrix thatdepends on unknown parameters that are organizedin a vector γ.

In the previous set of slides, we consideredmaximum likelihood (ML) estimation of theparameter vectors β and γ.

We saw by example that the MLE of the variancecomponent vector γ can be biased.

Example of MLE Bias

For the case of ε = σ2I, where γ = σ2, the MLE of σ2

is(y− Xβ)′(y− Xβ)

nwith expectation

n− rn

This is MLE for σ2 is often criticized for “failing toaccount for the loss of degrees of freedom needed toestimate β.”

[(y− Xβ)′(y− Xβ)

n− rn

< σ2 = E[

(y− Xβ)′(y− Xβ)

A Familiar Special Case

y1, . . . , yni.i.d.∼ N(µ, σ2)

E[∑n

i=1(yi − µ)2

]= σ2 but

E[∑n

i=1(yi − y)2

n− 1n

σ2 < σ2.

REML is an approach that produces unbiasedestimators for these special cases and producesless biased estimates than ML in general.

Depending on whom you ask, REML stands forREsidual Maximun Likelihood or REstrictedMaximum Likelihood.

The REML Method

1 Find n− rank(X) = n− r linearly independentvectors a1, . . . , an−r such that a′iX = 0′ for alli = 1, . . . , n− r.

2 Find the maximum likelihood estimate of γ usingw1 ≡ a′1y, . . . ,wn−r ≡ a′n−ry as data.

A = [a1, . . . , an−r] w =

wn−r

a′1y...

a′n−ry

= A′y

If a′X = 0′, a′y is known as an error contrast.

Thus, w1, . . . ,wn−r comprise a set of n− r errorcontrasts.

Because

(I − PX)X = X − PXX = X − X = 0,

the elements of

(I − PX)y = y− PXy = y− y

are each error contrasts.

Because rank(I − PX) = n− r, there exists a set of n− rlinearly independent rows of I − PX that can be used in step1 of the REML method to get a1, . . . , an−r.

If we do use a subset of rows of I− PX to get a1, . . . , an−r; theerror contrasts

w1 = a′1y, . . . ,wn−r = a′n−ry

will be a subset of the elements of the residual vector

(I − PX)y = y− y.

This is why it makes sense to call the procedure ResidualMaximum Likelihood.

Note thatw = A′y

= A′(Xβ + ε)= A′Xβ + A′ε= 0 + A′ε= A′ε

w = A′ε ∼ N(A′0, A′ΣA)d= N(0, A′ΣA),

and the distribution of w depends on γ but not β.

The log likelihood function in this case is

`(γ|w) = −12

log |A′ΣA|−12

w′(A′ΣA)−1w−n− r2

log(2π).

An MLE for γ, say γ, can be found in the generalcase using numerical methods to obtain the REMLestimate of γ.

In 6ll, we take the time to prove that every set of n− rlinearly independent error contrasts yields the sameREML estimator of γ.

As an example, consider the special case where

y1, . . . , yni.i.d.∼ N(µ, σ2).

Then X = 1, β = µ, and Σ = σ2I.

It follows that

a′1 = (1,−1, 0, 0, . . . , 0)a′2 = (0, 1,−1, 0, . . . , 0)

...an−1 = (0, 0, . . . , 0, 1,−1)

b′1 = (1, 0, 0, . . . , 0,−1)b′2 = (0, 1, 0, . . . , 0,−1)

...bn−1 = (0, 0, . . . , 0, 1,−1)

are each a set of n− r = n− 1 linear independentvectors that can be used to form error contrasts.

Either

a′1ya′2y...

a′n−1y

y1 − y2

y2 − y3...

yn−1 − yn

or v =

b′1yb′2y...

b′n−1y

y1 − yn

y2 − yn...

yn−1 − yn

could be used to obtain the same REML estimator of σ2.

For the normal theory Gauss-Markov linear model,

y = Xβ + ε, ε ∼ N(0, σ2I),

the REML estimator of σ2 is

σ2 =y′(I − PX)y

n− r,

the unbiased estimator that we used previously.

For linear mixed effects models, the REML estimatorsof variance components produce the same estimatesas the unbiased ANOVA-based estimators formed bytaking appropriate linear combinations of meansquares when the latter are positive and data arebalanced.

In any case, once a REML estimate of γ (and thus Σ)has been obtained, the BLUE of an estimable Cβ canbe approximated by

CβΣ = C(X′Σ−1

X)−X′Σ−1

where Σ is Σ with γ (the REML estimate of γ) inplace of γ.

d=read.delim( "http://www.public.iastate.edu/~dnett/S511/SeedlingDryWeight2.txt") d Genotype Tray Seedling SeedlingWeight 1 A 1 1 8 2 A 1 2 9 3 A 1 3 11 4 A 1 4 12 5 A 1 5 10 6 A 2 1 17 7 A 2 2 17 8 A 2 3 16 9 A 2 4 15 10 A 2 5 19 11 A 2 6 18 12 A 2 7 18 13 A 2 8 18 14 A 2 9 24 15 A 3 1 12 16 A 3 2 12 17 A 3 3 16

18 A 3 4 15 19 A 3 5 15 20 A 3 6 14 21 A 4 1 17 22 A 4 2 20 23 A 4 3 20 24 A 4 4 19 25 A 4 5 19 26 A 4 6 18 27 A 4 7 20 28 A 4 8 19 29 A 4 9 19 30 B 5 1 9 31 B 5 2 12 32 B 5 3 13 33 B 5 4 16 34 B 5 5 14 35 B 5 6 14 36 B 6 1 10 37 B 6 2 10 38 B 6 3 9

39 B 6 4 8 40 B 6 5 13 41 B 6 6 9 42 B 6 7 11 43 B 7 1 12 44 B 7 2 16 45 B 7 3 17 46 B 7 4 15 47 B 7 5 15 48 B 7 6 15 49 B 8 1 9 50 B 8 2 6 51 B 8 3 8 52 B 8 4 8 53 B 8 5 13 54 B 8 6 9 55 B 8 7 9 56 B 8 8 10

plot(d[,2],d[,4]+rnorm(56,0,.2), xlab="Tray",ylab="Seedling Dry Weight", col=2*(1+(d[,1]=="B")),pch="-",cex=2) legend("topright",c("Genotype A","Genotype B"), fill=c(2,4),border=c(2,4))

library(nlme) lme(SeedlingWeight~Genotype,random=~1|Tray, method="ML",data=d) Linear mixed-effects model fit by maximum likelihood Data: d Log-likelihood: -126.3709 Fixed: SeedlingWeight ~ Genotype (Intercept) GenotypeB 15.301832 -3.567017 Random effects: Formula: ~1 | Tray (Intercept) Residual StdDev: 2.932294 1.882470 Number of Observations: 56 Number of Groups: 8

library(lme4) lmer(SeedlingWeight~Genotype+(1|Tray),REML=F,data=d) Linear mixed model fit by maximum likelihood Formula: SeedlingWeight ~ Genotype + (1 | Tray) Data: d AIC BIC logLik deviance REMLdev 260.7 268.8 -126.4 252.7 247.4 Random effects: Groups Name Variance Std.Dev. Tray (Intercept) 8.5984 2.9323 Residual 3.5437 1.8825 Number of obs: 56, groups: Tray, 8

Fixed effects: Estimate Std. Error t value (Intercept) 15.302 1.510 10.14 GenotypeB -3.567 2.136 -1.67 Correlation of Fixed Effects: (Intr) GenotypeB -0.707

lme(SeedlingWeight~Genotype,random=~1|Tray,data=d) Linear mixed-effects model fit by REML Data: d Log-restricted-likelihood: -123.5705 Fixed: SeedlingWeight ~ Genotype (Intercept) GenotypeB 15.288838 -3.550201 Random effects: Formula: ~1 | Tray (Intercept) Residual StdDev: 3.414856 1.882230 Number of Observations: 56 Number of Groups: 8

lmer(SeedlingWeight~Genotype+(1|Tray),data=d) Linear mixed model fit by REML Formula: SeedlingWeight ~ Genotype + (1 | Tray) Data: d AIC BIC logLik deviance REMLdev 255.1 263.2 -123.6 253 247.1 Random effects: Groups Name Variance Std.Dev. Tray (Intercept) 11.6612 3.4149 Residual 3.5428 1.8822 Number of obs: 56, groups: Tray, 8 Fixed effects: Estimate Std. Error t value (Intercept) 15.289 1.745 8.762 GenotypeB -3.550 2.468 -1.438 Correlation of Fixed Effects: (Intr) GenotypeB -0.707

REML Estimation of Variance Components - ISU Public …dnett/S511/26REML.pdf · 27 library(nlme)...

Documents

Transcript of REML Estimation of Variance Components - ISU Public …dnett/S511/26REML.pdf · 27 library(nlme)...

Kelas A Kelas B - · PDF fileKuliah 7- 1 Tahap Ouput dan Penguat Daya Isu penting untuk penguat daya selain penguatan (daya), resistansi input dan resistansi output distorsi amplituda

Package Tray '08 xB IssB - Sparks Toyota and Scion Parts - …toyotaparts.sparkstoyota.com/install/64009-12011inst.pdf · · 2009-06-16Subwoofer Enclosure Recommended Tools Safety

Questions - Body Clock and enquire about ‘An Easy Guide to TENS Pain Relief’ by Dr Gordon Gadsby. Th isu nt e dp c al r gEMC ne ds tob ial pu v c g thE MCinf orma p …

DU GENOTYPE AU PHENOTYPE.TRADUCTION

TRUST · 4081.12 4231.SO L 12.5 x W 7.5 x H 4 cm (0.1L) 12pcs white L 13 x W 7 x H 3.5 cm (0.08L) Glass tray with bowls Set consist of : glass tray, 3China bowls 5135.PB L 32 x W

Original Article A novel genotype screening and ... · the rapid developing of molecular technique, to ... chloric acid and diethyl ether according to acid ether centrifugal sedimentation

Study of plasma carnitine ester profiles and genotype ...aok.pte.hu/hu/egyseg/phd_dolgozatok/2010/docs/phd/... · Study of plasma carnitine ester profiles and genotype analysis in

MY08 xB Package Tray IssD - Toyota Parts - Genuine OEM … · · 2017-03-11SCION xB 2008 - PACKAGE TRAY Preparation Issue: D 09/30/14 Page 1 of 11 pages Part Number: 64009-12010

tray 1 rollers - Product Support and Drivers – Xeroxdownload.support.xerox.com/.../pt_BR/tray_1_rollers.pdfRodillos de alimentación de Bandeja 1 • Rodets d'alimentació de la

β-LACTAMASE-MEDIATED RESISTANCE TO ...eprints.qut.edu.au/77835/1/Tegan_Harris_Thesis.pdfβ-lactamase-mediated resistance to antimicrobials: the relationship between genotype and phenotype

BS-380 - Diasystem · BS-380 Chemistry Analyzer Multi-functional sample tray 75 sample positions Can be programmed into 5 virtual sample disks Primary tubes and various sample cups

Effect of APOE ε4 genotype on amyloid-β and tau ...

Intelligent Cable Management Systems - BICSI PullingCable Pulling • Watch edggyes of conduit, tray and sleeves • Steady as it goes – don’t be a jerk • Maximum Pulling Tension:

Multiparameter models - Jarad Niemi · Multiparameter models Dr. Jarad Niemi STAT 544 - Iowa State University February 12, 2019 Jarad Niemi (STAT544@ISU) Multiparameter models February

Thermal analysis heat capacity - ISU Sites

FIREPROOFING MATERIALS AND SYSTEMS - G-SiGN · PDF fileΘ Repsol Química Puertollano, Spain: Cable tray (2010-2011) ... Θ API 2218 Testing, Standards: UL 1709, ASTM E1529 & E1725,

MIRRORS CARE - A l'Est d'Eden€¦ · driftwood tray HEAVY AGED CHARCOAL MIRROR #20450 SMALL 48x48x4cm - 19"x19"x1,6" mirror tray 62 Ω tray designs #20301 SMALL 48x48x4cm - 19"x19"x1,6"

G gene polymorphism and production in β-thalassemia major ... · IL-10 levels among thalassemia patients that related with genotype production . The study appear that mean serum

Risk Management - Barnes Insurance Agency · A patient with wild-type genotype does not have a defect in the apolipoprotein E ... Anti-ADHD Agents Guanfacine (Intuniv) Amphetamine

Maximum Arc Digraph with a Given Zero Forcing Number · Combinatorial Matrix Theory (ISU) October 2013 6 / 20 Thezero forcing number, Z(), is the minimum number of vertices that need