Aspects of Likelihood Inferencefisher.utstat.toronto.edu/reid/sta3000y/notes-from-basel...I neglects...

Aspects of Likelihood Inference

Nancy Reid

October 16, 2013

Models and likelihoodI Model for the probability distribution of y given xI Density f (y | x) with respect to, e.g., Lebesgue measureI Parameters for the density f (y | x ; θ), θ = (θ1, . . . , θd)

I Likelihood function L(θ; y0) ∝ f (y0; θ)

I often θ = (ψ, λ)I θ could have very large dimension, d > n

typically y = (y1, . . . , yn)I θ could have infinite dimension E(y | x) = θ(x) ‘smooth’,

in principle

Aspects of Likelihood Inference Basel, 2013 2

Why likelihood?I makes probability modelling centralI emphasizes the inverse problem of reasoning

from y0 to θ or f (·)I suggested by Fisher as a measure of plausibility

Royall, 1994L(θ̂)/L(θ) ∈ (1,3) very plausible;L(θ̂)/L(θ) ∈ (3,10) implausible;L(θ̂)/L(θ) ∈ (10,∞) very implausible

I converts a ‘prior’ probability π(θ) to a posterior π(θ | y) viaBayes’ Theorem

I provides a conventional set of summary quantities forinference based on properties of the postulated model


Widely used


... widely used


... widely used

National Post, Toronto, Jan 30 2008


... why likelihood?I likelihood function depends on data only through sufficient

statisticsI “likelihood map is sufficient” Fraser & Naderi, 2006

I gives exact inference in transformation modelsI “likelihood function as pivotal” Hinkley, 1980

I provides summary statistics with known limiting distributionI leading to approximate pivotal functions, based on normal

distributionI likelihood function + sample space derivative gives better

approximate inference


Nancy

Derived quantities

16 17 18 19 20 21 22 23

−4

−3

−2

−1

0

log−likelihood function

θθ

log−

likel

ihoo

d

θθθθ

θθ −− θθ

1.92 w/2

I maximum likelihood estimatorθ̂ = arg supθ log L(θ; y)= arg supθ`(θ; y)

I observed Fisher informationj(θ̂) = −∂2`(θ)/∂θ2

I efficient score function`′(θ) = ∂`(θ; y)/∂θ

`′(θ̂) = 0 assuming enough regularity

I `′(θ; y) =∑n

i=1 log fYi (yi ; θ), y1, . . . , yn independent


Nancy

Nancy

Approximate pivots scalar parameter of interestI profile log-likelihood `p(ψ) = `(ψ, λ̂ψ)I θ = (ψ, λ); λ̂ψ constrained maximum likelihood estimator

re(ψ; y) = (ψ̂ − ψ)j1/2p (ψ̂).∼ N(0,1)

r(ψ; y) = ±√[2{`p(ψ̂)− `p(ψ)}]

.∼ N(0,1)

πm(ψ | y).∼ N{ψ̂, j−1/2p (ψ̂)}

jp(ψ) = −`′′p(ψ); profile information

16 17 18 19 20 21 22 23

−4

−3

−2

−1

0

log−likelihood function

θθ

log−

likel

ihoo

d

θθθθ

θθ −− θθ

1.92 w/2


Nancy

Nancy

... approximate pivots scalar parameter of interest


Nancy

Nancy

Nancy

... approximate pivots scalar parameter of interestI profile log-likelihood `p(ψ) = `(ψ, λ̂ψ)I θ = (ψ, λ); λ̂ψ constrained maximum likelihood estimator

re(ψ; y) = (ψ̂ − ψ)j1/2p (ψ̂).∼ N(0,1)

r(ψ; y) = ±√[2{`p(ψ̂)− `p(ψ)}]

.∼ N(0,1)

πm(ψ | y).∼ N{ψ̂, j−1/2p (ψ̂)}

r∗(ψ; y) = r(ψ) +1

r(ψ)log{

QF (ψ)r(ψ)

}.∼ N(0,1)

r∗B(ψ; y) = r(ψ) +1

r(ψ)log{

QB(ψ)r(ψ)

}.∼ N(0,1)


Nancy

The problem with profilingI `p(ψ) = `(ψ, λ̂ψ) used as a ‘regular’ likelihood, with the

usual asymptoticsI neglects errors in the estimation of the nuisance parameterI can be very large when there are many nuisance

parameters

I example: normal theory linear regression σ̂2 = RSS/nusual estimator RSS/(n − k) k the number of regressioncoefficients

I badly biased if k large relative to nI inconsistent for σ2 if k →∞ with n fixedI example fitting of smooth functions with large numbers of

spline coefficients


Conditional and marginal likelihoods

f (y ;ψ, λ) ∝ f1(s | t ;ψ)f2(t ;λ)

I L(ψ, λ) ∝ Lc(ψ)Lm(λ), where L1 and L2 are genuinelikelihoods, i.e. proportional to genuine density functions

I Lp(ψ) is a conditional likelihood Lc(ψ), and estimation of λhas no impact on asymptotic properties

I s is conditionally sufficient , t is marginally ancillary, for ψ

I hardly ever get so luckyI but might expect something like this to hold approximately,

which it does, and this is implemented in r∗F formulaautomatically Brazzale, Davison, R 2007


Introduction

Aspects of Likelihood Inferencefisher.utstat.toronto.edu/reid/sta3000y/notes-from-basel...I neglects...

Documents

Transcript of Aspects of Likelihood Inferencefisher.utstat.toronto.edu/reid/sta3000y/notes-from-basel...I neglects...

Lax Milgram Theorem - Universität Basel

ΑΛΚΗΣ ΠΙΕΡΡΑΚΟΣ (1920-2017) - NTUA...2017/03/29 · σπούδασε στην Ελβετία (Gewerbeschule, Basel, 1949-1951), και στο Λονδίνο (Slade School

withCristiano Varin and thanks to Don Fraser, Grace Yi ...fisher.utstat.toronto.edu/reid/research/ucl-reid.pdf · Composite Likelihood Nancy Reid November 22, 2012 withCristiano Varin

JAJ NMR and Systematic Errors

Sarah Basel Obada Froukh Dena Kofahi Nader Alaridah...2 | P a g e Ϋ In this sheet we will discuss parasitic infections of the GIT. › Three protozoal (unicellular) infections: 1.

Errors in Hypothesis Testing

Alan W. Reid Rice University ICM, Rio de Janeiro, August 2018ar99/ProfiniteRio2.pdfFunar shows that for any m 2 there exist m torus bundles admitting SOLV geometry whose fundamental

DEECET-2019...DEECET-2019 Physical Science - TAMIL 1. The important objective of giving dictation in Primary classes is 1. to correct errors in reading 2. to correct errors in writing

Methodenentwicklung für die simultane Bestimmung und ... · Kantonales Laboratorium Basel-Landschaft, 4410 Liestal ZUSAMMENFASSUNG In der heutigen Nutztierhaltung sind β -Lactam-Antibiotika

Τυχαία vs. συστηµατικά σφάλµατα (random vs. systematic errors)users.uoi.gr/hyepilab/assets/pdfs/hygieneII/ErrorsPresentation.pdf · Test-retest Repeatability

PowerPoint Presentationstatic.barik.net/barik/proposal/barik_proposal_slides.pdf · Help users recognize, diagnose, and reccn.'er from errors Network Nodes Links Layout Display Specials

Forecasting - Rob J. HyndmanOutline 1Regression with ARIMA errors 2Stochastic and deterministic trends 3Periodic seasonality 4Dynamic regression models Forecasting: Principles and

Propagation of errors - Instituto Tecnológico de Aeronáutica · Physics 2BL 7 Histograms and Distributions different values number of occurrences weighted sum – each value of

Composite Likelihood - University of Torontofisher.utstat.toronto.edu/reid/research/reid-brown13.pdf · Applications: spatial and space-time data I conditional approaches seem more

Threshold Dependence of Soft-Errors induced by particles ... · Threshold Dependence of Soft-Errors induced by particles and Heavy Ions on Flip Flops in a 65 nm Thin BOX FDSOI Mitsunori

Chemistry 163B Thermochemistry Chapter 4 Engel & … · Chemistry 163B Thermochemistry ... topics for thermochemistry, parts of Ch. 4 Engel & Reid HW#3 16,17,18 HW#3 16 ... • physical

Resonance Corrections at J-PARC MR€¦ · • J-PARC MR has a three fold symmetry. • Non structure resonances would be caused by magnet errors. • SAD simulation with magnet errors,

temperature-limiting safety components sensor elements or ... · temperature-limiting safety components sensor elements or sensors Changes and errors excepted ... your specialist

Deductive Search for Errors in Free Data Type Speci cations using … · 2008. 7. 18. · 1 Subject frame: free data type speci cations goal: support for detecting errors in speci

615 Inborn errors affecting vitamin B6 metabolism - PND Association