BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability...

34
BFF Four: Are we Converging? Nancy Reid May 2, 2017

Transcript of BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability...

Page 1: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

BFF Four: Are we Converging?

Nancy Reid

May 2, 2017

Page 2: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Classical Approaches: A Look Way Back

Nature of Probability

BFF one to three: a look back

Comparisons

Are we getting there?

BFF Four Harvard, May 2017 2/34

Page 3: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Posterior Distribution Bayes 1763

Stigler 2013

BFF Four Harvard, May 2017 3/34

Page 4: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Posterior Distribution Bayes 1763

π(θ | y0) = f (y0; θ)π(θ)/m(y0)

probability distribution for θy0 is fixed

probability comes from π(θ)BFF Four Harvard, May 2017 4/34

Page 5: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Fiducial Probability Fisher 1930

df = − ∂

∂θF (T , θ)dθ

fiducial probability density for θ, givenstatistic T

probability comes from (dist’n of) T“It is not to be lightly supposed that men of the mental calibre of Laplace and

Gauss ... could fall into error on a question of prime theoretical importance,without an uncommonly good reason”

BFF Four Harvard, May 2017 5/34

Page 6: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Confidence Distribution Cox 1958; Efron 1993

• ”Much controversy has centred on the distinction between fiducial andconfidence estimation”

• “ ... The fiducial approach leads to a distribution for the unknown parameter”

• “... the method of confidence intervals, as usually formulated, gives only oneinterval at some preselected level of probability”

• “... in ... simple cases ... there seems no reason why we should not workwith confidence distributions for the unknown parameter

• “These can either be defined directly, or ... introduced in terms of the set ofall confidence intervals”

BFF Four Harvard, May 2017 6/34

Page 7: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Confidence Distribution Cox 1958; Efron 1993

• “assigns probability 0.05 to θ lying between the upper endpoints of the 0.90and 0.95 confidence intervals, etc.”

• “Of course this is logically incorrect, but it has powerful intuitive appeal”

• “... no nuisance parameters [this] is exactly Fisher’s fiducial distribution”Seidenfeld 1992; Zabell 1992

BFF Four Harvard, May 2017 7/34

Page 8: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Structural Probability Fraser 1966

• “a re-formulation of fiducial probability for transformation models”• “This transformation re-formulation leads to a frequency interpretation”• a change in the parameter value can be offset by a change in the sample

y → y + a; θ → θ − a• a local location version leads to:

df = − ∂

∂θF (y , θ)dθ = − ∂

∂θF (y , θ)

f (y0, θ)

f (y0, θ)=

Likelihood︷ ︸︸ ︷f (y0, θ)

dydθ

∣∣∣∣y0

BFF Four Harvard, May 2017 8/34

Page 9: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Significance Function Fraser 1991

0 2 4 6 8

−8−6

−4−2

0

ψ

Log

likel

ihoo

d

0 2 4 6 8

−3−2

−10

12

3

ψ

Pivo

t

• “from likelihood to significance”• “significance records probability left of the observed data point”

”likelihood records probability at the observed data point”• the significance function is a plot of this probability

as a function of θ• “the full spectrum of confidence intervals is obtained ...

suggesting the alternate name confidence distribution functionBFF Four Harvard, May 2017 9/34

Page 10: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Nature of Probability Cox 2006; R & Cox 2015; Zabell, 1992

• probability to describe physical haphazard variability aleatory• probabilities represent features of the “real” world

in somewhat idealized form• subject to empirical test and improvement• conclusions of statistical analysis expressed in terms of

interpretable parameters• enhanced understanding of the data generating process

• probability to describe the uncertainty of knowledge epistemic• measures rational, supposedly impersonal, degree of belief,

given relevant information Jeffreys

• measures a particular person’s degree of belief, subjecttypically to some constraints of self-consistency

Ramsey, de Finetti, Savage• often linked with personal decision making necessarily?

BFF Four Harvard, May 2017 10/34

Page 11: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

... nature of probability

• Bayes posterior describes uncertainty of knowledge• probability comes from the prior

• confidence intervals or p-values refer to empirical probabilities

• in what sense are confidence distribution functions,significance functions, structural or fiducial probabilities to beinterpreted?

• empirically? degree of belief?• literature is not very clear imho

• we may avoid the need for a different version of probability byappeal to a notion of calibration

BFF Four Harvard, May 2017 11/34

Page 12: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

BFF 1 – 4

• posterior distribution

• fiducial probability

• confidence distribution

• structural probability

• significance function

• belief functions

• objective Bayes

• generalized fiducial inference

• confidence distributions andconfidence curves

• approximate significancefunctions

• inferential models

BFF Four Harvard, May 2017 12/34

Page 13: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

What has changed?computation

BFF Four Harvard, May 2017 13/34

Page 14: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Objective Bayes Berger, BFF4, e.g.

• noninformative, default, matching, reference, ... priors

• we may avoid the need for a different version of probability byappeal to a notion of calibration

Cox 2006, R & Cox 2015

• as with other measuring deviceswithin this scheme of repetition, probability is defined as ahypothetical frequency

• it is unacceptable if a procedure yielding high-probabilityregions in some non-frequency sense are poorly calibrated

• such procedures, used repeatedly, give misleadingconclusions

Bayesian Analysis, V1(3) 2006

BFF Four Harvard, May 2017 14/34

Page 15: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

... objective Bayes

• pragmatic solution as a starting point

• some versions may not be correctly calibrated

• requires checking in each example

• calibrated versions must be targetted on the parameter ofinterest

• only in very special cases can calibration be achieved for morethan one parameter in the model, from the same prior

• the simplicity of a fully Bayesian approach to inference is lostGelman 2008; PPM LW

BFF Four Harvard, May 2017 15/34

Page 16: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Confidence Distribution Xie & Singh; Hjort & Schweder

• any function H : Y ×Θ→ (0,1) which is

• a cumulative distribution function of θ for any y ∈ Y

• has correct coverage: H(Y , θ) ∼ U(0,1) Y ∼ f (·; θ)

• CDs, or approximate CDs, are readily obtained from pivotalquantitites

• pivotal quantity: g(y , θ) with sampling distribution known√

n(y − µ)/s• sufficiently general to encompass bootstrap distribution and

many standard likelihood quantities• recent examples include robust meta-analysis and

identification of change-pointsXie et al. 2011; Cunen et al. 2017; Hannig & Xie 2012

BFF Four Harvard, May 2017 16/34

Page 17: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

... confidence distribution

-1 0 1 2 3 4

0.0

0.2

0.4

0.6

0.8

1.0

µ

confidence

-1 0 1 2 3 4

0.5

0.6

0.7

0.8

0.9

1.0

µ

cc

-1 0 1 2 3 4

0.0

0.2

0.4

0.6

0.8

1.0

µ

significance

-1 0 1 2 3 4

-4-2

02

4

µ

pivot

BFF Four Harvard, May 2017 17/34

Page 18: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Generalized Fiducial Hannig et al 2016

• Fisher: g(Y , θ) has a known distribution; invert this to createdistribution for θ when y0 is obtained• Fraser: use data-generating equation to make the inversion

more direct, e.g. Yi = µ+ ei , ei = y0i − µ

• Hannig et al. Y = G(U, θ), U has known distribution• suppose we can invert this for any y0: θ = Qy0(U)

• fiducial distribution of θ is Qy0(U∗) U∗ independent copy of U

• inverse only exists if θ and Y have same dimension• might get this by reduction of a sample to sufficient statistics• more generally, fiducial density takes the form

r(θ; y0) ∝ f (y0, θ)J(y0, θ)

BFF Four Harvard, May 2017 18/34

Page 19: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Significance Function Fraser & R, ...

• focus on parameter of interestmany arguments point to the need for this

• y ∈ Rn, θ ∈ Rp, ψ ∈ R

• focus on dimension reduction

• n ↓ p using an approximation location model

df = − ∂

∂θF (y , θ)dθ = − ∂

∂θF (y , θ)

f (y0, θ)

f (y0, θ)=

Likelihood︷ ︸︸ ︷f (y0, θ)

dydθ

∣∣∣∣y0

• p ↓ 1 using a tangent exponential model

• combination of structural model and likelihood asymptotics

• leads for example to construction of default priorsFraser et al 2010

BFF Four Harvard, May 2017 19/34

Page 20: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Inferential Models Martin & Liu

• Hannig et al. Y = G(U, θ), U has known distribution• suppose we can invert this for any y0: θ = Qy0(U)

• fiducial distribution of θ is Qy0(U∗) U∗ independent copy of U

• use a random set S to predict U

• this random set is converted to a belief function about θ

• need to ensure the belief function is valid and efficient

• valid = calibrated ?

BFF Four Harvard, May 2017 20/34

Page 21: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Comparisons: conditioning

• objective Bayes

• generalized fiducial inference

• confidence distributions andconfidence curves

• approximate significance functions

• inferential models

• yes

• yes and no JASA ’16

• needs to be built inahead of time

• yes; via approximatelocation model

• needs to be built inahead of time

BFF Four Harvard, May 2017 21/34

Page 22: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Comparisons: Eliminating Nuisance Parameters

• objective Bayes

• generalized fiducial inference

• confidence distributions andconfidence curves

• approximate significance functions

• inferential models

• marginalizationrarely works ...

• depends on the problem?

• use profilelog-likelihood, or similar

focus parameter

• marginalizationvia Laplace approximation

• marginalizationinvoked ahead of time

BFF Four Harvard, May 2017 22/34

Page 23: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Comparisons: Calibration

• objective Bayes

• generalized fiducial inference

• confidence distributions andconfidence curves

• approximate significance functions

• inferential models

• often

• yes

• typically approximate

• typically approximate

• yes

BFF Four Harvard, May 2017 23/34

Page 24: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Comparisons: Nature of Probability

• Bayes / objective Bayes

• generalized fiducial inference

• confidence distributions andconfidence curves

• approximate significance functions

• inferential models

• epistemic / empirical

• empirical

• empiricalbut not prescriptive

• empirical

• ?epistemic?

BFF Four Harvard, May 2017 24/34

Page 25: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

What’s the end goal?

• Applications – something that works• gives ‘sensible’ answers• not too sensitive to model assumptions• computable in reasonable time• provides interpretable parameters

• Foundations – peeling back the layers• what does ’works’ mean?• what probability do we mean• ‘Goldilocks’ conditioning Meng & Liu, 2016• how does this impact applied work?

BFF Four Harvard, May 2017 25/34

Page 26: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Role of Foundations Cox & R, 2015

• avoid apparent discoveries based on spurious patterns

• to shed light on the structure of the problem

• calibrated inferences about interpretable parameters

• realistic assessment of precision

• understanding when/why methods work/fail

BFF Four Harvard, May 2017 26/34

Page 27: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Well, are we?objective Bayes

confidence distributions

generalized fiducial

inferential models

significance functions

Larry W: “the perpetual motion machineof Bayesian inference”

Min-ge, Regina: “everything fits”Nils: “CDs are the ‘gold standard’ ”

Jan: “bring it on ... I’ll figure it out”

Ryan: “it’s the only solution”Chuanhai: “ it might take 100 years”

Don: “it’s the best solution ...you can’t solve everything at once”

BFF Four Harvard, May 2017 27/34

Page 28: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Some warning signs

BFF Four Harvard, May 2017 28/34

Page 29: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Some warning signs

BFF Four Harvard, May 2017 29/34

Page 30: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Some warning signs

BFF Four Harvard, May 2017 30/34

Page 31: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Some warning signs

“deep neural networks easily fit random labels”

“these observations rule out ... possible explanations for thegeneralization performance of state-of-the-art neural networks.”

BFF Four Harvard, May 2017 31/34

Page 33: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

Summary

• Bayes, fiducial, structural, confidence, belief

• BFF 1 - 4: Develop links to bridge gaps among differentstatistical paradigms

• targetting parameters• limit distributions• calibration in repeated sampling• relevant repetitions for the data at hand

NR: Why is conditional inference so hard?DRC: I expect we’re all missing something, but I don’t knowwhat it is

StatSci Interview 1996

BFF Four Harvard, May 2017 33/34

Page 34: BFF Four: Are we Converging? · BFF 1 – 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective

ReferencesCox, D.R. (1958). Ann. Math. Statist.Cox, D.R. (2006). Principles of Statistical Inference.Cunen et al. (2017). J. Statist. Plann. Infer.Efron, B. (1993). BiometrikaFisher, R.A. (1930). Proc. Cam. Phil. Soc.Fraser, D.A.S. (1966). BiometrikaFraser, D.A.S. (1991). J. Amer. Statist. Assoc.Fraser, D.A.S. et al. (2010). J. R. Statist. Soc. BFraser, D.A.S. and Reid, N. (1993). Statist. SinicaGelman, A. (2008). Ann. Appl. Statist.Hannig, J. and Xie, M. (2012). Elect. J. Statist.Hannig, J. et al. (2016). J. Amer. Statist. Assoc.Hjort, N. and Schweder, T. Confidence, Likelihood, Probability:Inference with Confidence DistributionsMeng, X.-L. and Liu, K. (2016). Ann. Rev. Stat. and its Applic.Norman, S. et al. (2016). Ann. Rev. Stat. and its Applic.Reid, N. and Cox, D.R. (2015). Intern. Statist. Rev.Stein, C. (1959). Ann. Math. Statist.Stigler, S. (2013). Statistical ScienceXie, M. and Singh, K. (2013) Internat. Statist. Rev.Xie, M. et al. (2011). J. Amer. Statist. Assoc.Zabell, X. (1992). Statistical Science

BFF Four Harvard, May 2017 34/34