CHAPTER 6 Statistical Inference & Hypothesis Testing

• 6.1 - One Sample

Mean μ, Variance σ 2, Proportion π

• 6.2 - Two Samples Means, Variances, Proportions μ1 vs. μ2 σ1

2 vs. σ22 π1 vs. π2

• 6.3 - Multiple Samples Means, Variances, Proportions μ1, …, μk σ1

2, …, σk2 π1, …, πk

CHAPTER 6 Statistical Inference & Hypothesis Testing

• 6.1 - One Sample

Mean μ, Variance σ 2, Proportion π

• 6.2 - Two Samples Means, Variances, Proportions μ1 vs. μ2 σ1

2 vs. σ22 π1 vs. π2

• 6.3 - Multiple Samples Means, Variances, Proportions μ1, …, μk σ1

2, …, σk2 π1, …, πk

CHAPTER 6 Statistical Inference & Hypothesis Testing

s2 = SS/df2 2

1 1 2 2

( 1) ( 1)2pooled 2

n s n sn ns

(5 1)( ) (3 1)( )2pooled 5 3 2s

788.5 1663 1080

2 2(593 ) (520 )22 3 1s

546 546 1663

593 525 5202 3y 546

Example: Y = “$ Cost of a certain medical service”

• Data: Sample 1 = {667, 653, 614, 612, 604}; n1 = 5 Sample 2 = {593, 525, 520}; n2 = 3

667 653 614 612 6041 5y 630

• Analysis via T-test (if equivariance holds): Point estimates /iy y n1 2y y 84

NOTE:> 0

Assume Y is known to be normally distributed at each of k = 2 health care facilities (“groups”).

Clinic: Y2 ~ N(μ2, σ2) Hospital: Y1 ~ N(μ1, σ1) • Null Hypothesis H0: μ1 = μ2, i.e., μ1 – μ2 = 0 (“No difference exists.") 2-sided test at significance level α = .05

“Group Means”

2 2(667 ) (604 )21 5 1s

630 630 788.5“Group Variances”

1663788.5 2.11 4F

Pooled Variance

The pooled variance is a weighted average of the group variances, using the degrees of freedom as the weights.

SS1 SS2

2 2(593 546) (520 546)22 3 1s

16632 2(667 630) (604 63021 5 1s

) 788.5

p-value =

SSErr = 64802 2

1 1 2 2

( 1) ( 1)2pooled 2

n s n sn ns

788.(5 1)( ) (3 1)( )2pooled 5 3 2

5 1663s 1080

2 2(593 ) (520 )22 3

546 4s 1663

593 525 5202 3y 546

667 653 614 612 6041 5y 630

• Analysis via T-test (if equivariance holds): Point estimates /iy y n1 2y y 84

NOTE:> 0

“Group Means”

2 2(667 ) (604 )21 5 1s

630 630 788.5“Group Variances”

1663788.5 2.11 4F

Pooled Variance

s2 = SS/df

2 2(593 546) (520 546)22 3 1s

dfErr = 6

Standard Error

20 pooled

1 1s.e. sn n

01 1s.e.5 3

1080 24

2 2(667 630) (604 63021 5 1s

) 788.5

01 22 ( ) 2 2P Y Y P T P T 6 624

8484 3.5

> 2 * (1 - pt(3.5, 6))[1] 0.01282634

Reject H0 at α = .05stat signif, Hosp > Clinic

R code:

> y1 = c(667, 653, 614, 612, 604)> y2 = c(593, 525, 520)> > t.test(y1, y2, var.equal = T)

Two Sample t-test

data: y1 and y2 t = 3.5, df = 6, p-value = 0.01283alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: 25.27412 142.72588 sample estimates:mean of x mean of y 630 546

p-value < α = .05Reject H0 at this level.

The samples provide evidence that the difference between mean costs is (moderately) statistically significant, at the 5% level, with the hospital being higher than the clinic (by an average of $84).

Formal Conclusion

Interpretation

“Total Variability” = “Variability between groups” + “Variability within groups”

1Y 2Y kY

Hypothesis?

HA: “At least one ‘treatment mean’ μi is significantly different from the others.

Analysis of Variance (ANOVA) Main Idea: Among several (k 2) independent, equivariant,

normally-distributed “treatment groups”…

Alternate method ~

• (if equivariance holds): Point estimates /iy y nANOVA F-test593 525 520

2 3y 546

667 653 614 612 6041 5y 630 1 2 84y y

NOTE:> 0

“Group Means”

“Grand Mean”667 653 614 612 604 593 525 520

598.50

5 (630) 3 (546)

The grand mean is a weighted average of the group means, using the sample sizes as the weights.

667 653 614 612 604

1 5y 630 593 525 5202 3y 546

Analysis of Variance (ANOVA)

“Total Variability” =

Alternate method ~

“Variability between groups” + “Variability within groups”

1Y 2Y kY

HA: “At least one ‘treatment mean’ μi is significantly different from the others.

Main Idea: Among several (k 2) independent, equivariant, normally-distributed “treatment groups”…

5( ) 3( )5 3

630 546 598.50

593 525 5202 3y 546

667 653 614 612 6041 5y 630

“Group Means”

“Grand Mean”

• (if equivariance holds): Point estimates /iy y nANOVA F-test

How far is the “total” sample from the grand mean?

5( ) 3( )5 3

630 546 598.50

593 525 5202 3y 546

667 653 614 612 6041 5y 630

“Group Means”

“Grand Mean”

SSTot = 2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 598.5 598.5 598.5 598.5 598.52 2 2(593 ) (525 ) (520 ) 598.5 598.5 598.5 = 19710 dfTot = (5+3) –1 = 7

Alternate method ~

1Y 2Y kY

How can we measure this? Imagine zero variability within groups…

Alternate method ~

1 2 = =H0:

How can we measure this? Imagine zero variability within groups…

{630, 630, 630, 630, 630}

“Group Means”

“Grand Mean”

SSTot = 2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 598.5 598.5 598.5 598.5 598.5= 19710

{546, 546, 546}

SSTrt = 2 25 ( ) 3 ( ) 630 598.5 546 598.5

2 2 2(593 ) (525 ) (520 ) 598.5 598.5 598.5

= 13230

dfTot = (5+3) –1 = 7

dfTrt = (2) –1 = 1

5( ) 3( )5 3

630 546 598.50

667 653 614 612 6041 5y 630 593 525 520

2 3y 546

“The Clonemast

Alternate method ~

1Y 2Y kY

• (if equivariance holds): Point estimates /iy y n593 525 520

2 3y 546667 653 614 612 6041 5y 630

“Group Means”

“Grand Mean”

ANOVA F-test

SSTot = 2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 598.5 598.5 598.5 598.5 598.5= 19710

SSTrt = 2 25 ( ) 3 ( ) 630 598.5 546 598.5

2 2 2(593 ) (525 ) (520 ) 598.5 598.5 598.5

= 13230

dfTot = (5+3) –1 = 7

dfTrt = (2) –1 = 1

How far is each sample from its own group mean?

5( ) 3( )5 3

630 546 598.50

593 525 5202 3y 546667 653 614 612 604

1 5y 630

“Group Means”

“Grand Mean”

SSTot = 2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 598.5 598.5 598.5 598.5 598.5= 19710

SSTrt = 2 25 ( ) 3 ( ) 630 598.5 546 598.5

2 2 2(593 ) (525 ) (520 ) 598.5 598.5 598.5

= 13230

dfTot = (5+3) –1 = 7

dfTrt = (2) –1 = 1

SSErr =

5( ) 3( )5 3

630 546 598.50

2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 630 630 630 630 6302 2 2(593 ) (525 ) (520 ) 546 546 546 BUT…

1 2y y 84

s2 = SS/df2 2

1 1 2 2

( 1) ( 1)2pooled 2

n s n sn ns

(5 1)( ) (3 1)( )2pooled 5 3 2s

788.5 1663 1080

2 2(593 ) (520 )22 3

546 4s 1663

593 525 5202 3y 546

667 653 614 612 6041 5y 630

• Analysis via T-test (if equivariance holds): Point estimates /iy y nNOTE:

“Group Means”

2 2(667 ) (6630 63004 )21 5 1s

788.5“Group Variances”

1663788.5 2.11 4F

Pooled Variance

SS1 SS2

2 2(593 546) (520 546)22 3 1s

16632 2(667 630) (604 63021 5 1s

) 788.5

RECALL…

1 2y y 84

SSErr = 64802 2

1 1 2 2

( 1) ( 1)2pooled 2

n s n sn ns

(5 1)( ) (3 1)( )2pooled 5 3 2s

788.5 1663 1080

2 2(593 ) (520 )22 3

546 4s 1663

593 525 5202 3y 546

667 653 614 612 6041 5y 630

• Analysis via T-test (if equivariance holds): Point estimates /iy y nNOTE:

“Group Means”

2 2(667 ) (6630 63004 )21 5 1s

788.5“Group Variances”

1663788.5 2.11 4F

Pooled Variance

s2 = SS/df

2 2(593 546) (520 546)22 3 1s

dfErr = 6

2 2(667 630) (604 63021 5 1s

) 788.5

RECALL…

2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 630 630 630 630 6302 2 2(593 ) (525 ) (520 ) 546 546 546

593 525 5202 3y 546667 653 614 612 604

1 5y 630

“Group Means”

“Grand Mean”

SSTot = 2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 598.5 598.5 598.5 598.5 598.5= 19710

SSTrt = 2 25 ( ) 3 ( ) 630 598.5 546 598.5

2 2 2(593 ) (525 ) (520 ) 598.5 598.5 598.5

= 13230

dfTot = (5+3) –1 = 7

dfTrt = (2) –1 = 1

SSErr =

5( ) 3( )5 3

630 546 598.50

593 525 5202 3y 546667 653 614 612 604

1 5y 630

“Group Means”

“Grand Mean”

SSTot = 2 2 2 2 2(667 ) (653 ) (614 ) (612 ) (604 ) 598.5 598.5 598.5 598.5 598.5= 19710

SSTrt = 2 25 ( ) 3 ( ) 630 598.5 546 598.5

2 2 2(593 ) (525 ) (520 ) 598.5 598.5 598.5

= 13230

dfTot = (5+3) –1 = 7

dfTrt = (2) –1 = 1

SSErr =

5( ) 3( )5 3

630 546 598.50

4( ) 2( )788.5 1663 dfErr = = 6(5+3) –2= 6480

SSTot = SSTrt + SSErr dfTot = dfTrt + dfErr

Source df SS MS F-ratio p-value

Treatment 1 13230 13230

Error 6 6480 1080

Total 7 19710 –

ANOVA TableSSMSdf

2betweens

2withins

Note: This is also

2pooled .s

SSTot = SSTrt + SSErr dfTot = dfTrt + dfErrTot

Treatment 1 13230 13230

12.25 ????Error 6 6480 1080

Total 7 19710 –

ANOVA TableSSMSdf

2betweens

2withins

Note: This is also

2pooled .s

s 2 22

2 20 1 2

2 21 2

Test Statistic

Sampling Distribution =?

Treatment 1 13230 13230

Error 6 6480 1080

Total 7 19710 –

ANOVA TableSSMSdf

2betweens

2withins

Note: This is also

2pooled .s

p-value

Treatment 1 13230 13230

Error 6 6480 1080

Total 7 19710 –

ANOVA TableSSMSdf

2betweens

2withins

Note: This is also

2pooled .s

p-value

= .05α

Treatment 1 13230 13230

Error 6 6480 1080

Total 7 19710 –

ANOVA TableSSMSdf

2betweens

2withins

Note: This is also

2pooled .s

1, 6(on )F

p < .05

Treatment 1 13230 13230

12.25 .01282634

Error 6 6480 1080

Total 7 19710 –

ANOVA TableSSMSdf

2betweens

2withins

Note: This is also

2pooled .s

1, 6(on )F

1–pf(12.25, 1, 6)

Treatment 1 13230

12.25 .01282634

Error 6 6480 1080

Total 7 –

ANOVA TableSSMSdf

2betweens

2withins

1, 6(on )F

1–pf(12.25, 1, 6)

TotErr

SSTot = SSTrt + SSErr dfTot = dfTrt + dfErr

Thus, the treatment accounts for = 67.1% of the total variability in the response Y.

1323019710

R code:

# ANOVA FOR UNBALANCED DESIGN

> y1 = c(667, 653, 614, 612, 604)> y2 = c(593, 525, 520)> > Data = data.frame(+ Y = c(y1, y2),+ X = factor(rep(c("y1", "y2"), times = c(length(y1), length(y2))))+ )> > var.test(Y ~ X, data = Data) # EQUIVARIANCE?

F test to compare two variances

data: Y by X F = 0.4741, num df = 4, denom df = 2,p-value = 0.4738alternative hypothesis: true ratio of variances is not equal to 1 95 percent confidence interval: 0.01208057 5.04920249 sample estimates:ratio of variances 0.4741431

R code:

# ANOVA FOR UNBALANCED DESIGN

> out = aov(Y ~ X, data = Data)> anova(out)

Analysis of Variance Table

Response: Y Df Sum Sq Mean Sq F value Pr(>F) X 1 13230 13230 12.25 0.01283 *Residuals 6 6480 1080 ---Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Note: Vis-à-vis T-test vs. F-test,

• p-value is the same using either method (.01283), since the sample is unchanged!

• The square of the Tdf -score (3.5) is equal to the F1, df -score (12.25).

(Recall that the square of the Z-score is equal to the -score.)21

1X 2X kX

Suppose this ANOVA “overall F-test” indicates that a significant difference exists between one (or more) of the treatment means, at = .05.

How can we find out which one(s)?

Idea: Test all possible pairwise comparisons, each via a two-sample t-test.Example : Suppose there are k = 5 treatment groups.

(1, 2) (1, 3) (1, 4) (1, 5) (2, 3) (2, 4) (2, 5) (3,4) (3,5) (4,5)... ... ... ... ... ... ... ... ... ...p p p p p p p p p p

There are such comparisons.5

1Y 2Y kY

= ==H0:

…etc…

PROBLEM???

(1, 2) (1, 3) (1, 4) (1, 5) (2, 3) (2, 4) (2, 5) (3,4) (3,5) (4,5)... ... ... ... ... ... ... ... ... ...p p p p p p p p p p

1Y 2Y kY

= ==H0:

…etc…

PROBLEM???

SPURIOUS SIGNIFICANCE!!!

* = .05/10

(1, 2) (1, 3) (1, 4) (1, 5) (2, 3) (2, 4) (2, 5) (3,4) (3,5) (4,5)... ... ... ... ... ... ... ... ... ...p p p p p p p p p p

1Y 2Y kY

= ==H0:

…etc…

Make each comparison at level * = / 10.

PROBLEM???

(1, 2) (1, 3) (1, 4) (1, 5) (2, 3) (2, 4) (2, 5) (3,4) (3,5) (4,5)... ... ... ... ... ... ... ... ... ...p p p p p p p p p p

1Y 2Y kY

= ==H0:

…etc…

BONFERRONI CORRECTIONMake each comparison at level * = / 10.

1Y 2Y kY

= ==H0:

Alternate method ~

MODEL ASSUMPTIONS?

1Y 2Y kY

= ==H0:

Alternate method ~

• Equivariance can be tested via very similar “two variances” F-test in 6.2.2 (but this is very sensitive to normality assumption), or others. If violated, can extend Welch Test for two means.

1Y 2Y kY

= ==H0:

Alternate method ~

• Normality can be tested via usual methods. If violated, use nonparametric Kruskal-Wallis Test.

1Y 2Y kY

= ==H0:

Alternate method ~

• Extensions of ANOVA for data in matched “blocks” designs, repeated measures, multiple factor levels within groups, etc.

1Y 2Y kY

= ==H0:

Alternate method ~

• How to identify significant group(s)? Pairwise testing, with correction (e.g., Bonferroni) for spurious significance.

• Example: k = 5 groups result in 10 such tests, so let each α* = α / 10.

“spurious significance”

CHAPTER 6 Statistical Inference & Hypothesis Testing

Documents

Transcript of CHAPTER 6 Statistical Inference & Hypothesis Testing

Approximate Likelihoods - Statistical Inference, Learning ... · Approximate Bayesian Computation Marin et al., 2010 likelihood is not computable, but we can simulate from the model

One-sample normal hypothesis Testing, paired t …people.stat.sc.edu/hansont/stat704/notes2.pdfOne-sample normal hypothesis Testing, paired t-test, two-sample normal inference, normal

Chapter 10 Statistical Inference About Means and ...dscsss/teaching/mgs9920/slides/ch10.pdf · 1 Slide 1 Chapter 10 Statistical Inference About Means and Proportions With Two Populations

1 Testing Statistical Hypothesis for Dependent Samples.

Statistical Inference - Princeton Universityimai.princeton.edu/teaching/files/statistics.pdf · Statistical Inference Kosuke Imai Department of Politics Princeton University Fall

PROBABILITY AND STATISTICAL INFERENCE Probability vs ...courses.washington.edu/stat512/506-512Notes7.pdf · Clearly F X directly determines the probabilities of all intervals in R1:

CHAPTER 6 Statistical Inference & Hypothesis Testing

Statistical Inference for Diffusion Processes · Ergodicity, LLN, CLTs Statistical inference for SDEs (high frequency data) Statistical inference for SDEs (discrete data) Numerical

Two statistical inference methods: Confidence interval Estimator +/- Margin of Error Hypothesis testing Hypothesis: H 0 v.s. H a Test statistic.

Statistical Inference - Course Project 1 - Amazon S3 Inference - Course Project 1 Chan Chee-Foong May 14, 2016 Overview Inthisassignment,wewillinvestigatetheexponentialdistributioninRandcompareitwiththeCentral

Statistical Inference Wen, shu-hui shwen@mail.tcu.edu.tw.

6. Statistical Inference and Hypothesis Testingpages.stat.wisc.edu/.../6_-_Statistical_Inference/6.1_-_One_Sample.pdf · Ismor Fischer, 8/19/2008 Stat 541 / 6-8 Objective 3: Consider

Hypothesis Testing

Fundamental Theory of Statistical Inferenceayoung/ltcc/slideschapter2.pdf · G. Alastair Young Fundamental Theory of Statistical Inference. Title: Fundamental Theory of Statistical

Hypothesis Testing

EGR 252 S10 Ch.10 8th edition Slide 1 Statistical Hypothesis Testing Review A statistical hypothesis is an assertion concerning one or more populations.

Ch7: Hypothesis Testing (1 Sample) 7.1 Introduction to Hypothesis Testing 7.2 Hypothesis Testing for the Mean (σ known ) 7.3 Hypothesis Testing for the.

15 - 1 Module 15: Hypothesis Testing This modules discusses the concepts of hypothesis testing, including α-level, p-values, and statistical power. Reviewed.

Feature Selection - Computer Sciencerlaz/prec20092/slides/FeatureSelection.pdf · Outlier Removal For a normally ... Feature Selection Based On Statistical Hypothesis Testing (cont)

EGR 252 Ch. 9 Lecture1 JMB 2014 9th edition Slide 1 Chapter 9: One- and Two- Sample Estimation Statistical Inference Estimation Tests of hypotheses.