Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test...

33
Inferential Statistics & Test of Significance

Transcript of Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test...

Page 1: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Inferential Statistics & Test of Significance

Page 2: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Confidence Interval (CI)Confidence Interval (CI)

−−

± yZY σα 2/

Y = mean Z = Z score related with a 95% CI σ = standard error

rorstandarder*)2(96.1 orsamplemean±

Page 3: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Building a CIBuilding a CI• Assume the following

Ny

σ =−

400

15

100

=

=

=

Ny

y

σ

µ

750.40015

==−yσ

Page 4: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

CI

53.9847.101

)750.0)(96.1(100

==

±

LowerUpper

Why do we use 1.96?

Page 5: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Source; Knoke & Bohrnstead (1991:167)

Is there a sample that is different from the mean?

Page 6: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Significance TestingSignificance Testing• When we explain some phenomenon we

move beyond description to inferential statistics and hypothesis testing.

• Tests of significance allow us to test hypotheses, and when we find a relationship between variables, reject the null hypothesis.

Page 7: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Hypothesis testing

• Hypothesis testing means that we are testing our null hypothesis (Ho) against some competing or alternative hypothesis (H1)

• Normally we choose statements such as

Ho : µy = 100 H1: µy ≠100

Or H1: µy > 100

Or H1: µy < 100

Page 8: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Significance TestingSignificance Testing• Even with high powered statistical measures,

there will be results that pop up that are affected by chance. If we were to keep running our models a thousand times, or fewer, we would likely see some results that do not stem from systematic processes.

• Thus, we need to determine at what level of significance we are willing to frame our results. We can never be 100% confident.

• Conventional levels of significance where we reject the null hypothesis are usually .05 or .01. The probability .10 is weakly significant.

Page 9: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Significance TestingSignificance Testing• When you erroneously reject the null

hypothesis when it is true, you make a Type I error. This means you are accepting a “False Positive” result.

• Think of this as a fiancé test. The chances of rejecting or saying no to mister or miss “right”

Page 10: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

SignificanceSignificance TestingTesting• A Type II error occurs when you accept the

null hypothesis when it is not true.

• This is a “False Negative”, when you have say yes to Mr. or Miss “wrong”

• Type II errors in statistical testing result from too little data, omitted variable bias, and multicollinearity.

Page 11: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Other distributions • The normal distribution assumes:1. We know the standard error of the population,

however, often we don’t know it. 2. The t-distribution become the best alternative

when we don’t know the standard error but we know the standard deviation.

3. As the sample gets bigger the t-distributionapproaches the normal distribution

4. There are other distribution such as chi square and the that we will discuss latter.

Page 12: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

T- Distribution & Normal DistributionThe form of the t-distribution depends on the sample size. As the sample getsLarger there is not difference between the normal and the t-distribution

Source: Gujarati (1992:76)

Page 13: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

The t formula

NSy

ty

yµ−=

)/(2/ NStYCI yα±=

For α =.05 and N=30 , t =2.045

Page 14: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

95% CI using t-test

• Mean= 20 • Sy = 5• N= 20

20± 2.093 (5/√20) = 22.34 upper 18.88 lower

Page 15: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Why do we care about CI?Why do we care about CI?

• We use CI interval for hypothesis testing• For instance, we want to know if there is a

difference of home values between El Paso and Boston

• We want to know whether or not taking class at Kaplan makes a difference in our GRE scores

• We want to know if there is a difference between the treatment and control groups.

Page 16: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Mean Difference testingMean USA

Las Cruces BostonEl Paso

Home Values

Page 17: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to
Page 18: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

T-Tests of Independence• Used to test whether there is a significant

difference between the means of two samples.

• We are testing for independence, meaning the two samples are related or not.

• This is a one-time test, not over time with multiple observations.

• Example: The values of homes between El Paso and Boston

Page 19: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

T-Test of Independence• Useful in experiments where people are

assigned to two groups, when there should be no differences, and then introduce Independent variables (treatment) to see if groups have real differences, which would be attributable to introduced X variable. This implies the samples are from different populations (with different µ).

• This is the Completely Randomized Two-Group Design.

Page 20: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

T-Test of Independence• For example, we can take a random sample of

high school students and divided into two groups. One gets tutoring for the SAT and the other does not.

Ho: µ1≠ µ2 H1: µ1= µ2

• After one group gets tutoring, but not the other, we compare the scores. We find that indeed the group exposed to tutoring outperformed the other group. We thus conclude that tutoring makes a difference.

Page 21: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

• Positive increments at a different rate

Treatment

Control

Post-testPre-test

Page 22: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Two Sample Difference of Means T-Test

+−+

−+−

−=

21

21

2

21

222

11

21

2)1()1(

nnnn

nnsnsn

XXt

2

21

222

11

2)1()1(

−+−+−

nnsnsn Pooled variance of the two groups

Sp2 =

+

21

21

nnnn = common standard deviation of two groups

Page 23: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Two Sample Difference of Means T-Test

• The nominator of the equation captures difference in means, while the denominator captures the variation within and between each group.

• Important point: of interest is the difference between the sample means, not sample and population means. However, rejecting the null means that the two groups under analysis have different population means.

Page 24: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

An example• Test on GRE verbal test scores by gender:Females: mean = 50.9, variance = 47.553, n=6Males: mean=41.5, variance= 49.544, n=10

+−+−+−

−=

)10(6106

2106544.49)110(553.47)16(

5.419.50t

Page 25: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

[ ])26667(.826.484.9

=t

02.134.9

=t

605.2608.34.9

==t Now what do we do with this obtained value?

Page 26: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Steps of Testing and Significance1. Statement of null hypothesis: if there is

not one then how can you be wrong?2. Set Alpha Level of Risk: .10, .05, .013. Selection of appropriate test statistic:

T-test, 4. Computation of statistical value: get

obtained value.5. Compare obtained value to critical

value: done for you for most methods in most statistical packages.

Page 27: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Steps of Testing and Significance

6. Comparison of the obtained and critical values.

7. If obtained value is more extreme than critical value, you may reject the null hypothesis. In other words, you have significant results.

8. If point seven above is not true, obtained is lower than critical, then null is not rejected.

Page 28: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

GRE Verbal ExampleObtained Value: 2.605Critical Value?Degrees of Freedom: number of cases left after subtracting

1 for each sample. (14)Ho : µf =µmH1: µf ≠µm

Is the null hypothesis (Ho) supported?

Answer: No, women have higher verbal skills and this is statistically significant. This means that the mean scores of each gender as a population are different.

Page 29: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Paired T-Tests• We use Paired T-Tests, test of

dependence, to examine a single sample subjects/units under two conditions, such as pretest - posttest experiment.

• For example, we can examine whether a group of students improves if they retake the GRE exam. The T-test examines if there is any significant difference between the two studies. If so, then possibly something like studying more made a difference.

Page 30: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Paired T-Tests• Unlike a test for independence, this test

requires that the two groups/samples being evaluated are dependent upon each other.

• For example, we can use a paired t-test to examine two sets of scores across time as long as they come from the same students.

• This is appropriate for a pre-test –post-test research design

Page 31: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

)1()(( 22

−−∑ ∑

nDDn

D

ΣD = sum differences between groups, plus it is squared.

n = number of paired groups

Page 32: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Comparing Test Scores

80.16071.87393.68691.875

886995928888

72.87486.77581.45094.287

969573.36971.248Final Midterm

Page 33: Inferential Statistics & Test of Significanceutminers.utep.edu/crboehmer/Inferential Statistics Test of...move beyond description to inferential ... significance we are willing to

Paired Samples Statistics

74.3571 14 14.60562 3.9035284.5643 14 9.32924 2.49335

MIDFINAL

Pair1

Mean N Std. DeviationStd. Error

Mean

Paired Samples Correlations

14 .710 .004MID & FINALPair 1N Correlation Sig.

Paired Samples Test

-10.2071 10.34300 2.76428 -16.1790 -4.2353 -3.693 13 .003MID - FINAPair 1Mean Std. Deviation

Std. ErrorMean Lower Upper

95% ConfidenceInterval of the

Difference

Paired Differences

t df Sig. (2-tailed)

What can we conclude?