Research Methods: Statistics

24
Statistics Brian J. Piper, Ph.D. ψ

Transcript of Research Methods: Statistics

Page 1: Research Methods: Statistics

Statistics

Brian J. Piper, Ph.D.

ψ

Page 2: Research Methods: Statistics

Mark Twain (?)

• There are three types of lies, lies, damn lies, and statistics.

http://en.wikipedia.org/wiki/Lies,_damned_lies,_and_statistics

Page 3: Research Methods: Statistics

Goals

• Levels of measurement• Group comparisons (t)• Association (scatterplot/r)• Effect-size (d)

Page 4: Research Methods: Statistics

Levels of Measurement

• Nominal: categorical, example: sex• Ratio: quantitative, example: age• Ordinal: ranking, example: self-report

– Strongly disagree = 1– Disagree = 2– Neutral = 3– Agree = 4– Strongly agree = 5– S.D.-------------------------------------------------------------------------------------------S.A.

)

Page 5: Research Methods: Statistics

What do these countries have in common?

• Liberia• Burma• United States

Page 6: Research Methods: Statistics

Metric System

Unit Symbol Factor

tera T 1 x 1012

giga G 1 x 109

kilo K 1 x 103

--- -- 1

centi c 1 x 10-2

milli m 1 x 10-3

micro μ 1 x 10-6

Page 7: Research Methods: Statistics

DataSex Height

(m)Weight

(kg)

Sex Height (m)

Weight

(kg)

Female 2.0 60 Male 2.5 80

Female 1.9 58 Male 2.3 76

Female 1.8 56 Male 2.1 74

Female 1.7 54 Male 2.0 73

Female 1.6 52 Male 1.8 72

Female 1.5 50 Male 1.7 70

Female 1.4 48 Male 1.6 68

Female 1.3 46 Male 1.5 65

Female 1.6 57 Male 2 50

Page 8: Research Methods: Statistics

Sex Height (m)

Weight

(kg)

Sex Height (m)

Weight

(kg)

Female 2.0 60 Male 2.5 80

Female 1.9 58 Male 2.3 76

Female 1.8 56 Male 2.1 74

Female 1.7 54 Male 2.0 73

Female 1.6 52 Male 1.8 72

Female 1.5 50 Male 1.7 70

Female 1.4 48 Male 1.6 68

Female 1.3 46 Male 1.5 65

Female 1.6 57 Male 2 50Average 1.64 53.4 1.94 69.8

Mean (or average) = Sum (X) /N where N is the # of scores

Page 9: Research Methods: Statistics

Variability

• Variability: how much scores differ, on average, from mean– Variance = Sum (X – Mean)2 /N– Standard Deviation (SD) = √Variance– Standard Error of Mean (SEM) = SD / √ N

Page 10: Research Methods: Statistics

Group Comparisons I

• Are women lighter then men?– P = probability value

if p < .05 therefore statistically “significant”

– t test = (MeanMales - MeanFemales) / SEM

– t = 4.97, p = .0001

malefemale

SEX_

40

50

60

70

80

90

WE

IGH

T

0123456789Count

0 1 2 3 4 5 6 7 8 9Count

←←

Page 11: Research Methods: Statistics

Group Comparisons II

• Do men have a higher IQ then women?• T is the measure of variability (e.g. S.E.M.)

A. Sample Size = 40

Men (N = 20) Women (N=20)0

25

50

75

100

125

IQ

B. Sample Size = 4,000

Men (N = 2000) Women (N=2000)0

25

50

75

100

125

IQ

C. Sample Size = 4,000 ( * p < .05).

Men (N = 2000) Women (N=2000)90

95

100

105

*

IQ

A finding with a * refers to a “statistically significant” finding, e.g. men > women

Page 12: Research Methods: Statistics

Error Bars Example 2

Batterham et al. New England Journal of Medicine, 349, 941-948.

Page 13: Research Methods: Statistics

1.3 1.4 1.5 1.6 1.7 1.8 1.9 2.0HEIGHT_F

44

48

52

56

60

64

WE

IGH

T_

F

Scatterplots

1.4 1.6 1.8 2.0 2.2 2.4 2.6HEIGHT_M

40

50

60

70

80

90

WE

IGH

T_M

ALEOutlier? ->

Outlier? ->

Page 14: Research Methods: Statistics

Positive Association

Study Hours

Score

3 80

5 90

2 75

6 80

7 90

1 50

2 65

7 85

1 40

7 100

Page 15: Research Methods: Statistics

Negative Association

Variable A

Variable B

Page 16: Research Methods: Statistics
Page 17: Research Methods: Statistics

Standardized “Z” Scores

• Z is a #• Z = 0 therefore average• Z > 0 therefore above average• Z < 0 therefore below average

• Z = (X – Mean) / SD• Z = (600 – 500) / 100

= 1.0

3.1

Page 18: Research Methods: Statistics

• r: quantifies relationship between two variables (e.g. x & y)• No association: r = 0.00 (C)• Positive association: r > 0.00 (A B)• Negative association: r < 0.00 (DE)• Strong association: A E, Weak association: B D

A B C

DE

r = Sum(Zx * Zy)/ N

3.6

Page 19: Research Methods: Statistics

Probability

Frequency Blue Brown

+ 2 10

- 999,998 999,990

Probability

Blue Brown

+ .000002 .000010

- .999998 .99999BrainCancer

Eyes Eyes

Page 20: Research Methods: Statistics

Risk

• Absolute Risk: Rate of condition/total population studied, e.g. .000010 or .000002

• Relative Risk: Rate of condition among group A divided by rate of condition among group B– .000010 / .000002 = 5.0

.0010% or .0002%

3.2

Page 21: Research Methods: Statistics

Effect-Size

• Procedure used to summarize the magnitude of group differences.– Cohen’s d = (MeanA – MeanB) / SD

• d = 0.20 small effect size• d = 0.50 medium effect size• d = 0.80 large effect size

Can be averaged for multiple studies (meta-analysis).

Page 22: Research Methods: Statistics

D.A.R.E.

• Founded in 1983 by Daryl Gates• Police officers give lectures to middle school• Found in 80% of U.S. school districts, 54

nations• Cohen’s d = (MeanD.A.R.E. – MeanControl )/ SDpop

d = 0.30 small, 0.50 medium, 0.70 large

http://www.dare.com/home/default.asp

Page 23: Research Methods: Statistics

Does D.A.R.E work?

Ennett S.T. (1994). American Journal of Public Health, 84, 1394-1401.

Page 24: Research Methods: Statistics

SummaryGoal Intuition Test

Difference in means

Bar Graphs with SEM

“t-test”

Relationship between variables (ratio x ratio)

Scatterplot Correlation “r”

Summarize many studies

Read papers Effect size “Cohen’s d”