Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square (χ²) Distribution
Author
angelinahouston 
 Slide 1
 Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square (χ²) Distribution
 Slide 2
 Outline: The basic logic of Chi Square. The terminology used with bivariate tables. The computation of Chi Square with an example problem using the Five Step Model
 Slide 3
 Basic Logic: Chi Square is a test of significance based on bivariate tables. We are looking for significant differences between the actual cell frequencies in a table (f_o, the observed frequencies) and those that would be expected by random chance (f_e, the expected frequencies). The data are often presented in a table format. If starting with raw data on two variables, a bivariate table must be created first.
 Slide 4
 Bivariate Tables: Must have a title. Cells are intersections of columns and rows. Subtotals are called marginals. N is reported at the intersection of row and column marginals.
 Slide 5
 Tables (cont.) Columns are scores of the independent variable. There will be as many columns as there are scores on the independent variable. Rows are scores of the dependent variable. There will be as many rows as there are scores on the dependent variable.
 Slide 6
 Tables (cont.) There will be as many cells as there are scores on the two variables combined. Each cell reports the number of times each combination of scores occurred.
 Slide 7
 What your table should look like:
 Title
 Rows     Columns                                  Total
 Row 1    cell a               cell b              Row Marginal 1
 Row 2    cell c               cell d              Row Marginal 2
 Total    Column Marginal 1    Column Marginal 2   N
 Slide 8
 The Chi Square Distribution: The chi square distribution is asymmetric and its values are always positive (Appendix C). Degrees of freedom are based on the table and are calculated as (rows − 1) × (columns − 1).
 Slide 9
 Example: 1e #11.2 (#10.2 in 2/3e) Question: Are the homicide rate and volume of gun sales related for a sample of 25 cities? The bivariate table below shows the relationship between homicide rate (columns) and gun sales (rows). This 2x2 table has 4 cells.
 GUN SALES   HOMICIDE RATE
             Low    High   Totals
 High          8       5       13
 Low           4       8       12
 Totals       12      13   N = 25
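 The marginals and N in this table can be checked with a few lines of Python. This is only an illustrative sketch: the dictionary layout and the variable names are assumptions made for the example, not part of the textbook problem.

```python
# Observed frequencies from the example table.
# Outer keys = gun sales (rows), inner keys = homicide rate (columns).
# The data structure and names are illustrative assumptions.
observed = {
    "High": {"Low": 8, "High": 5},
    "Low": {"Low": 4, "High": 8},
}

# Row marginals: total of each row.
row_marginals = {row: sum(cells.values()) for row, cells in observed.items()}

# Column marginals: total of each column.
column_marginals = {}
for cells in observed.values():
    for column, count in cells.items():
        column_marginals[column] = column_marginals.get(column, 0) + count

# N sits at the intersection of the row and column marginals.
n = sum(row_marginals.values())

print(row_marginals, column_marginals, n)
```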
 Slide 10
 Solution Using the 5-Step Method. Step 1: Make Assumptions and Meet Test Requirements. Independent random samples. Level of measurement is nominal. Note that no assumption is made about the shape of the sampling distribution. When the distribution is normal, a parametric test (Z or t-test, ANOVA) can be used. The chi square test is nonparametric. It can be used when normality is not assumed.
 Slide 11
 Step 2: State the Null and Alternate Hypotheses. H0: The variables are independent. You can also say: H0: f_o = f_e. H1: The variables are dependent. Or: H1: f_o ≠ f_e.
 Slide 12
 Step 3: Select the Sampling Distribution and Establish the Critical Region. Because normality is not assumed and our data are in tabular form, our Sampling Distribution = χ². Alpha = .05. df = (r − 1)(c − 1) = (2 − 1)(2 − 1) = 1. χ²(critical) = 3.841.
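 The critical value is normally read from the chi square table (Appendix C). As a cross-check, the sketch below recovers it for this problem using only the Python standard library. It relies on a shortcut that holds for df = 1 only: a chi square variable with one degree of freedom is the square of a standard normal Z, so the critical value is the square of the two-tailed Z critical value. The variable names are assumptions made for the example.

```python
from statistics import NormalDist

alpha = 0.05
rows, columns = 2, 2

# Degrees of freedom for a bivariate table: (r - 1)(c - 1).
df = (rows - 1) * (columns - 1)

# Shortcut valid for df = 1 ONLY: chi square(1) is Z squared,
# so square the two-tailed Z critical value (about 1.96 at alpha = .05).
z_critical = NormalDist().inv_cdf(1 - alpha / 2)
chi_square_critical = z_critical ** 2

print(df, round(chi_square_critical, 3))  # prints: 1 3.841
```

 For tables with more than one degree of freedom this shortcut does not apply, and the value should come from the table in Appendix C or from statistical software.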
 Slide 13
 Step 4: Calculate the Test Statistic. Formula: χ²(obtained) = Σ [(f_o − f_e)² / f_e], summed over all cells. Method: 1. Find expected frequencies for each cell. 2. Complete computational table to find χ²(obtained).
 Slide 14
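 1. Find expected frequencies for each cell. To find f_e, multiply the row marginal by the column marginal for each cell and divide by N: f_e = (row marginal × column marginal) / N.
 High gun sales, Low homicide rate: (13 × 12)/25 = 156/25 = 6.24
 High gun sales, High homicide rate: (13 × 13)/25 = 169/25 = 6.76
 Low gun sales, Low homicide rate: (12 × 12)/25 = 144/25 = 5.76
 Low gun sales, High homicide rate: (12 × 13)/25 = 156/25 = 6.24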
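 The same expected frequencies can be produced in Python. This is a minimal sketch that assumes the marginals are held in two plain lists, ordered as in the example table; the names are illustrative.

```python
# Marginals from the example table (ordering is an assumption of this sketch).
row_marginals = [13, 12]     # gun sales: High, Low
column_marginals = [12, 13]  # homicide rate: Low, High
n = 25

# f_e for each cell = (row marginal * column marginal) / N
expected = [
    [row_total * column_total / n for column_total in column_marginals]
    for row_total in row_marginals
]

for row in expected:
    print([round(fe, 2) for fe in row])
```

 The expected frequencies in each row and column add up to the same marginals as the observed frequencies, which is a useful check on the arithmetic.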
 Slide 15
 Observed and Expected Frequencies for each cell (Note that totals are unchanged):
 GUN SALES   HOMICIDE RATE
             Low                    High                   Total
 High        f_o = 8, f_e = 6.24    f_o = 5, f_e = 6.76    13
 Low         f_o = 4, f_e = 5.76    f_o = 8, f_e = 6.24    12
 Total       12                     13                     N = 25
 Slide 16
 2. Complete Computational Table. A table like this will help organize the computations: (a) Add values for f_o and f_e for each cell to the table.
          f_o    f_e     f_o − f_e    (f_o − f_e)²    (f_o − f_e)²/f_e
          8      6.24
          5      6.76
          4      5.76
          8      6.24
 Total    25     25
 Slide 17
 Computational Table (cont.) (b) Subtract each f_e from each f_o. The total of this column must be zero.
          f_o    f_e     f_o − f_e    (f_o − f_e)²    (f_o − f_e)²/f_e
          8      6.24     1.76
          5      6.76    −1.76
          4      5.76    −1.76
          8      6.24     1.76
 Total    25     25       0
 Slide 18
 Computational Table (cont.) (c) Square each of these values.
          f_o    f_e     f_o − f_e    (f_o − f_e)²    (f_o − f_e)²/f_e
          8      6.24     1.76        3.10
          5      6.76    −1.76        3.10
          4      5.76    −1.76        3.10
          8      6.24     1.76        3.10
 Total    25     25       0
 Slide 19
 Computational Table (cont.) (d) Divide each of the squared values by the f_e for that cell. (e) The sum of this column is chi square. χ²(obtained) = 2.00
          f_o    f_e     f_o − f_e    (f_o − f_e)²    (f_o − f_e)²/f_e
          8      6.24     1.76        3.10            .50
          5      6.76    −1.76        3.10            .46
          4      5.76    −1.76        3.10            .54
          8      6.24     1.76        3.10            .50
 Total    25     25       0                           χ² = 2.00
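 The computational table can be reproduced in Python as a check on the hand calculation. This is a sketch with illustrative variable names. Note that carrying full precision gives a sum of about 1.99; the 2.00 in the table comes from rounding each intermediate value to two decimals before adding.

```python
# Observed and expected frequencies, cell by cell, in the table's order.
observed = [8, 5, 4, 8]
expected = [6.24, 6.76, 5.76, 6.24]

# Step (b): differences f_o - f_e (these must sum to zero).
differences = [fo - fe for fo, fe in zip(observed, expected)]

# Step (c): squared differences.
squared = [d ** 2 for d in differences]

# Step (d): divide each squared difference by that cell's f_e.
contributions = [sq / fe for sq, fe in zip(squared, expected)]

# Step (e): the sum of the last column is chi square (obtained).
chi_square_obtained = sum(contributions)

for fo, fe, d, sq, c in zip(observed, expected, differences, squared, contributions):
    print(f"{fo:>3} {fe:>6.2f} {d:>6.2f} {sq:>6.2f} {c:>5.2f}")
print(f"chi square (obtained, unrounded) = {chi_square_obtained:.4f}")
```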
 Slide 20
 Step 5: Make a Decision and Interpret the Results of the Test. χ²(critical) = 3.841. χ²(obtained) = 2.00. The test statistic is not in the Critical Region. Fail to reject H0. There is no significant relationship between homicide rate and gun sales.
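 The decision rule in Step 5 is a simple comparison, sketched below in Python. The p-value line goes beyond what the slides cover and is included only as an optional extra: it uses the fact that, for df = 1, the probability of a chi square value at least this large equals erfc(√(χ²/2)). That identity does not hold for other degrees of freedom, and the variable names are assumptions made for the example.

```python
import math

chi_square_obtained = 2.00   # from the computational table
chi_square_critical = 3.841  # alpha = .05, df = 1

# Reject H0 only if the obtained value falls in the critical region.
reject_null = chi_square_obtained > chi_square_critical

# Optional extra, valid for df = 1 ONLY: upper-tail probability.
p_value = math.erfc(math.sqrt(chi_square_obtained / 2))

decision = "Reject H0" if reject_null else "Fail to reject H0"
print(decision, round(p_value, 3))
```

 A p-value above alpha (.05) leads to the same conclusion as the comparison with the critical value: fail to reject H0.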
 Slide 21
 Interpreting Chi Square: The chi square test tells us only whether the variables are independent or not. It does not tell us the pattern or nature of the relationship. To investigate the pattern, compute percentages within each column and compare across the columns.
 Slide 22
 Interpreting Chi Square (cont.) Most cities low on homicide rate (66.7%) were high in gun sales, and most cities high in homicide rate (61.5%) were low in gun sales. As homicide rates increase, gun sales decrease. This relationship is not significant but does have a clear pattern.
 GUN SALES   HOMICIDE RATE
             Low          High         Total
 High        8 (66.7%)    5 (38.5%)    13
 Low         4 (33.3%)    8 (61.5%)    12
 Total       12 (100%)    13 (100%)    N = 25
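 The column percentages in this table can be computed as follows. This is a minimal sketch that assumes the observed frequencies are stored as a list of rows; the names are illustrative.

```python
# Observed frequencies: rows = gun sales (High, Low),
# columns = homicide rate (Low, High).
observed = [
    [8, 5],
    [4, 8],
]

# Column totals (the column marginals).
column_totals = [sum(column) for column in zip(*observed)]

# Percentage of each cell within its own column ("percentage down").
column_percentages = [
    [round(100 * cell / total, 1) for cell, total in zip(row, column_totals)]
    for row in observed
]

# Compare across the columns within each row ("compare across").
for label, row in zip(["High", "Low"], column_percentages):
    print(label, row)
```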
 Slide 23
 The Limits of Chi Square: Like all tests of hypothesis, chi square is sensitive to sample size. For a relationship of a given strength, obtained chi square increases as N increases. With large samples, trivial relationships may be significant. To correct for this, when N > 1000, set your alpha = .01. Remember: significance is not the same thing as importance.
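 The effect of sample size can be seen by keeping the cell proportions of the gun sales example fixed and multiplying every cell by 10. The sketch below uses a hypothetical helper, chi_square, written for this illustration; the enlarged table is made-up data, not part of the textbook problem.

```python
def chi_square(table):
    """Return chi square (obtained) for a table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    column_totals = [sum(column) for column in zip(*table)]
    n = sum(row_totals)
    total = 0.0
    for i, row in enumerate(table):
        for j, fo in enumerate(row):
            fe = row_totals[i] * column_totals[j] / n  # expected frequency
            total += (fo - fe) ** 2 / fe
    return total

small = [[8, 5], [4, 8]]                              # N = 25, from the example
large = [[cell * 10 for cell in row] for row in small]  # N = 250, hypothetical

print(round(chi_square(small), 2), round(chi_square(large), 2))
```

 The percentages in the two tables are identical, yet the obtained chi square is ten times larger for the bigger sample, which moves it past the critical value of 3.841.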
 Slide 24
 Yates Correction for Continuity The chi square statistic is sensitive to small cell sizes. Whenever any of your cell sizes are
 Slide 25
 Using SPSS to Calculate Chi Square:
 1. Open the data set in SPSS.
 2. Go to Analyze > Descriptive Statistics > Crosstabs.
 3. Move your dependent variable into the Rows box, and your independent variable into the Columns box.
 4. Click Statistics and check the box for Chi square.
 5. Click Cells and select Column in the Percentages box.
 6. Click Continue and OK.
 Note: The rule of thumb for analyzing your percentage data is "Percentage Down, Compare Across." When analyzing your percentages, always compare the categories of your dependent (Row) variable across the columns of your independent (Column) variable.
 Slide 26
 Practice Question: Try 1e #11.3 (#10.3 in 2/3e) parts a and b. Use the five step model to calculate a full solution to this question.