7.1 - 1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides...

download 7.1 - 1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.

If you can't read please download the document

description

Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview

Transcript of 7.1 - 1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides...

Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 7 Estimates and Sample Sizes 7-1 Review and Preview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: Known 7-4 Estimating a Population Mean: Not Known 7-5 Estimating a Population Variance Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Preview The two major activities of inferential statistics are: (1) to use sample data to estimate values of a population parameters (2) to test hypotheses or claims made about population parameters later chapters. We introduce methods for estimating values of these important population parameters: proportions, means, and variances. We also present methods for determining sample sizes necessary to estimate those parameters. This chapter presents the beginning of inferential statistics. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-2 Estimating a Population Proportion Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Key Concept In this section we present methods for using a sample proportion (like presidential approval proportion in a poll of 1200) to estimate the value of a population proportion (entire US population). The sample proportion p is the best point estimate (one number) of the unknown population proportion p. A point estimate is a single value (or point) used to approximate a population parameter. We can use a sample proportion distribution to construct a confidence interval of a population proportion. phat is used as the point estimate of p because it is unbiased and it is the most consistent of the estimators that could be used. Unbiased distribution of sample proportions tends to center about the value of p, not over/under estimate it. Consistent standard deviation of sample proportions tends to be smaller than the standard deviation of any other unbiased estimators. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: Because the sample proportion is the best point estimate of the population proportion, we conclude that the best point estimate of p is When using the sample results to estimate the percentage of all adults in the United States who believe in global warming, the best estimate is 70%. In the Chapter Problem we noted that in a Pew Research Center poll, 70% of 1501 randomly selected adults in the United States believe in global warming, so the sample proportion is = Find the best point estimate of the proportion of all adults in the United States who believe in global warming. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Sample proportion is a good estimate of population proportion, but just how good is it??? A confidence interval (CI) is a range (or an interval) of values used to estimate the true value of a population parameter. For example: presidential approval rating is 51% with 3% margin of error => CI = [48%,54%]. It is based on known or approximated distribution of sample proportion it is approximately normal via normal approximation of the binomial. A confidence level is the probability 1 that the confidence interval actually does contain the population parameter, assuming that the estimation process of creating a confidence interval is repeated a large number of times. (The confidence level is also called degree of confidence, or the confidence coefficient.) Most common choices for confidence level are are 90%, 95%, or 99% ( = 10%), ( = 5%), ( = 1%) Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. We must be careful to interpret confidence intervals correctly. There is a correct interpretation and many different and creative incorrect interpretations of the confidence interval. For example: < p < (70% with margin of error 2.3%) We are 95% confident that the interval from to actually contains the true value of the population proportion p. This means that if we were to select many different samples of size 1501 and construct the corresponding confidence intervals, 95% of them would actually contain the value of the population proportion p. Note that in this correct interpretation, the level of 95% refers to the success rate of the process being used to estimate the proportion. Incorrect interpretations: 1) There is 95% chance that the true value of p falls between and Any specific population has a fixed and constant value p, and a confidence interval constructed from a sample either contains p or not there is no probability involved. 2) 95% of sample proportions fall between and No we are trying to estimate the true population proportion, not sample proportions. Interpreting a Confidence Interval Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Interpreting a Confidence Interval Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Critical Values We use normal approximation of the binomial distribution of sample proportion a standard z score is used to distinguish between sample statistics that are likely to occur and those that are unlikely to occur. Such a z score is called a critical value. Critical values are based on the following observations: 1)Under certain conditions, the sampling distribution of sample proportions can be approximated by a normal distribution. That is binomial approximation of normal distribution from Ch 6. 2)A z score associated with a sample proportion has a probability of /2 of falling in the right tail. 2)The z score separating the right-tail region is commonly denoted by z /2 and is referred to as a critical value because it is on the borderline separating z scores from sample proportions that are likely to occur from those that are unlikely to occur. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Finding z 2 for a 95% Confidence Level - z 2 z 2 Critical Values 2 = 2.5% = = 5% > alpha = 0.05 > z_alpha = qnorm(1-alpha/2) > z_alpha [1] TI 2 nd VARS invNormal(0.975) Excel =NORMSINV(0.975) StatDisk Analysis->Prob Distibutions->Normal Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. When data from a simple random sample are used to estimate a population proportion p, the margin of error, denoted by E, is the maximum likely difference (with probability 1 , such as 0.95) between the observed proportion and the true value of the population proportion p. The margin of error E is also called the maximum error of the estimate and can be found by multiplying the critical value and the standard deviation of the sample proportions: Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Margin of Error for Proportions Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 1.Verify that the required assumptions are satisfied. 1)The sample is a simple random sample. 2) The conditions for the binomial distribution are satisfied. 3) The normal distribution can be used to approximate the distribution of sample proportions because np 5, and nq 5 are both satisfied. 2. Find the critical value z /2 that corresponds to the desired confidence level (table or technology). 3.Evaluate the margin of error 4. Using the value of the calculated margin of error, E and the value of the sample proportion, p, find the values of p E and p + E. Substitute those values in the general format for the confidence interval: Procedure for Constructing a Confidence Interval for p Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: a.Find the margin of error E that corresponds to a 95% confidence level. b.Find the 95% confidence interval estimate of the population proportion p. c.Based on the results, can we safely conclude that the majority of adults believe in global warming? d.Assuming that you are a newspaper reporter, write a brief statement that accurately describes the results and includes all of the relevant information. In the Chapter Problem we noted that a Pew Research Center poll of 1501 randomly selected U.S. adults showed that 70% of the respondents believe in global warming. The sample results are n = 1501, and Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Requirement check: 1) simple random sample; 2) binomial distribution: fixed number of trials = 1501, trials are independent, two categories of outcomes (believes or does not); probability remains constant. 3) Number of successes and failures 1501*0.7, 1501*0.3 are both >5. a) Use the formula to find the margin of error. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. b) The 95% confidence interval: > pbar = 0.7 > n = 1501 > alpha = 0.05 > z_alpha = qnorm(1-alpha/2) > z_alpha [1] > > E = z_alpha * sqrt( pbar*(1-pbar)/n ) > E [1] > CI = c(pbar - E, pbar + E) > CI [1] TI Stat->Tests->1-PropZInt STATDISK Analysis->Confidence Intervals->One Sample Proportion Margin of error, E = % Confidence Interval: < p < Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Excel Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. c)Based on the confidence interval obtained in part (b), it does appear that the proportion of adults who believe in global warming is greater than 0.5 (or 50%), so we can safely conclude that the majority of adults believe in global warming. Because the limits of and are likely to contain the true population proportion, it appears that the population proportion is a value greater than 0.5. d)Here is one statement that summarizes the results: 70% of United States adults believe that the earth is getting warmer. That percentage is based on a Pew Research Center poll of 1501 randomly selected adults in the United States. In theory, in 95% of such polls, the percentage should differ by no more than 2.3 percentage points in either direction from the percentage that would be found by interviewing all adults in the United States. Caution: Never follow the common misconception that poll results are unreliable if the sample size is a small percentage of the population size. The population size is usually not a factor in determining the reliability of a poll. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Sample Size Suppose we want to collect sample data in order to estimate some population proportion with given accuracy. The question is how many sample items must be obtained? Note that sample size does NOT depend on the population size N !!! Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. TI 2 nd VARS -> invNormal(0.975) > pbar = 0.73; qbar = 1-pbar; > alpha = 0.05 > E = 0.03 > z_alpha = qnorm(1-alpha/2) > z_alpha [1] > n = z_alpha^2 * pbar * qbar / E^2 > n [1] > pbar = 0.5; qbar = 1-pbar; > alpha = 0.05 > E = 0.03 > z_alpha = qnorm(1-alpha/2) > z_alpha [1] > n = z_alpha^2 * pbar * qbar / E^2 > n [1] Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. My example Obtaining point estimate and margin of error from CI A journal article says that of 1200 people surveyed the 99% confidence interval for presidential approval is (37%,42%) > CI_Lower = 0.37 > CI_Upper = 0.42 > pbar = ( CI_Lower + CI_Upper ) /2 > pbar [1] > E = (CI_Upper - CI_Lower) /2 > E [1] 0.025 Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-3 Estimating a Population Mean: Known In addition to knowing the values of the sample data or statistics, in this section we also assume, we know the value of the population standard deviation, . Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Point Estimate of the Population Mean The sample mean x is the best point estimate of the population mean . 1.For all populations, the sample mean x is an unbiased estimator of the population mean , meaning that the distribution of sample means tends to center about the value of the population mean . (review Ch 6) 2. For many populations, the distribution of sample means x tends to be more consistent (with less variation) than the distributions of other sample statistics. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Confidence Interval for Estimating a Population Mean (with Known) = population mean = population standard deviation = sample mean n = number of sample values E = margin of error z /2 = z score separating an area of a/2 in the right tail of the standard normal distribution Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Confidence Interval for Estimating a Population Mean (with Known) 1. The sample is a simple random sample. 2. The value of the population standard deviation is known. 3. Either or both of these conditions is satisfied: The population is normally distributed OR n > 30 so that it is approximately normally distributed by CLT. In either case, the most important part is that: If we collect simple random samples of size n, the sample means x are either exactly or approximately Normal( ) Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. > xbar = > n = 40 > sigma = 26 > alpha = 0.05 > z_alpha = qnorm(1-alpha/2) > z_alpha [1] > > E = z_alpha * sigma/sqrt(n) > E [1] > CI = c(xbar - E, xbar + E) > CI [1] TI Stat->Tests->Zinterval STATDISK Analysis->Confidence Intervals ->Mean One Sample Margin of error, E = % Confident the population mean is within the range: < mean < Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Finding a Sample Size for Estimating a Population Mean If the computed sample size n is not a whole number, round the value of n up to the next larger whole number. If we want more accurate estimate, we need to decrease the margin of error E => sample size n must be substantially increased. When collecting a simple random sample that will be used to estimate a population mean , how many sample values (n) must be obtained to get a certain margin or error? Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: = 0.05 / 2 = z / 2 = 1.96 E = 3 = 15 n = = = With a simple random sample of only 97 statistics students, we will be 95% confident that the sample mean is within 3 IQ points of the true population mean . Assume that we want to estimate the mean IQ score for the population of statistics students. How many statistics students must be randomly selected for IQ tests if we want 95% confidence that the sample mean is within 3 IQ points of the population mean? Note that IQ tests are typically designed so that mean is around 100 and standard deviation is 15 > E = 3; sigma = 15; > alpha = 0.05 > z_alpha = qnorm(1-alpha/2) > z_alpha [1] > n = ( z_alpha*sigma/E )^2 > n [1] TI 2 nd VARS -> invNormal(0.975) Excel =NORMSINV(0.975) Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-4 Estimating a Population Mean: Not Known most typical situation Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. As before, the sample mean x is the best point estimate of the population mean. But now we do not assume that we know population standard deviation , still we can estimate it with sample standard deviation s. Sample Mean William Gossett, Guinness Brewery The number of degrees of freedom for a collection of sample data is the number of sample values that can vary after certain restrictions have been imposed on all data values. The degree of freedom is often abbreviated df. For our case the restriction is that the sample mean is already found: sum(x)/n = xbar fixed number. If say you are averaging 25 test scores and already found that the sample mean is 77, then the sum is fixed at 77*25 =1925. Then, we can freely assign only 24 scores, the last one is fixed to have sum = => degrees of freedom df = n 1 or TI84 Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 2. Using n 1 degrees of freedom, refer to Table A-3 or use technology to find the critical value t 2 that corresponds to the desired confidence level. Procedure for Constructing a Confidence Interval for (With Unknown) 1. Verify that the requirements are satisfied. 3. Evaluate the margin of error E = t 2 s / n. 4. Find the values of Substitute those values in the general format for the confidence interval: 5. Round the resulting confidence interval limits. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. TI 2nd VARS invT(0.975,6) > # Example 1 How to find critical value t_alpha2 > alpha = 0.05 > n = 7 > df = n-1 > t_alpha2 = qt(1-alpha/2,df) > t_alpha2 [1] STATDISK Analysis->Probability Distributions->Student t-distrib t Value: Prob Dens: Cumulative Probs Left: Right: Tailed: Central: Degrees Freedom Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: A common claim is that garlic lowers cholesterol levels. In a test of the effectiveness of garlic, 49 subjects were treated with doses of raw garlic, and their cholesterol levels were measured before and after the treatment. The changes in their levels of LDL cholesterol (in mg/dL) have a mean of 0.4 and a standard deviation of Use the sample statistics of n = 49, = 0.4 and s = 21.0 to construct a 95% confidence interval estimate of the mean net change in LDL cholesterol after the garlic treatment. What does the confidence interval suggest about the effectiveness of garlic in reducing LDL cholesterol? Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Requirements are satisfied: simple random sample n = 49 (i.e., n > 30 good enough even for arbitrary non-normal distribution). 95% implies alpha = With n = 49, the df = 49 1 = 48 Closest df in table is 50, two tails, so t /2 = Using t /2 = 2.009, s = 21.0 and n = 49 the margin of error is: TI 2nd VARS invT(0.975,48) Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Construct the confidence interval: We are 95% confident that the limits of 5.6 and 6.4 actually do contain the value of , the mean of the changes in LDL cholesterol for the population. Because the confidence interval limits contain the value of 0, it is very possible that the mean of the changes in LDL cholesterol is equal to 0, suggesting that the garlic treatment did not affect the LDL cholesterol levels. It does not appear that the garlic treatment is effective in lowering LDL cholesterol. > xbar = 0.4 > n = 49 > s = 21.0 > alpha = 0.05 > df = n-1 > t_alpha2 = qt(1-alpha/2,df) > t_alpha2 [1] > > E = t_alpha2 * s/sqrt(n) > E [1] > CI = c(xbar - E, xbar + E) > CI [1] TI STAT->TESTS->TInterval STATDISK Analysis->Confidence Intervals->Mean One Sample Margin of error, E = % Confident the population mean is within the range: < mean < Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Properties of the Student t Distribution 1. The Student t distribution is different for different sample sizes (see the picture, for the cases n = 3 and n = 12). 2.The Student t distribution has the same general symmetric bell shape as the standard normal distribution but it reflects the greater variability (with wider distributions) that is expected with small samples. 3.The Student t distribution has a mean of t = 0 (just as the standard normal distribution has a mean of z = 0). 4.The standard deviation of the Student t distribution varies with the sample size and is greater than 1 (unlike the standard normal distribution, which has a = 1). 5.As the sample size n gets larger, the Student t distribution gets closer to the normal distribution. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Choosing the Appropriate Distribution Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example, CI for mean women weight. > DataFrame = read.csv("FHEALTH.csv") > attach(DataFrame) > > x = WT # weight > x [1] .. > n = length(x) > n # check how big is the sample [1] 40 > qqnorm(x); qqline(x); # check normality assumption > > xbar = mean(x) > xbar [1] > s = sd(x) # sigma is NOT given, use s - sample standard deviation > s [1] > alpha = 0.01 > df = n-1 > t_alpha2 = qt(1-alpha/2,df) > t_alpha2 [1] > > E = t_alpha2 * s/sqrt(n) # error margin > E [1] > CI = c(xbar - E, xbar + E) # CI for mean mu of the population > CI [1] On TI calculator: Store values in a list L1. Stat->Tests-> TInterval STATDISK 1 st use Datasets to open Female Health Exam 2 nd Data-> Descriptive Statistics on column 3 of weights 3 rd Analysis->Confidence Intervals->Mean One Sample Margin of error, E = % Confident the population mean is within the range: < mean < Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Excel computation for pulse in FHEALTH.XLS Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Finding the Point Estimate and E from a CI Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. s Section 7-5 Estimating a Population Variance skip Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chi-Square Distribution In a normally distributed population with variance 2 assume that we randomly select independent samples of size n and, for each sample, compute the sample variance s 2 (which is the square of the sample standard deviation s). The sample statistic 2 (pronounced chi-square) has a sampling distribution called the chi-square distribution. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. where n = sample size s 2 = sample variance 2 = population variance Chi-Square Distribution 2 = 2 = 2 ( n 1 ) s 2 degrees of freedom = n 1 Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Properties of the Distribution of the Chi-Square Statistic 1. The chi-square distribution is not symmetric, unlike the normal and Student t distributions. Chi-Square Distribution Chi-Square Distribution for df = 10 and df = 20 As the number of degrees of freedom increases, the distribution becomes more symmetric. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 2. The values of chi-square can be zero or positive, but they cannot be negative. 3.The chi-square distribution is different for each number of degrees of freedom, which is df = n 1. As the number of degrees of freedom increases, the chi- square distribution approaches a normal distribution. In Table A-4, each critical value of 2 corresponds to an area given in the top row of the table, and that area represents the cumulative area located to the right of the critical value. Properties of the Distribution of the Chi-Square Statistic cont. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example A simple random sample of ten voltage levels is obtained. Construction of a confidence interval for the population standard deviation requires the left and right critical values of 2 corresponding to a confidence level of 95% and a sample size of n = 10. Find the critical value of 2 separating an area of in the left tail, and find the critical value of 2 separating an area of in the right tail. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example Critical Values of the Chi-Square Distribution > alpha = 0.05 > n = 10 > df = n-1 > df [1] 9 > chi2L = qchisq(alpha/2,df) > chi2L [1] > chi2R = qchisq(1-alpha/2,df) > chi2R [1] Excel =CHIINV(0.025,9) =CHIINV(0.975,9) STATDISK Analysis->Prob Distr->Chi Squared Chi Sq Value: Prob Dens: Cumulative Probs Left: Right: Degrees Freedom Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Estimators of 2 The sample variance s 2 is the best point estimate of the population variance 2. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Estimators of The sample standard deviation s is a commonly used point estimate of (even though it is a biased estimate). Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Confidence Interval for Estimating a Population Standard Deviation or Variance Requirements: 1. The sample is a simple random sample. 2. The population must have normally distributed values (even if the sample is large). Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Confidence Interval for Estimating a Population Standard Deviation or Variance Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Confidence Interval for Estimating a Population Standard Deviation or Variance Confidence Interval for the Population Standard Deviation Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. The proper operation of typical home appliances requires voltage levels that do not vary much. Listed below are ten voltage levels (in volts) recorded in the authors home on ten different days. These ten values have a standard deviation of s = 0.15 volt. Use the sample data to construct a 95% confidence interval estimate of the standard deviation of all voltage levels Example: Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Requirements are satisfied: simple random sample and normality Example: Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. n = 10 so df = 10 1 = 9 Use table A-4 to find: Example: Construct the confidence interval: n = 10, s = 0.15 > s = 0.15 > CIvar = c( (n-1)*s^2/chi2R, (n-1)*s^2/chi2L ) > CIvar [1] > CIsd = sqrt(CI) > CIsd [1] STATDISK Analysis->Confidence Intervals->St-Dev One Sample 95% Confidence Interval for the st dev: < SD < % Confidence Interval for the variance: < VAR < TI does not have it. There are some special programs you have to upload, see p 377. Excel use DDXL p 377. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Evaluation the preceding expression yields: Example: Finding the square root of each, yields this 95% confidence interval estimate of the population standard deviation: Based on this result, we have 95% confidence that the limits of 0.10 volt and 0.27 volt contain the true value of . Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Determining Sample Sizes The procedures for finding the sample size necessary to estimate 2 are much more complex than the procedures given earlier for means and proportions. Instead of using very complicated procedures, we will use Table 7-2. STATDISK also provides sample sizes. With STATDISK, select Analysis, Sample Size Determination, and then Estimate St Dev. Minitab, Excel, and the TI-83/84 Plus calculator do not provide such sample sizes. Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Determining Sample Sizes Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: We want to estimate the standard deviation of all voltage levels in a home. We want to be 95% confident that our estimate is within 20% of the true value of . How large should the sample be? Assume that the population is normally distributed. From Table 7-2, we can see that 95% confidence and an error of 20% for correspond to a sample of size 48. => We should obtain a simple random sample of 48 voltage levels form the population of voltage levels.