Skip to main content
Ch. 9 - Inferences from Two Samples
Triola - Elementary Statistics 14th Edition
Triola14th EditionElementary StatisticsISBN: 9780137366446Not the one you use?Change textbook
Chapter 9, Problem 9.4.10b

Second-Hand Smoke Samples from Data Set 15 “Passive and Active Smoke” include cotinine levels measured in a group of smokers ( n = 40, x_bar = 172.48 ng/mL, 119.50 ng/mL ) and a group of nonsmokers not exposed to tobacco smoke ( n = 40, x_bar = 16.35 ng/mL, 62.53 ng/mL ). Cotinine is a metabolite of nicotine, meaning that when nicotine is absorbed by the body, cotinine is produced.


b. The 40 cotinine measurements from the nonsmoking group consist of these values (all in ng/mL): 1, 1, 90, 244, 309, and 35 other values that are all 0. Does this sample appear to be from a normally distributed population? If not, how are the results from part (a) affected?

Verified step by step guidance
1
Step 1: Understand the problem. The goal is to determine whether the sample of cotinine measurements from the nonsmoking group appears to come from a normally distributed population. If it does not, we need to consider how this affects the results from part (a).
Step 2: Analyze the data distribution. The nonsmoking group has 40 measurements, with 35 values being 0, and the remaining values are 1, 1, 90, 244, and 309. A normal distribution typically has a symmetric, bell-shaped curve, so we need to assess whether this data aligns with that shape.
Step 3: Use a histogram or boxplot to visualize the data. Plot the data to observe its shape. If the data is heavily skewed or has extreme outliers (e.g., the values 244 and 309), it is unlikely to be normally distributed.
Step 4: Perform a normality test. Use a statistical test such as the Shapiro-Wilk test or Anderson-Darling test to formally assess normality. These tests will provide a p-value, and if the p-value is below a significance level (e.g., 0.05), we reject the null hypothesis that the data is normally distributed.
Step 5: Interpret the results. If the data is not normally distributed, the results from part (a) may be affected because many statistical methods (e.g., t-tests) assume normality. In such cases, alternative methods like nonparametric tests or data transformations may be needed to ensure valid conclusions.

Verified video answer for a similar problem:

This video solution was recommended by our tutors as helpful for the problem above.
Video duration:
2m
Was this helpful?

Key Concepts

Here are the essential concepts you must grasp in order to answer the question correctly.

Normal Distribution

Normal distribution is a probability distribution that is symmetric about the mean, indicating that data near the mean are more frequent in occurrence than data far from the mean. It is characterized by its bell-shaped curve, defined by its mean and standard deviation. Understanding whether a dataset follows a normal distribution is crucial for applying many statistical tests, which often assume normality.
Recommended video:
Guided course
09:47
Finding Standard Normal Probabilities using z-Table

Cotinine Levels

Cotinine is a metabolite of nicotine, commonly used as a biomarker to measure exposure to tobacco smoke. In the context of the provided data, cotinine levels are compared between smokers and nonsmokers to assess the impact of second-hand smoke. Analyzing these levels helps in understanding the health implications of tobacco exposure and the effectiveness of smoking cessation efforts.
Recommended video:
04:46
Critical Values: z Scores

Statistical Tests for Normality

Statistical tests for normality, such as the Shapiro-Wilk test or the Kolmogorov-Smirnov test, are used to determine if a dataset follows a normal distribution. If the sample does not appear to be normally distributed, it may affect the validity of parametric tests, which rely on this assumption. Non-normal data may require the use of non-parametric tests or data transformation techniques to ensure accurate analysis.
Recommended video:
Guided course
06:34
Step 2: Calculate Test Statistic
Related Practice
Textbook Question

In Exercises 5–20, assume that the two samples are independent simple random samples selected from normally distributed populations, and do not assume that the population standard deviations are equal. (Note: Answers in Appendix D include technology answers based on Formula 9-1 along with “Table” answers based on Table A-3 with df equal to the smaller of n1-1 and n2-1)


Color and Cognition Researchers from the University of British Columbia conducted a study to investigate the effects of color on cognitive tasks. Words were displayed on a computer screen with background colors of red and blue. Results from scores on a test of word recall are given below. Higher scores correspond to greater word recall.


b. Construct a confidence interval appropriate for the hypothesis test in part (a). What is it about the confidence interval that causes us to reach the same conclusion from part (a)?


" style="max-width: 100%; white-space-collapse: preserve;" width="440">

91
views
Textbook Question

In Exercises 5–16, use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal.


The Freshman 15 The “Freshman 15” refers to the belief that college students gain 15 lb (or 6.8 kg) during their freshman year. Listed below are weights (kg) of randomly selected male college freshmen (from Data Set 13 “Freshman 15” in Appendix B). The weights were measured in September and later in April.


b. Construct the confidence interval that could be used for the hypothesis test described in part (a). What feature of the confidence interval leads to the same conclusion reached in part (a)?

105
views
Textbook Question

In Exercises 5–16, use the listed paired sample data, and assume that the samples are simple random samples and that the differences have a distribution that is approximately normal.


Measured and Reported Weights Listed below are measured and reported weights (lb) of random female subjects (from Data Set 4 “Measured and Reported” in Appendix B).


b. Construct the confidence interval that could be used for the hypothesis test described in part (a). What feature of the confidence interval leads to the same conclusion reached in part (a)?


121
views
Textbook Question

Independent Samples Which of the following involve independent samples?


b. Data Set 6 “Births” includes birth weights of a sample of baby boys and a sample of baby girls.


113
views
Textbook Question

Can Dogs Detect Malaria? A study was conducted to determine whether dogs could detect malaria from socks worn by malaria patients and socks worn by patients without malaria. Among 175 socks worn by malaria patients, the dogs made correct identifications 123 times. Among 145 socks worn by patients without malaria, the dogs made correct identifications 131 times (based on data presented at an annual meeting of the American Society of Tropical Medicine, by principal investigator Steve Lindsay). Use a 0.05 significance level to test the claim of no difference between the two rates of correct responses.


b. Test the claim by constructing an appropriate confidence interval.


71
views
Textbook Question

In Exercises 5–20, assume that the two samples are independent simple random samples selected from normally distributed populations, and do not assume that the population standard deviations are equal. (Note: Answers in Appendix D include technology answers based on Formula 9-1 along with “Table” answers based on Table A-3 with df equal to the smaller of n1-1 and n2-1)


Bicycle Commuting A researcher used two different bicycles to commute to work. One bicycle was steel and weighed 30.0 lb; the other was carbon and weighed 20.9 lb. The commuting times (minutes) were recorded with the results shown below (based on data from “Bicycle Weights and Commuting Time,” by Jeremy Groves, British Medical Journal).


b. Construct the confidence interval suitable for testing the claim in part (a).


107
views