BackStep-by-Step Guidance for Elementary Statistics Exam 1
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Q1. What term is used to describe a survey where people choose to respond themselves? What is wrong with this sampling method?
Background
Topic: Sampling Methods in Statistics
This question tests your understanding of different types of sampling methods, especially those that can introduce bias into survey results.
Key Terms:
Voluntary Response Sample: A sample where participants choose to be part of the survey.
Self-Selected Sample: Another term for voluntary response, where individuals decide to participate.
Census: A survey that attempts to include the entire population.
Population: The entire group being studied.
Step-by-Step Guidance
Read the description of the survey: people responded to a question posted online, and only those who chose to respond are included.
Recall the definitions of the terms provided (voluntary response, census, self-selected, population).
Think about which terms accurately describe a sample where participation is optional and not everyone in the population is included.
Consider what is problematic about this sampling method: does it represent the whole population fairly, or could it introduce bias?
Try solving on your own before revealing the answer!
Q2. What type of data is the number of wheels on a vehicle?
Background
Topic: Types of Data (Discrete vs. Continuous)
This question tests your ability to distinguish between discrete and continuous data.
Key Terms:
Discrete Data: Data that can only take specific, separate values (often counts).
Continuous Data: Data that can take any value within a range (often measurements).
Step-by-Step Guidance
Think about the possible values for the number of wheels on a vehicle (e.g., 2, 4, 6, etc.).
Ask yourself: Can the number of wheels be a fraction or any value in an interval, or must it be a whole number?
Review the definitions of discrete and continuous data to determine which applies.
Match the correct answer choices to your reasoning.
Try solving on your own before revealing the answer!
Q3. What is the most appropriate level of measurement for brand of toothpaste?
Background
Topic: Levels of Measurement (Nominal, Ordinal, Interval, Ratio)
This question tests your understanding of how to classify data according to its level of measurement.
Key Terms:
Nominal: Categories with no inherent order (e.g., brand names).
Ordinal: Categories with a meaningful order but no consistent difference between them.
Interval: Ordered categories with equal intervals but no true zero.
Ratio: Like interval, but with a true zero point.
Step-by-Step Guidance
Consider what kind of information 'brand of toothpaste' provides: is it a name, a ranking, or a number?
Recall the definitions of the four levels of measurement.
Decide which level best fits data that consists of brand names.
Try solving on your own before revealing the answer!
Q4. What is the most appropriate level of measurement for the prison population of different countries?
Background
Topic: Levels of Measurement
This question tests your ability to classify numerical data according to its level of measurement.
Key Terms:
Ratio Level: Data with a true zero and meaningful ratios (e.g., population counts).
Step-by-Step Guidance
Think about what 'prison population' means: is it a count, a category, or a ranking?
Recall the characteristics of the ratio level of measurement (true zero, ratios make sense).
Determine if the data fits the ratio level or another level.
Try solving on your own before revealing the answer!
Q5. Express the relative frequencies of grades (A, B, C, D, F) as percentages given their frequencies: 4, 12, 3, 5, 3.
Background
Topic: Frequency and Relative Frequency
This question tests your ability to convert frequencies into relative frequencies (percentages).
Key Formula:
Relative Frequency (%) =
Step-by-Step Guidance
Add up all the frequencies to find the total number of grades.
For each grade, divide its frequency by the total and multiply by 100 to get the percentage.
Fill in the table with the calculated percentages for each grade.
Try solving on your own before revealing the answer!
Q6. What does it mean for the findings of a statistical analysis to be statistically significant?
Background
Topic: Statistical Significance
This question tests your understanding of what it means for results to be statistically significant in hypothesis testing.
Key Terms:
Statistical Significance: The likelihood that a result is not due to random chance.
Step-by-Step Guidance
Review the definition of statistical significance in the context of hypothesis testing.
Consider which answer choices correctly describe statistical significance (e.g., results unlikely to occur by chance).
Eliminate choices that refer to practical importance or chance occurrence.
Try solving on your own before revealing the answer!
Data Analysis Problem 1: Create a frequency table with bins of width 10 for the ages of award-winning male actors, and sketch a histogram.
Background
Topic: Frequency Distributions and Histograms
This question tests your ability to organize raw data into a frequency table and visualize it with a histogram.
Key Steps:
Frequency Table: A table that shows how many data points fall into each bin (interval).
Histogram: A bar graph representing the frequency of data within each bin.
Step-by-Step Guidance
Find the minimum and maximum ages in the data set to determine the range.
Divide the range into intervals (bins) of width 10 (e.g., 20-29, 30-39, etc.).
Count how many ages fall into each bin and record these frequencies in a table.
Use the frequency table to sketch a histogram, with bins on the x-axis and frequencies on the y-axis.
Try solving on your own before revealing the answer!
Data Analysis Problem 1 (continued): If you had data for female actors, what graph would you use to compare males to females? Why?
Background
Topic: Comparative Data Visualization
This question tests your ability to choose appropriate graphs for comparing two groups.
Key Terms:
Side-by-Side Boxplots: Useful for comparing distributions between groups.
Double Histogram: Two histograms on the same axes for comparison.
Step-by-Step Guidance
Think about what you want to compare (e.g., center, spread, outliers) between the two groups.
Recall which types of graphs are best for visual comparison of distributions.
Choose a graph type and explain why it is appropriate for this comparison.
Try solving on your own before revealing the answer!
Data Analysis Problem 2: Calculate the mean, median, midrange, standard deviation, and range for the given cell phone radiation data.
Background
Topic: Measures of Center and Spread
This question tests your ability to compute and interpret various measures of center (mean, median, midrange) and spread (standard deviation, range).
Key Formulas:
Mean:
Median: Middle value when data is ordered.
Midrange:
Standard Deviation:
Range:
Step-by-Step Guidance
Order the data from smallest to largest to help with median, range, and midrange calculations.
Calculate the mean by summing all values and dividing by the number of data points.
Find the median by identifying the middle value (or average of two middle values if the count is even).
Compute the midrange by adding the smallest and largest values and dividing by 2.
Calculate the range by subtracting the smallest value from the largest.
For standard deviation, subtract the mean from each value, square the result, sum these squares, divide by n-1, and take the square root.
Try solving on your own before revealing the answer!
Data Analysis Problem 2 (continued): Is there a reason to prefer one measure of center over the others for this data? If so, which and why?
Background
Topic: Choosing Measures of Center
This question tests your ability to interpret data characteristics (such as skewness or outliers) and select the most appropriate measure of center.
Key Concepts:
Mean: Sensitive to outliers and skewed data.
Median: Resistant to outliers; good for skewed data.
Midrange: Very sensitive to outliers.
Step-by-Step Guidance
Look at the data for any extreme values (outliers) or skewness.
Recall which measures of center are robust to outliers and which are not.
Decide which measure would best represent the 'typical' value for this data set and explain your reasoning.
Try solving on your own before revealing the answer!
Data Analysis Problem 3: Prepare a five-number summary for the NV group fusion times.
Background
Topic: Five-Number Summary
This question tests your ability to summarize a data set using the minimum, first quartile (Q1), median, third quartile (Q3), and maximum.
Key Terms:
Five-Number Summary: Minimum, Q1, Median, Q3, Maximum.
Step-by-Step Guidance
Order the data from smallest to largest.
Identify the minimum and maximum values.
Find the median (middle value).
Find Q1 (median of the lower half) and Q3 (median of the upper half).
Try solving on your own before revealing the answer!
Data Analysis Problem 3 (continued): How long would a participant need to take to be an outlier? How many outliers are there?
Background
Topic: Identifying Outliers Using the IQR Method
This question tests your ability to use the interquartile range (IQR) to determine outlier thresholds.
Key Formulas:
IQR:
Outlier Boundaries: Lower: , Upper:
Step-by-Step Guidance
Calculate the IQR using your Q1 and Q3 values.
Compute the lower and upper outlier boundaries using the formulas above.
Identify any data points below the lower boundary or above the upper boundary as outliers.
Try solving on your own before revealing the answer!
Data Analysis Problem 3 (continued): Make a box plot to represent the NV group.
Background
Topic: Box Plots
This question tests your ability to construct a box plot using the five-number summary and identify outliers.
Key Steps:
Draw a number line that covers the range of your data.
Mark the five-number summary values (min, Q1, median, Q3, max).
Draw a box from Q1 to Q3, with a line at the median.
Extend 'whiskers' to the min and max (excluding outliers), and plot outliers as individual points.
Step-by-Step Guidance
Use your five-number summary to determine the box and whisker positions.
Mark any outliers as individual points beyond the whiskers.
Label your axes and ensure the plot is to scale.