Comprehensive Guidance for Key Statistics Concepts and Problems

Notes Practice Video lessons

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Q1. Describe the scenario which makes an experiment a random experiment.

Background

Topic: Random Experiments in Probability

This question tests your understanding of what defines a random experiment in probability theory.

Key Terms:

Random Experiment: An action or process that leads to one of several possible outcomes, where the outcome cannot be predicted with certainty in advance.
Outcome: The result of a single trial of an experiment.
Sample Space: The set of all possible outcomes.

Step-by-Step Guidance

Think about what makes an experiment 'random'—focus on unpredictability and repeatability.
Consider examples (like tossing a coin or rolling a die) and identify what they have in common regarding outcomes.
Formulate a definition that includes the key aspects: repeatability, unpredictability, and a well-defined set of possible outcomes.

Try solving on your own before revealing the answer!

Q2. Be able to list how many different ways a (i) random experiment can end and (ii) an event can occur.

Background

Topic: Counting Outcomes and Events

This question is about understanding the difference between outcomes of an experiment and the ways an event can occur.

Key Terms:

Outcome: A possible result of a random experiment.
Event: A set of one or more outcomes.
Sample Space (S): The set of all possible outcomes.

Step-by-Step Guidance

Identify the sample space for a given random experiment (e.g., rolling a die: S = {1,2,3,4,5,6}).
Count the number of possible outcomes (the size of the sample space).
For a specific event (e.g., rolling an even number), list all outcomes that make up the event.
Count the number of outcomes that correspond to the event.

Try solving on your own before revealing the answer!

Q3. Differentiate between an empirical probability and a theoretical probability.

Background

Topic: Types of Probability

This question tests your understanding of the difference between probabilities based on data (empirical) and those based on reasoning or models (theoretical).

Key Terms:

Empirical Probability: Probability based on observed data from experiments or trials.
Theoretical Probability: Probability calculated using known possible outcomes, assuming all are equally likely.

Step-by-Step Guidance

Recall the formula for empirical probability:
Recall the formula for theoretical probability:
Think about situations where you would use each type (e.g., flipping a coin 100 times vs. calculating probability from a model).
Summarize the main difference in your own words.

Try solving on your own before revealing the answer!

Q4. Compute the following probabilities: (i) P(A Or B) (ii) P(A And B).

Background

Topic: Basic Probability Rules

This question tests your ability to use the addition and multiplication rules for probabilities.

Key Formulas:

Addition Rule:
Multiplication Rule:

Step-by-Step Guidance

Identify the probabilities given (e.g., , , ).
For , use the addition rule formula above.
For , use the multiplication rule. If A and B are independent, .
Plug in the values you have, but stop before calculating the final result.

Try solving on your own before revealing the answer!

Q5. Explain what it means for events to be (i) mutually exclusive and (ii) independent. Additionally, be able to determine if two events are (i) mutually exclusive and (ii) independent.

Background

Topic: Relationships Between Events

This question tests your understanding of two important concepts in probability: mutual exclusivity and independence.

Key Terms:

Mutually Exclusive: Events that cannot happen at the same time ().
Independent: Events where the occurrence of one does not affect the probability of the other ().

Step-by-Step Guidance

Define mutually exclusive events and give an example (e.g., rolling a 2 or a 3 on a single die roll).
Define independent events and give an example (e.g., flipping two coins).
To check if two events are mutually exclusive, see if they can occur together.
To check if two events are independent, see if or .

Try solving on your own before revealing the answer!

**Q6. Compute the P(A And B) = P(A)*P(B) when A and B are independent events.**

Background

Topic: Probability of Independent Events

This question tests your ability to use the multiplication rule for independent events.

Key Formula:

(if A and B are independent)

Step-by-Step Guidance

Identify the probabilities and .
Confirm that A and B are independent.
Multiply by to set up the calculation, but do not compute the final value.

Try solving on your own before revealing the answer!

Q7. Calculate the conditional probability of A given B; P(A | B) = P(A And B)/P(B).

Background

Topic: Conditional Probability

This question tests your ability to calculate the probability of one event given that another has occurred.

Key Formula:

Step-by-Step Guidance

Identify and from the information given.
Plug these values into the conditional probability formula above.
Simplify the expression as much as possible without calculating the final value.

Try solving on your own before revealing the answer!

Q8. Draw a tree diagram and use the tree diagram to compute more complex conditional probabilities, such as P(originating event | secondary/other event)

Background

Topic: Tree Diagrams and Conditional Probability

This question tests your ability to visualize and solve multi-stage probability problems using tree diagrams.

Key Concepts:

Tree Diagram: A graphical representation of all possible outcomes of a sequence of events.
Conditional Probability: Probability of an event given another event has occurred.

Step-by-Step Guidance

Identify the sequence of events and their probabilities.
Draw branches for each possible outcome at each stage.
Label the probabilities on each branch.
To find , use the tree to trace all paths leading to the secondary event, then apply the conditional probability formula.

Try solving on your own before revealing the answer!

Q9. Explain the meaning of the "revisionist" probability computed in Outcome #8: how this is an 'updated' probability of the originating event occurring GIVEN new information/event occurs

Background

Topic: Bayesian (Posterior) Probability

This question tests your understanding of how probabilities are updated when new information is available (Bayes' Theorem).

Key Terms:

Posterior Probability: The updated probability of an event after taking into account new evidence.
Bayes' Theorem:

Step-by-Step Guidance

Recall that the revisionist (posterior) probability is calculated after observing new data.
Understand that this is different from the prior probability, which is before new data is considered.
Explain in your own words how the probability is 'revised' or updated using the new information.

Try solving on your own before revealing the answer!

Q10. Compute the Prevalence, Sensitivity, Specificity of a test.

Background

Topic: Diagnostic Test Evaluation

This question tests your ability to calculate key measures for evaluating diagnostic tests.

Key Formulas:

Prevalence:
Sensitivity:
Specificity:

Step-by-Step Guidance

Identify the numbers of true positives, false negatives, true negatives, and false positives from the data.
Calculate prevalence using the formula above.
Calculate sensitivity and specificity using their respective formulas, but stop before the final calculation.

Try solving on your own before revealing the answer!

Q11. Compute the Relative Risk and interpret its meaning in the context of the data.

Background

Topic: Risk Analysis in Epidemiology

This question tests your ability to calculate and interpret relative risk from a contingency table or data set.

Key Formula:

Step-by-Step Guidance

Calculate the risk (probability of outcome) in both the exposed and unexposed groups.
Set up the relative risk formula using these two risks.
Interpret what a relative risk greater than, less than, or equal to 1 means in context.

Try solving on your own before revealing the answer!

Q12. Identify a population. Identify if a population variable is numerical or categorical.

Background

Topic: Populations and Variables

This question tests your understanding of populations and the types of variables in statistics.

Key Terms:

Population: The entire group of individuals or items of interest.
Numerical Variable: Takes on values that are numbers (quantitative).
Categorical Variable: Takes on values that are categories or labels (qualitative).

Step-by-Step Guidance

Define the population for a given study or scenario.
Identify the variable being measured and determine if it is numerical or categorical.
Give an example of each type of variable.

Try solving on your own before revealing the answer!

Q13. Be able to distinguish between (i) a population and sample and (ii) a parameter and a statistic. Be able to make a statistical inference statement.

Background

Topic: Populations, Samples, Parameters, and Statistics

This question tests your understanding of the basic building blocks of statistical inference.

Key Terms:

Population: The entire group of interest.
Sample: A subset of the population.
Parameter: A numerical summary of a population.
Statistic: A numerical summary of a sample.

Step-by-Step Guidance

Define population and sample in your own words.
Define parameter and statistic, and give examples of each.
Write a statistical inference statement (e.g., using a sample statistic to estimate a population parameter).

Try solving on your own before revealing the answer!

Q14. Differentiate between the different methods of collecting data: Observational studies and Experimental studies. Be familiar with concepts such as control group, placebo and the placebo effect, random assignment, blinded and double-blinded experiments. When should observational studies be used? When should experimental studies be used?

Background

Topic: Study Design in Statistics

This question tests your understanding of how data can be collected and the strengths/weaknesses of different study designs.

Key Terms:

Observational Study: Researchers observe without intervention.
Experimental Study: Researchers assign treatments to subjects.
Control Group, Placebo, Placebo Effect, Random Assignment, Blinding: Key concepts in experimental design.

Step-by-Step Guidance

Define observational and experimental studies.
Explain the role of control groups, placebos, and blinding in experiments.
Discuss when each type of study is appropriate.

Try solving on your own before revealing the answer!

Q15. Make the distinction between a non-random sample (sampling bias) and a random sample. Be able to name some sources of measurement bias, such as non-response bias, response bias, and instrument bias.

Background

Topic: Sampling Methods and Bias

This question tests your understanding of sampling techniques and potential sources of bias in data collection.

Key Terms:

Random Sample: Every member of the population has an equal chance of being selected.
Sampling Bias: Systematic error due to non-random sampling.
Measurement Bias: Errors in data collection (non-response, response, instrument bias).

Step-by-Step Guidance

Define random and non-random samples.
List and explain types of measurement bias.
Give examples of each type of bias.

Try solving on your own before revealing the answer!

Q16. Identify if a random sampling design is a simple random sample, a stratified random sample, a systematic random sample, or a cluster sample.

Background

Topic: Sampling Designs

This question tests your ability to distinguish between different random sampling methods.

Key Terms:

Simple Random Sample: Every member has an equal chance; selection is completely random.
Stratified Random Sample: Population divided into strata, then random samples taken from each stratum.
Systematic Random Sample: Every kth member is selected from a list.
Cluster Sample: Population divided into clusters, some clusters are randomly selected, and all members in those clusters are sampled.

Step-by-Step Guidance

Review the definitions of each sampling method.
Given a scenario, identify which method is being used based on how the sample is selected.
Explain your reasoning for the classification.

Try solving on your own before revealing the answer!

Q17. Visualize numerical data through the creation of either of the three types of histograms: frequency, relative frequency (or percentage), density-scale. Use the created histogram to (i) identify the distribution shape of the sample and (ii) estimate theoretical probabilities. Use StatCrunch to create these data visualization/graphs.

Background

Topic: Data Visualization with Histograms

This question tests your ability to create and interpret different types of histograms and use them for probability estimation.

Key Terms:

Frequency Histogram: Shows counts in each bin.
Relative Frequency Histogram: Shows proportions or percentages in each bin.
Density-Scale Histogram: Area of bars represents probability; total area = 1.

Step-by-Step Guidance

Organize your data into bins (intervals).
Count the number of data points in each bin for a frequency histogram.
Divide counts by total number for relative frequency, or adjust bar heights for density-scale.
Use the histogram to describe the distribution shape (e.g., symmetric, skewed).
Estimate probabilities by looking at the area under the relevant bars.

Try solving on your own before revealing the answer!

Q18. Compute the sample mean, sample median, and sample variance using the formulae (by hand) and with StatCrunch. Interpret the meaning of the sample median and sample standard deviation in the context of the data.

Background

Topic: Measures of Central Tendency and Spread

This question tests your ability to calculate and interpret basic descriptive statistics.

Key Formulas:

Sample Mean:
Sample Median: Middle value when data is ordered.
Sample Variance:

Step-by-Step Guidance

Order the data and find the middle value for the median.
Calculate the mean by summing all values and dividing by n.
Calculate the variance using the formula above, but stop before the final calculation.
Interpret what the median and standard deviation tell you about the data.

Try solving on your own before revealing the answer!

Q19. Compute the values of Q1 and Q3, the IQR, and both the lower and upper fences. Create a boxplot and determine if you have outliers.

Background

Topic: Five-Number Summary and Boxplots

This question tests your ability to summarize data using quartiles and identify outliers.

Key Formulas:

Q1: First quartile (25th percentile)
Q3: Third quartile (75th percentile)
IQR:
Lower Fence:
Upper Fence:

Step-by-Step Guidance

Order the data and find Q1 and Q3.
Calculate the IQR.
Compute the lower and upper fences using the formulas above.
Identify any data points outside the fences as outliers, but stop before listing them.

Try solving on your own before revealing the answer!

Q20. Apply the 68-95-99% Rule to approximate probabilities from the sample; compute z-values, use either the Z-table or STATCRUNCH to compute probabilities associated with computed z-values.

Background

Topic: Normal Distribution and Z-Scores

This question tests your ability to use the empirical rule and z-scores to estimate probabilities for normal distributions.

Key Formulas:

Empirical Rule: About 68% of data within 1 SD, 95% within 2 SD, 99.7% within 3 SD of the mean.
Z-Score:

Step-by-Step Guidance

Calculate the z-score for a given value using the formula above.
Use the empirical rule to estimate the probability for common z-values (1, 2, 3).
For other z-values, use a Z-table or StatCrunch to find the probability, but stop before reading the final value.