Skip to main content
Back

Statistics Review: Probability Distributions, Binomial & Normal Distributions, Proportions, Hypothesis Testing, and Chi-Square Tests

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Probability Distributions

Theoretical and Frequentist Distributions

Probability distributions describe the relative frequencies of outcomes of a random event. They are essential models for random experiments in statistics.

  • All possible outcomes of a random experiment must be listed.

  • Probability of each outcome must be specified.

  • Sum of all probabilities must equal 1 (unity).

Example: Probability distribution for rolling a fair die:

Number

1

2

3

4

5

6

Probability

1/6

1/6

1/6

1/6

1/6

1/6

Random Variables

Discrete Random Variables

Discrete random variables take on countable values.

  • Examples: Number of siblings, number of contacts stored on a phone.

Continuous Random Variables

Continuous random variables can take any value within a range and are not countable.

  • Examples: Height of basketball players, weight of an object, duration of phone calls.

Normal Distribution and Z-Scores

Finding Probabilities Using the Standard Normal Distribution

To find probabilities for normally distributed variables:

  1. Convert values to z-scores using the formula:

  2. Look up the corresponding area (probability) in the standard normal table (z-table).

Example: For grams, , : Probability for is 0.1056.

Using Technology (TI Calculators)

  1. Compute z

  2. Press 2ND VARS

  3. Scroll to normalcdf(

  4. Enter limits and parameters

  5. Output is the p-value (probability)

Binomial Distribution & Discrete Random Variables

Conditions for Binomial Distribution

  • Fixed number of trials ()

  • Each trial has only two possible outcomes (success/failure)

  • Trials are independent

  • Probability of success () is constant for each trial

Binomial Probability Formula

The probability of getting successes in independent trials:

Use binomial formula, binomial tables, or technology (e.g., binompdf(n, p, x) on calculators).

Notation:

Binomial Syntax: binompdf(n, p, x)

x

P(x)

For x=0; binompdf(4, 0.3, 0)

0

0.240

For x=1; binompdf(4, 0.3, 1)

1

0.412

For x=2; binompdf(4, 0.3, 2)

2

0.265

For x=3; binompdf(4, 0.3, 3)

3

0.076

For x=4; binompdf(4, 0.3, 4)

4

0.008

Sampling Distributions & Confidence Intervals: Proportion

Sample Proportions: Conditions

  • Sampling is random and independent

  • Sample size is large enough: and

  • Population is at least 10 times larger than the sample (if sampling without replacement)

Sample Proportions: Conclusions

  • Sampling distribution of is approximately normal

  • Mean:

  • Standard deviation:

  • Estimated standard deviation:

Hypothesis Testing for One Sample: Proportions

Steps for Hypothesis Testing

  1. Hypothesize Formulation: State and (e.g., , )

  2. Prepare - Check CLT Conditions: Choose significance level , check randomness, sample size, and population size

  3. Compute (Statistics) to Compare: Compute , then find p-value

  4. Interpret and Conclude: Compare p-value with ; if p-value , reject

Example: If 40% of a sample of 200 own dogs, test if this is higher than 34% (p-value = 0.0367, so reject at )

Type I and Type II Errors

  • Type I Error: Rejecting when it is true

  • Type II Error: Not rejecting when it is false

Confidence Intervals (CI) for Proportions

Constructing a Confidence Interval

  1. Check CLT conditions (random, large sample, large population)

  2. Calculate estimated standard deviation:

  3. Find for the desired confidence level (e.g., for 90%)

  4. Margin of error:

  5. CI:

Example: For , , 90% CI is (0.3222, 0.3778)

Sample Size for Desired Margin of Error

To achieve a margin of error at confidence level :

Sampling Distributions & Confidence Intervals: Mean

Population and Sample Variances

  • Sample Variance:

  • Sample Standard Deviation:

Sample Mean: Conditions

  • Random sample

  • Normality (population normal or large, usually )

  • Population at least 10 times larger than sample

Hypothesis Testing for One Sample: Means

Tailed vs. Two-Tailed Tests

Two-Tailed

One-Tailed (Left)

One-Tailed (Right)

Steps for Hypothesis Testing (Means)

  1. Formulate hypotheses (, )

  2. Check CLT conditions (random, normality, large population)

  3. Compute test statistic: , where

  4. Find p-value and compare with

  5. Interpret and conclude

Example: Testing if mean years of experience among nurses has increased (p-value = 0.0187, so reject at )

Chi-Square Tests & Goodness of Fit

Chi-Square Statistic

Measures how much observed frequencies differ from expected frequencies.

Formula:

Example Table: Observed vs. Expected

Girls

Boys

Total

Gym (Observed)

20

50

70

No Gym (Observed)

30

200

230

Total

50

250

300

Girls

Boys

Gym (Expected)

11.67

58.33

No Gym (Expected)

38.33

191.67

Expected Frequency:

Calculated

Conditions for Chi-Square Tests

  • Data must be counts for categories

  • Random sample

  • Expected cell frequency at least 5

  • Counts are independent

Additional info:

  • These notes cover core topics from chapters 6-12 of a typical introductory statistics course, including probability distributions, binomial and normal distributions, sampling distributions, confidence intervals, hypothesis testing, and chi-square tests.

Pearson Logo

Study Prep