BackThe Normal Curve, Standardization, and z Scores
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
The Normal Curve, Standardization, and z Scores
The Normal Curve
The normal curve is a fundamental concept in statistics and probability, characterized by its bell-shaped, unimodal, and symmetric appearance. It is mathematically defined and serves as the foundation for inferential statistics.
Normal Curve: A specific bell-shaped curve that is unimodal (one peak), symmetric about the mean, and mathematically defined.
Importance: The normal curve underlies many statistical methods and is essential for understanding distributions of data in natural and social sciences.
Sample Size and the Normal Curve
The relationship between sample size and the normal curve is crucial for understanding how data distributions behave as more data is collected.
Sample Size: As the sample size increases, the sample distribution more closely resembles a normal curve.
When the sample size approaches the population size, the distribution tends to be normally distributed.
Small samples may not appear normal, but larger samples tend to approximate the normal curve.
Examples:
Sample of 5: Distribution is irregular and may not resemble a normal curve.
Sample of 30: Distribution begins to take on a bell-shaped appearance.
Sample of 157: Distribution closely matches the normal curve.
Variables and Normal Distribution
Many continuous variables (e.g., height, test scores) are approximately normally distributed in large populations.
Nominal (categorical) variables cannot be normally distributed because the normal curve applies to continuous data.
Standardization, z Scores, and the Normal Curve
Standardization allows comparison of scores from different distributions by converting them to a common scale with a known mean and standard deviation.
z Score: The number of standard deviations a particular score is from the mean.
The z Distribution
The z distribution is a normal distribution of standardized scores (z scores), with a mean of 0 and a standard deviation of 1.
Calculating a Particular z Score
Step 1: Subtract the mean of the population from the raw score.
Step 2: Divide by the standard deviation of the population.
Formula:
X: Raw score
μ: Population mean
σ: Population standard deviation
Transforming z Scores into Raw Scores
Step 1: Multiply the z score by the standard deviation of the population.
Step 2: Add the mean of the population to this product.
Formula:
Key Terms
z Distribution: A normal distribution of standardized scores.
Standard Normal Distribution: A normal distribution of z scores (mean = 0, standard deviation = 1).
Comparing Scores from Different Scales
If raw scores are measured on different scales, converting them to z scores allows for direct comparison.
z Scores as a Standardization Tool in Research
z scores are used to compare values across different distributions and to interpret research findings in terms of standard deviations from the mean.
Transforming z Scores into Percentiles
z scores indicate where a value fits into a normal distribution.
The area under the normal curve can be used to calculate percentiles for any score.
The Normal Curve and Percentages
The normal curve can be divided into sections that correspond to percentages of the data:
Approximately 68% of data falls within ±1 standard deviation of the mean.
Approximately 95% falls within ±2 standard deviations.
Approximately 99.7% falls within ±3 standard deviations.
The Central Limit Theorem
The Central Limit Theorem states that the distribution of sample means approaches a normal distribution as the sample size increases, regardless of the population's distribution.
Sample means are normally distributed even if the population is not normal.
A distribution of means is less variable than a distribution of individual scores.
Creating Distributions
Distribution of Scores: Shows the frequency of individual data points.
Distribution of Means: Shows the frequency of sample means, which is less variable and more normal in shape.
Characteristics of the Distribution of Means
The mean of the distribution of means equals the population mean.
The standard deviation of the distribution of means (standard error) is less than the population standard deviation.
Standard error formula:
σM: Standard error (standard deviation of the distribution of means)
σ: Population standard deviation
N: Sample size
Using the Appropriate Measure of Spread
Distribution of means is less spread out than the distribution of individual scores.
Scores Versus Means
As sample size increases, the distribution of means becomes more normal, even if the original data is skewed.
Using the Central Limit Theorem to Make Comparisons with z Scores
When the population is not available, use the distribution of means for standardization.
The z formula for means:
M: Sample mean
μM: Mean of the distribution of means
σM: Standard error
Identifying Outliers or Cheaters
z scores can be used to identify values that are unusually high or low, which may indicate errors or outliers in the data.
Additional info: While these concepts are foundational in statistics, they are also relevant to General Chemistry in the context of data analysis, error analysis, and interpreting experimental results.