BackFundamental Concepts and Applications in Introductory Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Statistical and Practical Significance
Definitions and Distinctions
Understanding the difference between statistical significance and practical significance is essential in interpreting results from statistical analyses.
Statistical significance: Achieved when results are very unlikely to occur by chance. It is determined by hypothesis testing and p-values.
Practical significance: Relates to whether the observed effect is large enough to be meaningful in a real-world context, regardless of statistical significance.
Example
A new drug reduces blood pressure by 1 mmHg with a p-value of 0.001. Statistically significant, but the effect may not be practically significant for patient health.
Bias in Data Sources
Potential for Bias
Bias occurs when a data source or collection method systematically favors certain outcomes.
Organizations may have incentives to present data in a way that supports their interests, leading to potential bias.
Independent data sources with no incentive to alter results are less likely to introduce bias.
Example
A car club reporting on public transportation may have a bias if their members prefer driving.
Statistical Analysis of Data
Significance and Practicality
Statistical significance: Results unlikely due to chance.
Practical significance: Results are large enough to matter in practice.
Sample Data Analysis
Analyzing temperature data at different times can reveal correlations or differences between groups.
Subject | 8 AM | 12 AM |
|---|---|---|
1 | 72 | 66 |
2 | 68 | 67 |
3 | 77 | 72 |
4 | 75 | 69 |
5 | 82 | 76 |
6 | 56 | 54 |
Analysis can address whether there is a difference between average temperatures for males and females, or a correlation between times.
Parameters and Statistics
Definitions
Parameter: A numerical measurement describing a characteristic of a population.
Statistic: A numerical measurement describing a characteristic of a sample.
Example
The mean income of all residents in a city is a parameter; the mean income of a sample of residents is a statistic.
Types of Data: Qualitative vs. Quantitative
Definitions
Qualitative data: Describes categories or qualities (e.g., colors, names).
Quantitative data: Consists of counts or measurements (e.g., height, weight).
Example
The price of gasoline is quantitative; types of gasoline are qualitative.
Discrete vs. Continuous Data
Definitions
Discrete data: Can take only specific values, often counts (e.g., number of students).
Continuous data: Can take any value within an interval (e.g., height, weight).
Example
Farm produce lengths are continuous; number of cows is discrete.
Levels of Measurement
Four Levels
Nominal: Categories only, no order (e.g., social security numbers).
Ordinal: Categories with order, but no meaningful differences (e.g., rankings).
Interval: Ordered, differences are meaningful, but no true zero (e.g., temperature in Celsius).
Ratio: Ordered, meaningful differences, true zero exists (e.g., height, weight).
Level | Example |
|---|---|
Nominal | Social security numbers |
Ordinal | Class rankings |
Interval | Temperature (Celsius) |
Ratio | Length in meters |
Measures of Center
Mean, Median, Mode, and Midrange
Mean: The arithmetic average.
Median: The middle value when data are ordered.
Mode: The value that appears most frequently.
Midrange: The average of the maximum and minimum values.
Example
Given data: 72, 68, 77, 75, 82, 56
Mean:
Median: 73.5
Mode: None (all values unique)
Midrange:
Measures of Variation
Range, Variance, and Standard Deviation
Range: Difference between maximum and minimum values.
Variance: Average squared deviation from the mean.
Standard deviation: Square root of variance.
Example
Room prices: 294, 123, 281, 228, 232, 128, 219
Range:
Standard deviation:
Variance:
Symbols for Statistics and Parameters
Concept | Sample Symbol | Population Symbol |
|---|---|---|
Mean | ||
Standard deviation | ||
Variance |
Descriptive vs. Inferential Statistics
Definitions
Descriptive statistics: Summarize or describe characteristics of a data set (e.g., mean, median, mode).
Inferential statistics: Use sample data to make generalizations about a population.
Measures of Center and Outliers
Choosing the Best Measure
Median is preferred when data have outliers, as it is less affected by extreme values.
Mean is sensitive to outliers.
Mode is useful for categorical data.
Summary of Key Concepts
Statistical significance does not imply practical significance.
Bias can arise from data sources with incentives.
Parameters describe populations; statistics describe samples.
Data can be qualitative or quantitative, discrete or continuous.
Levels of measurement: nominal, ordinal, interval, ratio.
Measures of center: mean, median, mode, midrange.
Measures of variation: range, variance, standard deviation.
Descriptive statistics summarize data; inferential statistics generalize to populations.
Additional info: Some explanations and examples have been expanded for clarity and completeness.