BackIntro to Statistics and Collecting Data: Study Notes
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Intro to Statistics and Collecting Data
Introduction to Statistics
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions. It provides tools for understanding data and drawing conclusions from information.
Data: Information gathered through observation, measurement, or experiment.
Population: The entire group of interest in a study (e.g., all students in a school).
Sample: A subset of the population, selected for analysis.
Parameter: A numerical summary describing a characteristic of a population (e.g., population mean).
Statistic: A numerical summary describing a characteristic of a sample (e.g., sample mean).
Example: The average salary of every manager in a company is a parameter, while the average salary of a sample of 100 managers is a statistic.
Term | Definition | Example |
|---|---|---|
Population | Entire group of interest | All registered voters in a country |
Sample | Subset of the population | 100 randomly selected voters |
Parameter | Numerical summary of population | Proportion of all voters who are registered Democrats |
Statistic | Numerical summary of sample | Proportion of sampled voters who are registered Democrats |
Types of Data
Data can be classified as qualitative or quantitative, and quantitative data can be further divided into discrete and continuous types.
Qualitative Data: Describes qualities or categories (e.g., eye color, nationality).
Quantitative Data: Describes quantities or amounts and can be measured numerically.
Discrete Data: Can take only specific values, often counts (e.g., number of students).
Continuous Data: Can take any value within a range, often measurements (e.g., height, time).
Example: Surveying the nationalities of 10 people is qualitative data. Measuring the distance students walk to class is quantitative, continuous data.
Type | Description | Example |
|---|---|---|
Qualitative | Categories, qualities | Eye color, nationality |
Quantitative (Discrete) | Countable values | Number of books |
Quantitative (Continuous) | Measurable values | Height, time |
Collecting Data: Observational Studies vs. Experiments
There are two main ways to collect data: observational studies and experiments. The choice affects whether causation can be inferred.
Experiment: The researcher applies a treatment and measures its effect. Causation can be inferred if the experiment is well-designed.
Observational Study: The researcher observes and measures variables without intervention. Causation cannot be assumed.
Example: Testing a medication by giving it to a group and measuring results is an experiment. Surveying students about their sleep habits is an observational study.
Sampling Methods
Sampling is the process of selecting a subset (sample) from a population for study. The method of sampling affects the representativeness and reliability of results.
Simple Random Sample (SRS): Every member of the population has an equal chance of being selected.
Representative Sample: A sample that accurately reflects the characteristics of the population.
Example: Randomly selecting 2 students from each grade level is a representative sample. Drawing names from a hat is a simple random sample.
Other Sampling Methods
When simple random sampling is impractical, other methods can be used:
Systematic Sampling: Select every nth subject from a list.
Cluster Sampling: Divide the population into clusters, randomly select clusters, and include all members of selected clusters.
Stratified Sampling: Divide the population into groups (strata) based on a characteristic, then randomly sample from each stratum.
Sampling Method | Description | Example |
|---|---|---|
Simple Random | Equal chance for all | Drawing names from a hat |
Systematic | Select every nth unit | Every 10th product off a line |
Cluster | Randomly select groups, sample all in group | Randomly select 3 classrooms, survey all students in those rooms |
Stratified | Divide by characteristic, sample from each | Sample students from each grade level |
Example: A manager wants to survey employees at three locations. They could use cluster sampling by selecting one location and surveying all employees there, or stratified sampling by selecting a proportionate number from each location.
Additional info: These notes cover foundational concepts in statistics, including definitions, types of data, and sampling methods, which are essential for understanding how to collect and interpret data in statistical studies.