BackIntroduction to the Practice of Statistics: Key Concepts, Data Types, Study Designs, and Sampling Methods
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Introduction to the Practice of Statistics
Objectives
Define Statistics
Explain the process of statistics
Distinguish between Qualitative and Quantitative variables
Distinguish between Discrete and Continuous variables
Definition of Statistics
Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions.
Four-Step Statistical Process
Plan (Identify a question): Formulate a statistical question that can be answered with data. This is a crucial step in the process.
Collect (Produce Data): Design and implement a plan to collect appropriate data. Data can be gathered through observations, interviews, questionnaires, databases, sampling, or experimentation.
Process (Analyze the Data): Organize and summarize the data using graphical or numerical methods. This may include histograms, dot plots, and box plots.
Conclusion (Interpret the Results): Interpret findings in the context of the original question and explain how the data answers it.
Key Terms in the Process of Statistics
Population: The entire group of individuals to be studied.
Individual: A person or object that is a member of the population.
Sample: A subset of the population that is being studied.
Types of Statistics
Descriptive statistics: Methods for organizing and summarizing data, often using tables, graphs, and numerical summaries.
Inferential statistics: Methods that use sample data to make generalizations about a population and measure the reliability of the results.
Example
The average score for a class of 28 students taking a calculus midterm exam was 72%. Parameter
Interviews of 100 adults 18 years of age or older found that 44% could state the minimum age required for the office of U.S. president. Statistic
Types of Data: Qualitative and Quantitative Variables
Qualitative vs. Quantitative Variables
Qualitative (Categorical) variables: Classify individuals based on some attribute or characteristic (e.g., gender, nationality).
Quantitative variables: Provide numerical measures of individuals. Arithmetic operations can be performed on these values (e.g., age, income).
Discrete vs. Continuous Variables
Discrete variable: A quantitative variable with a finite or countable number of possible values (e.g., number of children).
Continuous variable: A quantitative variable with an infinite number of possible values that are not countable (e.g., daily intake of whole grains in grams).
Example: Classification
Nationality: Qualitative
Number of children: Discrete
Household income in the previous year: Quantitative
Daily intake of whole grains: Continuous
Observational Studies versus Designed Experiments
Objectives
Distinguish between an observational study and an experiment
Explain the various types of observational studies
Definitions
Observational study: Measures the value of the response variable without attempting to influence the value of either the response or explanatory variables.
Designed experiment: Researcher intentionally changes the value of the explanatory variable and records the value of the response variable.
Lurking variable: An explanatory variable not considered in a study but may affect the value of the response variable.
Key Point
Observational studies do not allow a researcher to claim causation, only association. Only well-designed experiments can prove the cause and effect.
Types of Observational Studies
Cross-sectional studies: Collect information about individuals at a specific point in time or over a very short period.
Case-control studies: Retrospective studies that require individuals to look back in time or require researchers to look at existing records. Individuals are matched based on certain characteristics.
Cohort studies: Identify a group of individuals to participate in a study (the cohort) and are observed over a long period. Characteristics are recorded, and data is collected over time.
Example Table: Types of Observational Studies
Study Type | Key Feature | Example |
|---|---|---|
Cross-sectional | Snapshot at one point in time | Daily coffee consumption and skin cancer |
Case-control | Retrospective, matched groups | Tanning bed use and skin cancer |
Cohort | Follow group over time | Doll and Hill smoking study |
Sampling Techniques
Simple Random Sampling
Random sampling is the process of using chance to select individuals from a population to be included in the sample. A sample of size n from a population of size N is obtained through simple random sampling if every possible sample of size n has an equally likely chance of occurring.
Steps for Simple Random Sampling
Put members in alphabetical order and number them.
Select random numbers using a random number generator or table.
Match generated random numbers to the corresponding individuals.
Example: Simple Random Sampling
From 80 students, select 5 using random digits: 05, 16, 62, 77, 48.
Other Sampling Techniques
Stratified sample: Population is separated into homogeneous, nonoverlapping groups (strata), and a simple random sample is taken from each stratum.
Systematic sample: Every kth individual is selected from the population. The first individual is selected at random between 1 and k.
Cluster sample: All individuals within a randomly selected collection or group are included in the sample.
Comparison Table: Sampling Techniques
Sampling Method | Key Feature | Example |
|---|---|---|
Simple Random | Every individual has equal chance | Randomly select 5 students from a class |
Stratified | Divide into strata, sample from each | Sample students by grade level |
Systematic | Select every k-th individual | Every 10th person entering a store |
Cluster | Sample all individuals in selected groups | Sample all students in selected classrooms |
Strata vs. Clusters
Strata are groups of similar individuals, and samples are taken from each stratum. Clusters are groups selected at random, and all individuals in the selected clusters are sampled.
Summary of Key Formulas
Parameter vs. Statistic: A parameter describes a population, a statistic describes a sample.
Simple Random Sampling Probability:
Additional info: Some explanations and examples have been expanded for clarity and completeness.