BackDescriptive Statistics and Introduction to SPSS: Study Notes for Health Sciences
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Applied Statistics for the Health Sciences
Course 1: Descriptive Statistics & Introduction to SPSS
This section introduces foundational concepts in statistics, focusing on descriptive statistics and practical data analysis using SPSS, tailored for health sciences students.
The 3 Research Components
Overview of Research Structure
Variable Interaction: Understanding how different variables relate and interact within a study.
Research Sample: The group of subjects or data points selected for analysis.
Statistical Approach: The methods and techniques used to analyze and interpret data.
Example: In a study examining the effect of exercise on blood pressure, the variables might be exercise frequency and blood pressure readings, the sample could be adults aged 18-65, and the statistical approach might involve comparing means.
Statistical Analyses
Types of Statistical Analyses
Descriptive Statistics: Summarize and describe the main features of a dataset.
Inferential Statistics: Make predictions or inferences about a population based on sample data.
Example: Descriptive statistics might include calculating the average age in a sample, while inferential statistics could involve testing whether a new drug is effective compared to a placebo.
Overview of Descriptive Statistics
Main Topics Covered
Descriptive statistics
Measures of central tendency
Measures of variability
Introduction to SPSS
Descriptive Statistics
Definition and Purpose
Descriptive statistics are used to summarize and describe the main features of a dataset.
They help to make sense of data by providing simple summaries about the sample and measures.
Types of Descriptive Statistics:
Measures of central tendency
Measures of variability
Measures of Central Tendency
Definition and Types
Measures of central tendency represent an indicator of the typical score in a dataset. The three main measures are:
Mean
Median
Mode
Mean
The most frequently used measure of central tendency.
Calculated by adding all scores and dividing by the number of scores.
Formula:
Example: For scores 3, 5, 7, 8, 4:
Problems with the Mean
Influenced by extreme scores (outliers).
May not represent the typical value if the data is skewed.
Median
The middle score after all scores have been ranked in ascending order.
Not influenced by extreme scores.
Example: For scores 3, 5, 7, 8, 4 (ordered: 3, 4, 5, 7, 8), the median is 5.
Formula:
Mode
The most frequently occurring score in a dataset.
Useful for categorical data.
Example: For scores 3, 5, 7, 7, 8, the mode is 7.
Graphical Representation
Bar Chart: Heights of bars represent mean scores for groups.
Line Graph: Means represented as end points of lines.
Measures of Variability
Definition and Importance
Measures of variability indicate the spread or dispersion of scores in a dataset. They help to understand how much scores differ from the typical value.
Range
Interquartile Range
Standard Deviation
Range
The difference between the highest and lowest scores in a sample.
Formula:
Example: For scores 3, 5, 7, 8, 4:
Interquartile Range (IQR)
Measures the spread of the middle 50% of scores.
Calculated by finding the difference between the 75th percentile (Q3) and the 25th percentile (Q1).
Formula:
Example: For ordered scores 3, 4, 5, 7, 8, Q1 = 4, Q3 = 7, so
Standard Deviation
Indicates how much the scores in a dataset typically vary from the mean.
Calculated as the square root of the variance.
Formula:
Where is each score, is the mean, and is the number of scores.
Example: For scores 3, 5, 7, 8, 4:
Mean = 5.4
Deviations: -2.4, -0.4, 1.6, 2.6, -1.4
Squared deviations: 5.76, 0.16, 2.56, 6.76, 1.96
Sum = 17.2
Variance =
Standard deviation =
Introduction to SPSS
Why Learn SPSS?
SPSS (Statistical Package for the Social Sciences) is a widely used software for statistical analysis in health sciences.
It simplifies data management, analysis, and graphical representation.
Steps for Downloading SPSS:
Download SPSS to your computer.
Open the provided SPSS database file.
Double-click the file to open in SPSS.
Calculating Descriptive Statistics in SPSS
Open the SPSS database.
Go to Analyze > Descriptive Statistics > Descriptives.
Select the variable of interest (e.g., Coping variable).
Click the blue arrow to move the variable to the analysis box.
Click Options and select desired statistics (mean, standard deviation, etc.).
Click Continue and OK to view results.
Example Output: SPSS will display a table with mean, standard deviation, minimum, and maximum values for the selected variable.
Graphing in SPSS
To graph mean scores for groups, go to Graphs > Chart Builder.
Select Bar or Line chart.
Drag the variable to the appropriate axis.
Click OK to generate the graph.
Data Cleaning in SPSS
Check for data entry errors.
Review missing data and outliers.
Ensure the number of missing data is not excessive (e.g., less than 2.5% of total cases).
Example: If a variable has many missing values, results may be affected or not significant.
Summary Table: Measures of Central Tendency and Variability
Measure | Definition | Formula | Strengths | Limitations |
|---|---|---|---|---|
Mean | Arithmetic average of scores | Uses all data points | Affected by outliers | |
Median | Middle value in ordered data | Middle value | Not affected by outliers | Ignores magnitude of values |
Mode | Most frequent score | Most common value | Useful for categorical data | May not be unique |
Range | Difference between highest and lowest scores | Simple to calculate | Ignores distribution of scores | |
Interquartile Range | Spread of middle 50% of scores | Not affected by outliers | Ignores extreme values | |
Standard Deviation | Average deviation from the mean | Uses all data points | Complex to calculate |
Additional info: These notes expand on the original slides by providing full definitions, formulas, and examples for each concept, as well as a summary table for comparison.