Descriptive Statistics and Introduction to SPSS: Study Notes for Health Sciences

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Applied Statistics for the Health Sciences

Course 1: Descriptive Statistics & Introduction to SPSS

This section introduces foundational concepts in statistics, focusing on descriptive statistics and practical data analysis using SPSS, tailored for health sciences students.

The 3 Research Components

Overview of Research Structure

Variable Interaction: Understanding how different variables relate and affect each other in a study.
Research Design: The plan or strategy for conducting research, including how data is collected and analyzed.
Statistical Analysis: The process of applying statistical methods to interpret data and draw conclusions.

Statistical Analyses

Types of Statistical Analyses

Descriptive Statistics: Summarize and describe the main features of a dataset.
Inferential Statistics: Make predictions or inferences about a population based on sample data.

Example: Descriptive statistics might include calculating the average age of participants, while inferential statistics could involve testing whether a treatment has a significant effect.

Overview of Descriptive Statistics

Main Topics Covered

Descriptive statistics
Measures of central tendency
Measures of variability
Introduction to SPSS

Descriptive Statistics

Definition and Purpose

Descriptive statistics are used to summarize and describe the characteristics of a dataset.
They help to make sense of data by providing simple summaries about the sample and measures.

Types of Descriptive Statistics:

Measures of central tendency
Measures of variability

Measures of Central Tendency

Definition and Types

Measures of central tendency represent an indicator of the typical score in a dataset. The three main measures are:

Mean
Median
Mode

Mean

The mean is the most frequently used measure of central tendency.
It is calculated by adding all the scores and dividing by the number of scores.

Formula:

Example: For scores 2, 3, 4, 5:

Problems with the Mean

The mean is influenced by extreme scores (outliers).
For example, in the set 1, 1, 1, 1, 20, the mean is 4.8, which does not represent the typical score.

Median

The median is the middle score after all scores have been ordered.
It is not influenced by extreme scores.

How to Find the Median:

Order the scores from lowest to highest.
If the number of scores is odd, the median is the middle score.
If even, the median is the average of the two middle scores.

Example: For scores 1, 2, 3, 4, 5, the median is 3.

Formula:

Mode

The mode is the most frequently occurring score in a dataset.
It is typically used with categorical data.

Example: For scores 1, 1, 2, 3, 4, the mode is 1.

Formula:

Measures of Variability

Definition and Importance

Measures of variability indicate the spread or dispersion of scores in a dataset. They help to understand how much the scores differ from each other.

Range
Interquartile range
Standard deviation

Range

The range is the difference between the highest and lowest scores in a sample.

Formula:

Example: For scores 1, 2, 3, 4, 5, the range is 5 - 1 = 4.

Interquartile Range (IQR)

The IQR measures the spread of the middle 50% of scores.
It is calculated by subtracting the first quartile (Q1) from the third quartile (Q3).

Formula:

Example: For the dataset 1, 2, 3, 4, 5, 6, 7, 8, 9, Q1 = 3, Q3 = 7, so IQR = 7 - 3 = 4.

Standard Deviation

Standard deviation measures how much the scores deviate from the mean.
It is the square root of the variance.

Formula:

Example: For scores 1, 2, 3, 4, 5, calculate the mean, subtract the mean from each score, square the result, sum the squares, divide by N, and take the square root.

Additional info: Standard deviation is widely used to assess the variability in health sciences data, such as blood pressure readings or test scores.

Introduction to SPSS

Why Learn SPSS?

SPSS (Statistical Package for the Social Sciences) is a powerful software for statistical analysis.
It is commonly used in health sciences for data management and analysis.

Steps for Downloading SPSS:

Download SPSS to your computer.
Open the downloaded file and install the SPSS database.
Double-click on the file to start SPSS.

Calculating Descriptive Statistics in SPSS

Open the SPSS database.
Go to Analyze > Descriptive Statistics > Descriptives.
Select the variable of interest (e.g., Coping variable).
Click on the blue arrow to move the variable to the analysis box.
Click on the Options button and select desired statistics (mean, standard deviation, etc.).
Click Continue and view the results in the SPSS output window.

Graphing Mean Scores in SPSS

Go to Graphs > Chart Builder.
Select Simple Bar chart.
Add the variable to the Y-axis and the category to the X-axis.
Click OK to generate the graph.

Data Cleaning in SPSS

Always clean your data before analysis.
Check for errors in data transcription.
Assess the number of missing data points.
Evaluate the distribution of scores (e.g., skewness, outliers).

Additional info: Proper data cleaning ensures the accuracy and reliability of statistical results, which is critical in health sciences research.

Summary Table: Measures of Central Tendency and Variability

Measure	Definition	Formula	Strengths	Limitations
Mean	Average of all scores		Uses all data points	Affected by outliers
Median	Middle value in ordered data	-	Not affected by outliers	Does not use all data points
Mode	Most frequent value	-	Useful for categorical data	May not be unique
Range	Difference between highest and lowest scores		Easy to calculate	Ignores distribution of scores
Interquartile Range	Spread of middle 50% of scores		Not affected by outliers	Requires ordered data
Standard Deviation	Average deviation from the mean		Uses all data points	Complex to calculate