Skip to main content
Back

Descriptive Statistics and Introduction to SPSS: Study Notes for Health Sciences

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Applied Statistics for the Health Sciences

Course 1: Descriptive Statistics & Introduction to SPSS

This section introduces foundational concepts in statistics, focusing on descriptive statistics and practical data analysis using SPSS, tailored for health sciences students.

The 3 Research Components

Overview of Research Structure

  • Variable Interaction: Understanding how different variables relate and affect each other in a study.

  • Research Design: The plan or strategy for conducting research, including how data is collected and analyzed.

  • Statistical Analysis: The process of applying statistical methods to interpret data and draw conclusions.

Statistical Analyses

Types of Statistical Analyses

  • Descriptive Statistics: Summarize and describe the main features of a dataset.

  • Inferential Statistics: Make predictions or inferences about a population based on sample data.

Example: Descriptive statistics might include calculating the average age of participants, while inferential statistics could involve testing whether a treatment has a significant effect.

Overview of Descriptive Statistics

Main Topics Covered

  • Descriptive statistics

  • Measures of central tendency

  • Measures of variability

  • Introduction to SPSS

Descriptive Statistics

Definition and Purpose

  • Descriptive statistics are used to summarize and describe the characteristics of a dataset.

  • They help to make sense of data by providing simple summaries about the sample and measures.

Types of Descriptive Statistics:

  • Measures of central tendency

  • Measures of variability

Measures of Central Tendency

Definition and Types

Measures of central tendency represent an indicator of the typical score in a dataset. The three main measures are:

  • Mean

  • Median

  • Mode

Mean

  • The mean is the most frequently used measure of central tendency.

  • It is calculated by adding all the scores and dividing by the number of scores.

Formula:

  • Example: For scores 2, 3, 4, 5:

Problems with the Mean

  • The mean is influenced by extreme scores (outliers).

  • For example, in the set 1, 1, 1, 1, 20, the mean is 4.8, which does not represent the typical score.

Median

  • The median is the middle score after all scores have been ordered.

  • It is not influenced by extreme scores.

How to Find the Median:

  • Order the scores from lowest to highest.

  • If the number of scores is odd, the median is the middle score.

  • If even, the median is the average of the two middle scores.

Example: For scores 1, 2, 3, 4, 5, the median is 3.

Formula:

Mode

  • The mode is the most frequently occurring score in a dataset.

  • It is typically used with categorical data.

Example: For scores 1, 1, 2, 3, 4, the mode is 1.

Formula:

Measures of Variability

Definition and Importance

Measures of variability indicate the spread or dispersion of scores in a dataset. They help to understand how much the scores differ from each other.

  • Range

  • Interquartile range

  • Standard deviation

Range

  • The range is the difference between the highest and lowest scores in a sample.

Formula:

Example: For scores 1, 2, 3, 4, 5, the range is 5 - 1 = 4.

Interquartile Range (IQR)

  • The IQR measures the spread of the middle 50% of scores.

  • It is calculated by subtracting the first quartile (Q1) from the third quartile (Q3).

Formula:

Example: For the dataset 1, 2, 3, 4, 5, 6, 7, 8, 9, Q1 = 3, Q3 = 7, so IQR = 7 - 3 = 4.

Standard Deviation

  • Standard deviation measures how much the scores deviate from the mean.

  • It is the square root of the variance.

Formula:

  • Example: For scores 1, 2, 3, 4, 5, calculate the mean, subtract the mean from each score, square the result, sum the squares, divide by N, and take the square root.

Additional info: Standard deviation is widely used to assess the variability in health sciences data, such as blood pressure readings or test scores.

Introduction to SPSS

Why Learn SPSS?

  • SPSS (Statistical Package for the Social Sciences) is a powerful software for statistical analysis.

  • It is commonly used in health sciences for data management and analysis.

Steps for Downloading SPSS:

  1. Download SPSS to your computer.

  2. Open the downloaded file and install the SPSS database.

  3. Double-click on the file to start SPSS.

Calculating Descriptive Statistics in SPSS

  • Open the SPSS database.

  • Go to Analyze > Descriptive Statistics > Descriptives.

  • Select the variable of interest (e.g., Coping variable).

  • Click on the blue arrow to move the variable to the analysis box.

  • Click on the Options button and select desired statistics (mean, standard deviation, etc.).

  • Click Continue and view the results in the SPSS output window.

Graphing Mean Scores in SPSS

  • Go to Graphs > Chart Builder.

  • Select Simple Bar chart.

  • Add the variable to the Y-axis and the category to the X-axis.

  • Click OK to generate the graph.

Data Cleaning in SPSS

  • Always clean your data before analysis.

  • Check for errors in data transcription.

  • Assess the number of missing data points.

  • Evaluate the distribution of scores (e.g., skewness, outliers).

Additional info: Proper data cleaning ensures the accuracy and reliability of statistical results, which is critical in health sciences research.

Summary Table: Measures of Central Tendency and Variability

Measure

Definition

Formula

Strengths

Limitations

Mean

Average of all scores

Uses all data points

Affected by outliers

Median

Middle value in ordered data

-

Not affected by outliers

Does not use all data points

Mode

Most frequent value

-

Useful for categorical data

May not be unique

Range

Difference between highest and lowest scores

Easy to calculate

Ignores distribution of scores

Interquartile Range

Spread of middle 50% of scores

Not affected by outliers

Requires ordered data

Standard Deviation

Average deviation from the mean

Uses all data points

Complex to calculate

Pearson Logo

Study Prep