Skip to main content
Back

Statistical Analysis of Guatemalan Cholesterol Levels: Distributions, Summary Statistics, and Comparative Measures

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

Guatemalan Cholesterol Levels: A Statistical Case Study

Introduction

This study examines the distribution of serum total cholesterol levels among two groups of Guatemalan adults: low-income rural and high-income urban populations. The data, sourced from a clinical nutrition study, provides an opportunity to apply statistical methods for comparing distributions, identifying outliers, and interpreting measures of central tendency and variability.

Data Description

Cholesterol Level Data

  • Rural Group: 49 adult, low-income rural Guatemalans

  • Urban Group: 49 adult, high-income urban Guatemalans

  • Measurement: Serum total cholesterol levels (mg/dL)

Table 1. Cholesterol Levels (mg/dL)

Group

Cholesterol Levels (mg/dL)

Rural

95, 108, 108, 114, 115, 124, 129, 129, 131, 131, 135, 136, 136, 139, 140, 142, 142, 143, 143, 144, 144, 145, 146, 148, 152, 152, 155, 157, 158, 158, 162, 165, 166, 171, 172, 173, 174, 175, 180, 181, 189, 192, 194, 197, 204, 220, 223, 226, 231

Urban

133, 134, 155, 170, 175, 179, 181, 181, 188, 189, 190, 196, 197, 199, 200, 200, 201, 201, 204, 205, 205, 206, 214, 217, 222, 222, 227, 227, 228, 234, 236, 239, 241, 242, 244, 249, 252, 273, 279, 284, 284, 284, 330

Graphical Summaries

Histograms and Boxplots

  • Both groups' cholesterol distributions are mound-shaped (unimodal) with right skewness.

  • An outlier is present in the urban group (330 mg/dL), which is much higher than the next largest value.

  • Boxplots visually confirm higher cholesterol levels and greater variability in the urban group, with the outlier clearly visible.

Summary Statistics

Five Number Summary and Descriptive Statistics

Group

Lower Quartile (Q1)

Median

Upper Quartile (Q3)

Maximum

Range

Quartile Range (IQR)

Mean

Standard Deviation

Rural

136.00

152.00

174.00

231.00

136.00

38.00

157.02

31.75

Urban

186.00

206.00

228.00

330.00

197.00

42.00

216.87

39.92

Percentiles (Selected)

Group

20th Percentile

50th Percentile (Median)

80th Percentile

95th Percentile

Rural

131.00

152.00

181.00

231.00

Urban

175.00

206.00

243.00

330.00

Measures of Center and Spread

Central Tendency

  • Median: Rural group: 152 mg/dL; Urban group: 206 mg/dL

  • Mean: Rural group: 157.02 mg/dL; Urban group: 216.87 mg/dL

  • The urban group has a median cholesterol level about 54 mg/dL higher than the rural group, and a mean about 59.85 mg/dL higher.

Variability

  • Range: Urban group: 197 mg/dL; Rural group: 136 mg/dL

  • Interquartile Range (IQR): Urban group: 42 mg/dL; Rural group: 38 mg/dL

  • Standard Deviation: Urban group: 39.92 mg/dL; Rural group: 31.75 mg/dL

  • Ratios of variability measures (urban/rural): Range ≈ 1.4, IQR ≈ 1.1, Standard deviation ≈ 1.3

Skewness and Distribution Shape

Quantifying Skewness

  • Right skewness is present in both groups, stronger in the urban group (especially with the outlier).

  • Skewness can be quantified by comparing the range of the right half (max - median) to the left half (median - min): - Rural: - Urban: (with outlier); (without outlier)

  • Similar ratios for the tails and center support the conclusion of right skewness.

Interpretation of Skewness

  • Right skewness indicates more variability among individuals with higher cholesterol levels than among those with lower levels.

  • Outliers can significantly affect measures of spread and skewness.

Comparative Analysis

Summary of Differences

  • Urban Guatemalans have higher cholesterol levels and greater variability than rural Guatemalans.

  • Distributions are unimodal, with peaks at approximately 140 mg/dL (rural) and 200 mg/dL (urban).

  • Variability is about 30-40% greater in the urban group, depending on the measure used.

Possible Explanations (Contextual)

  • Rural Guatemalans may have lower cholesterol due to higher physical activity and lower dietary fat intake.

  • Greater variability in the urban group may reflect more diverse lifestyles and eating habits.

Key Statistical Concepts Illustrated

  • Five Number Summary: Minimum, Q1, Median, Q3, Maximum

  • Measures of Center: Mean, Median

  • Measures of Spread: Range, Interquartile Range (IQR), Standard Deviation

  • Skewness: Quantified by ratios of right/left halves and tails

  • Outliers: Identification and impact on summary statistics

  • Percentiles: Used for more detailed distribution comparisons

Formulas Used

  • Mean:

  • Standard Deviation:

  • Interquartile Range (IQR):

  • Skewness Ratio (example):

Example Application

  • Comparing cholesterol levels between two populations using summary statistics and graphical methods.

  • Assessing the impact of outliers on measures of spread and skewness.

  • Using percentiles and boxplots for visual and numerical comparison.

Additional info: The study notes infer some context about the data collection and possible explanations for observed differences, as suggested by the original material.

Pearson Logo

Study Prep