BackDefining and Collecting Data in Business Statistics
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Defining and Collecting Data
Classifying Variables by Type
In business statistics, understanding the types of variables is essential for selecting appropriate analytical methods. Variables can be classified as either categorical or numerical, each with distinct characteristics and uses.
Categorical (Qualitative) Variables: These variables represent categories or groups. Their values are labels or names, such as "yes", "no", "blue", "brown", or "green". No mathematical operations can be performed on these values.
Numerical (Quantitative) Variables: These variables represent measurable quantities. They can be further divided into:
Discrete Variables: Arise from a counting process (e.g., number of employees, number of products sold).
Continuous Variables: Arise from a measuring process (e.g., height, weight, time, sales revenue).
Example: The color of a car (categorical), the number of cars sold in a month (discrete numerical), and the price of a car (continuous numerical).
Measurement Scales
Measurement scales determine the level of information contained in a variable and influence the types of statistical analyses that can be performed.
Nominal Scale: Classifies data into distinct categories in which no ranking is implied. Example: Types of fruit (apple, orange, banana).
Ordinal Scale: Classifies data into distinct categories in which ranking is implied, but the differences between ranks are not meaningful. Example: Customer satisfaction ratings (satisfied, neutral, dissatisfied).
Interval Scale: An ordered scale where the difference between measurements is meaningful, but there is no true zero point. Example: Temperature in Celsius or Fahrenheit.
Ratio Scale: An ordered scale with meaningful differences and a true zero point, allowing for the calculation of ratios. Example: Sales revenue, weight, height.
Data Collection: Population vs. Sample
Data in business statistics can be collected from either a population or a sample. Understanding the distinction is crucial for proper data analysis and inference.
Population: The entire set of items or individuals of interest for a particular study.
Sample: A subset of the population, selected for analysis to draw conclusions about the whole population.
Example: If a company wants to know the average salary of its employees, the population is all employees, while a sample might be 100 randomly selected employees.
Population vs. Sample Table
Population | Sample |
|---|---|
All items or individuals about which you want to draw conclusions | A portion of the population's items or individuals |
Parameter vs. Statistic
When summarizing data, it is important to distinguish between parameters and statistics.
Population Parameter: A numerical summary that describes a specific variable for the entire population. Example: The average income of all employees in a company.
Sample Statistic: A numerical summary that describes a specific variable for a sample. Example: The average income of a sample of 100 employees.
Sources of Data
Data can be obtained from different sources, which affects its reliability and relevance.
Primary Sources: The data collector is the one performing the analysis. Example: A company conducting its own employee survey.
Secondary Sources: The person performing the analysis is not the data collector. Example: Using government census data for market analysis.