Definition
Basic Concept
Mode: the value that appears most frequently in a data set. It identifies the highest frequency or peak in the distribution.
Mathematical Expression
If X = {x1, x2, ..., xn} is a data set, mode m satisfies:
frequency(m) ≥ frequency(xi) for all i in 1 to nRole in Statistics
Mode: one of three central tendency measures alongside mean and median. Useful for qualitative and quantitative data.
Calculation
Frequency Distribution
Calculate frequency of each unique value. Mode = value(s) with maximum frequency.
Ungrouped Data
Count occurrences of each data point. Identify value with highest count.
Grouped Data
Use modal class (class with highest frequency) to estimate mode via formula.
Mode ≈ L + ((f1 - f0) / ((f1 - f0) + (f1 - f2))) × hwhere:L = lower boundary of modal class,f1 = frequency of modal class,f0 = frequency of class before modal class,f2 = frequency of class after modal class,h = width of class intervals.Tabular Presentation
| Value | Frequency |
|---|---|
| 2 | 3 |
| 5 | 7 |
| 8 | 5 |
Mode = 5 (highest frequency: 7)
Properties
Uniqueness
Data set may have no mode, one mode, or multiple modes.
Applicability
Applicable to nominal, ordinal, interval, and ratio data types.
Robustness
Insensitive to extreme values (outliers); unaffected by skewed data.
Non-mathematical Nature
Mode may not be unique or mathematically defined for continuous data without grouping.
Types of Mode
Unimodal
Single mode value; one peak in frequency distribution.
Bimodal
Two modes; two values share highest frequency.
Multimodal
More than two modes; multiple frequent values.
No Mode
All values occur with equal frequency; no repeated value.
Applications
Market Research
Identifying most popular product, preference, or category.
Education
Determining most common grade or score in examinations.
Health Sciences
Most frequent symptom or diagnosis in a patient group.
Data Categorization
Mode helps summarize qualitative data effectively.
Advantages and Limitations
Advantages
- Simple to identify and understand.
- Applicable to all data types, including nominal.
- Unaffected by extreme values or skewness.
- Useful for categorical data summarization.
Limitations
- May not exist in some data sets (no repeated values).
- Can be multiple modes, causing ambiguity.
- Not useful for further mathematical analysis.
- Less stable measure compared to mean and median.
Mode vs. Mean and Median
Definition Differences
Mode: most frequent value. Mean: arithmetic average. Median: middle value when data is ordered.
Sensitivity
Mean sensitive to outliers; median and mode robust.
Data Types
Mode applicable to nominal data; mean and median require numeric data.
Use Cases
Mode used for categorical mode identification; mean/median for central location in quantitative data.
| Measure | Definition | Data Type | Sensitivity to Outliers |
|---|---|---|---|
| Mode | Most frequent value | Nominal, Ordinal, Interval, Ratio | Insensitive |
| Median | Middle value | Ordinal, Interval, Ratio | Robust |
| Mean | Arithmetic average | Interval, Ratio | Sensitive |
Mode in Categorical Data
Nominal Data
Mode identifies most frequent category; mean and median not defined.
Ordinal Data
Mode useful to find common rating or rank.
Example
Survey responses: Red (10), Blue (15), Green (5) → Mode = Blue.
Limitations
Multiple modes possible if categories tie in frequency.
Multimodal Distributions
Definition
Data with two or more modes; indicates multiple peaks or clusters.
Causes
Heterogeneous populations, mixed distributions, or measurement types.
Identification
Frequency analysis or density estimation reveals multiple modes.
Implications
Suggests subgroups or different underlying processes in data.
Calculation Examples
Example 1: Ungrouped Data
Data: {3, 7, 3, 2, 9, 7, 7, 3}
Frequencies: 3 → 3, 7 → 3, 2 → 1, 9 → 1
Mode(s): 3 and 7 (bimodal)
Example 2: Grouped Data
Class intervals and frequencies:
| Class Interval | Frequency |
|---|---|
| 10-20 | 5 |
| 20-30 | 12 |
| 30-40 | 7 |
Modal class = 20-30 (highest frequency 12)
L = 20, f1 = 12, f0 = 5, f2 = 7, h = 10Mode ≈ 20 + ((12 - 5) / ((12 - 5) + (12 - 7))) × 10 = 20 + (7 / (7 + 5)) × 10 = 20 + (7 / 12) × 10 = 20 + 5.83 = 25.83Mode Calculation in Software
Excel
Function: =MODE.SNGL(range) returns single mode; =MODE.MULT(range) returns multiple modes.
R
No base mode function; user-defined function or packages like modeest used.
mode <- function(x) { ux <- unique(x) ux[which.max(tabulate(match(x, ux)))]}Python
Library: statistics.mode() returns single mode; statistics.multimode() returns all modes.
from statistics import mode, multimodedata = [1, 2, 2, 3, 3]print(mode(data)) # Output: 2 (first mode)print(multimode(data)) # Output: [2, 3]SPSS
Descriptives procedure includes mode as output option.
References
- McClave, J.T., Benson, P.G., & Sincich, T. Statistics for Business and Economics, 13th ed., Pearson, 2017, pp. 45-50.
- Triola, M.F. Elementary Statistics, 13th ed., Pearson, 2018, pp. 60-65.
- Walpole, R.E., Myers, R.H., Myers, S.L., & Ye, K. Probability and Statistics for Engineers and Scientists, 9th ed., Pearson, 2012, pp. 75-80.
- Bluman, A.G. Elementary Statistics: A Step by Step Approach, 9th ed., McGraw-Hill, 2017, pp. 30-35.
- Sheskin, D.J. Handbook of Parametric and Nonparametric Statistical Procedures, 5th ed., CRC Press, 2011, pp. 15-20.