menuGamaTrain
search

chevron_left Cumulative frequency: A ‘running total’ of the frequencies chevron_right

Cumulative frequency: A ‘running total’ of the frequencies
Anna Kowalski
share
visibility51
calendar_month2025-12-05

Cumulative Frequency: Your Guide to the "Running Total"

Discover how this simple yet powerful statistical tool helps us understand data from the ground up.
Summary: Cumulative frequency is a fundamental concept in data analysis, often described as a "running total." It represents the sum of all frequencies up to a certain point in a data set, allowing us to quickly answer questions like "how many observations are less than or equal to a given value?" This article will break down its construction, visualization through ogive graphs, and its practical applications in everyday statistics, from school grades to survey results.

From Simple Counts to Running Totals

Imagine you're counting the number of apples you pick each day for a week. The daily count is your frequency. Now, if you want to know the total number of apples picked by the end of Tuesday, you add Monday's count to Tuesday's count. That total is your cumulative frequency up to Tuesday.

Formally, the cumulative frequency for a particular value in a data set is the total number of observations that are less than or equal to that value. We start from the lowest value and add up frequencies as we move towards higher values.

Let's consider a small example. A teacher records the number of books read by 10 students in a month:

Number of Books (Value)Number of Students (Frequency)Cumulative Frequency (Running Total)
122 (2 students read 1 book or less)
235 (2+3. 5 students read 2 books or less)
349 (5+4. 9 students read 3 books or less)
4110 (9+1. All 10 students read 4 books or less)

The last cumulative frequency should always equal the total number of observations. This table instantly tells us that 9 out of 10 students read 3 or fewer books.

Formula Tip: The cumulative frequency $CF_i$ for the $i$-th group is calculated as: $CF_i = f_1 + f_2 + ... + f_i$, where $f_1, f_2, ..., f_i$ are the frequencies of the first, second, up to the $i$-th group. In simpler terms: New Cumulative Frequency = Previous Cumulative Frequency + Current Group Frequency.

Grouped Data and the Cumulative Frequency Column

Real-world data is often grouped into intervals or classes, like age ranges or score brackets. Calculating cumulative frequency for grouped data follows the same "running total" principle, but we focus on the upper class boundary[1].

Let's analyze the test scores (out of 100) of 50 students. The data is grouped into 5 classes.

Score Range (Class)Frequency (f)Cumulative Frequency (CF)
0 - 2055 (5 scores are $\leq$ 20)
21 - 40813 (5+8. 13 scores are $\leq$ 40)
41 - 601528 (13+15. 28 scores are $\leq$ 60)
61 - 801745 (28+17. 45 scores are $\leq$ 80)
81 - 100550 (45+5. All 50 scores are $\leq$ 100)

From this table, we can directly extract useful information: 28 students scored 60 or below. To find how many scored above 80, we subtract the cumulative frequency for scores $\leq$ 80 (45) from the total (50), giving us 5 students.

Visualizing the Trend: The Cumulative Frequency Curve (Ogive)

Numbers in a table are informative, but a graph can reveal patterns at a glance. A cumulative frequency curve, also called an ogive[2], is a line graph that plots the cumulative frequency against the upper class boundary for grouped data.

To draw an ogive:

  1. On the horizontal x-axis, plot the upper class boundaries.
  2. On the vertical y-axis, plot the corresponding cumulative frequencies.
  3. Plot each point: (Upper Boundary, Cumulative Frequency).
  4. Connect the points with a smooth curve. Typically, the curve starts at the lower boundary of the first class with a cumulative frequency of 0.

 

The ogive always has a characteristic S-shape or a curve that rises upwards. Its most powerful use is in estimating medians[3], quartiles[4], and percentiles. For example, to find the median score (the score of the 25th student when data is ordered), you find the cumulative frequency value of 25 on the y-axis, move horizontally to the curve, and then drop down vertically to the x-axis to read the estimated median score.

From Classroom to Real World: Practical Applications

Cumulative frequency is not just a math exercise; it's a tool used in many fields. Let's explore a concrete example related to a common activity: analyzing the time spent on homework.

A school club surveys 30 members about their weekly homework hours. The goal is to understand the distribution and make a case for a proposed "homework support session" for students who spend longer than average.

Hours per WeekNumber of StudentsCumulative Frequency
0 - 244
3 - 51014
6 - 8923
9 - 11528
12 - 14230

Using this cumulative frequency table, the club can make data-driven statements:

  • "Half of our members (15 students) spend 5 hours or less on homework." How? The median is between the 15th and 16th student. Looking at the cumulative frequency, 14 students are in the "5 hours or less" group, and 23 are in the "8 hours or less" group. So the median lies in the 6-8 hour group.
  • "Only 7 members spend more than 11 hours per week." Calculated as: Total (30) - CF for $\leq$ 11 hours (28) = 2. Wait, that's only 2! Let's re-check: The statement "more than 11 hours" refers to the last class (12-14 hours), which has a frequency of 2. So the correct statement is "Only 2 members spend more than 11 hours." The cumulative frequency helps us catch this kind of misinterpretation.
  • "Approximately 75% of members (about 22-23 students) spend 8 hours or less." The cumulative frequency for the 6-8 hour group is 23.

 

This simple analysis provides clear evidence to plan support sessions effectively.

Important Questions

Q1: What is the difference between frequency and cumulative frequency?

A: Frequency counts how many times a specific value or range occurs. Cumulative frequency is a running total of frequencies, showing how many observations lie at or below a certain value. Frequency tells you about individual groups; cumulative frequency tells you about the data's progression from the lowest point upwards.

Q2: How can I use cumulative frequency to find the median?

A: First, ensure your data is ordered. The median position is $(N+1)/2$ for small data sets or $N/2$ for grouped data, where $N$ is the total number of observations. Find this position in the cumulative frequency column. The corresponding value (or class) is the median. For grouped data, you use interpolation within the median class, often visualized using the ogive curve.

Q3: Can cumulative frequency ever decrease as you go down the table?

A: No. By definition, cumulative frequency is a running total. You keep adding non-negative frequencies (counts), so the total can only stay the same or increase. The final cumulative frequency must equal the total number of observations.

Conclusion

Cumulative frequency transforms a simple list of counts into a powerful narrative about data accumulation. It answers the essential question, "How many are there up to this point?" From building basic tables for ungrouped data to constructing insightful ogive graphs for grouped data, this concept forms the backbone for understanding distribution, central tendency, and percentiles. Its applications span academic grading, market research, quality control, and beyond. Mastering the "running total" is a fundamental step towards becoming proficient in data literacy, enabling you to not just see individual numbers, but to understand the story they tell as a whole.

Footnote

[1] Upper Class Boundary: The highest value that can belong to a given class interval. For the class 21-40, the upper boundary is 40.5 if dealing with continuous data (to separate it from the next class 41-60), or simply 40 for discrete data.

[2] Ogive (pronounced oh-jive): The name for the graph of a cumulative frequency distribution. It is a French word, sometimes called a cumulative frequency curve.

[3] Median: The middle value in an ordered data set, separating the higher half from the lower half.

[4] Quartiles: Values that divide an ordered data set into four equal parts. The first quartile (Q1) is the 25th percentile, the second quartile (Q2) is the median (50th percentile), and the third quartile (Q3) is the 75th percentile.

Did you like this article?

home
grid_view
add
explore
account_circle