Data, Frequency Tables & Diagrams
Types of data, frequency distributions, histograms, bar charts, and frequency polygons.
Statistics begins with organising information so that patterns can be seen. A table or diagram is not just a picture; it is a way to make a dataset easier to read, compare, and interpret.
In CSEC, Statistics questions may ask you to construct a diagram, read a value from a table, or explain what the data suggests. Always identify the type of data first, because that determines whether a bar chart, histogram, pie chart, line graph, or frequency polygon is appropriate. Your explanation should connect the diagram back to the data, not just describe its shape.
Understanding what kind of data you have is the first step in analysis.
Quantitative vs. Qualitative Data
The type of data determines what calculations and diagrams make sense. You can average numerical data, but not categories like favourite colour.
Quantitative data = numerical, can be measured
- Examples: height, age, temperature, test scores, weight
Qualitative data = descriptive, not numerical
- Examples: color, preference, opinion, gender, nationality
Discrete vs. Continuous Data
Discrete data is counted in separate values; continuous data is measured and can take values between marks on a scale.
Discrete data = counted, takes specific values only
- Examples: number of students (can't have 25.5 students), shoe size, number of cars
- Integers only
Continuous data = measured, can take any value in a range
- Examples: height (could be 175.3 cm or 175.31 cm), temperature, time
- Can be any decimal value
Ungrouped vs. Grouped Data
Ungrouped data shows individual values. Grouped data sacrifices exact detail to make large datasets easier to summarise.
Ungrouped data = individual values listed separately
- Used for: small datasets, when exact values matter
- Example: 5, 7, 8, 9, 11, 12, 15
Grouped data = values organized in classes/intervals
- Used for: large datasets, to see patterns
- Example: 5-10, 10-15, 15-20
Classify these data sets:
-
"Heights of 100 students: 1.50 m, 1.52 m, 1.67 m, ..."
- Quantitative, continuous, grouped (in height ranges)
-
"Colors preferred by 50 people: red, blue, red, green, ..."
- Qualitative, discrete
-
"Number of books read: 5, 7, 8, 9, 11, ..."
- Quantitative, discrete, ungrouped
- Quantitative = numbers; Qualitative = descriptions
- Discrete = counted (integers); Continuous = measured (decimals)
- Ungrouped = individual values; Grouped = in classes
Part 2: Frequency Tables
A frequency table organizes data by showing how often each value occurs.
Ungrouped Frequency Table
A frequency table counts how often each value appears. It makes repeated data easier to read and prepares it for graphs or averages.
Test scores for 20 students:
5, 7, 5, 8, 9, 5, 7, 6, 8, 7, 5, 9, 6, 8, 7, 9, 5, 6, 8, 7
Frequency table:
| Score | Frequency | Cumulative Frequency |
|---|---|---|
| 5 | 5 | 5 |
| 6 | 3 | 8 |
| 7 | 5 | 13 |
| 8 | 4 | 17 |
| 9 | 3 | 20 |
| Total | 20 |
Grouped Frequency Table
Grouped frequency tables are used when there are many different values. Each class interval should be clear and non-overlapping.
For large datasets, group values into class intervals.
Heights of 40 students (in cm):
SAMPLE DATA: 150, 152, 155, 158, 160, 161, 162, 165, 167, 168, 170, 171, 172, 174, 175, 176, 178, 180, 182, 183, 185, 186, 187, 188, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, ...
Grouped frequency table:
| Class Interval | Frequency | Cumulative Frequency |
|---|---|---|
| 150-159 | 5 | 5 |
| 160-169 | 8 | 13 |
| 170-179 | 12 | 25 |
| 180-189 | 10 | 35 |
| 190-199 | 5 | 40 |
| Total | 40 |
Class Features
Class boundaries and midpoints help you calculate and graph grouped data. The midpoint stands in for all the values in that interval when exact values are unavailable.
When grouping data, understand these terms:
Class interval = the range (e.g., 150-159)
- Lower class limit = 150
- Upper class limit = 159
Class boundaries = the true limits (used for graphs)
- For 150-159: boundaries are 149.5 to 159.5
- Removes the "gap" between classes
Class width = difference between boundaries
- 159.5 - 149.5 = 10
Class midpoint = center of the class
- (150 + 159) ÷ 2 = 154.5
For the class interval 170-179:
- Class limits: 170 (lower), 179 (upper)
- Class boundaries: 169.5 to 179.5
- Class width: 179.5 - 169.5 = 10
- Class midpoint: (170 + 179) ÷ 2 = 174.5
Part 3: Statistical Diagrams
Different diagrams show data in different ways.
Pie Charts
Pie charts show parts of a whole. Each sector angle is proportional to the category frequency.
A pie chart shows data as slices of a circle. Each slice is proportional to the frequency.
Favorite fruit for 60 students:
| Fruit | Frequency | Angle |
|---|---|---|
| Apple | 20 | (20÷60)×360° = 120° |
| Banana | 15 | (15÷60)×360° = 90° |
| Orange | 15 | (15÷60)×360° = 90° |
| Mango | 10 | (10÷60)×360° = 60° |
| Total | 60 | 360° |
Bar Charts
Bar charts compare separate categories or discrete values. The gaps between bars show that the categories are separate.
A bar chart uses rectangular bars to show frequencies. Good for categorical or discrete data.
Number of cars sold per day (Monday-Friday):
| Day | Cars Sold |
|---|---|
| Monday | 15 |
| Tuesday | 12 |
| Wednesday | 18 |
| Thursday | 20 |
| Friday | 14 |
Histograms
Histograms show grouped continuous data. Bars touch because the intervals run continuously into each other.
A histogram is like a bar chart, but for continuous grouped data. No gaps between bars.
Heights grouped into classes:
Frequency Polygon
A frequency polygon joins class midpoints to show the shape of a distribution. It is useful for comparing two grouped datasets on the same axes.
A frequency polygon connects the midpoints of each class with straight lines. Shows the shape of the distribution.
Using the same height data:
Plot a point at each class midpoint at its frequency height, then connect with straight lines.
Line Graph
Line graphs show change over time or ordered values. The pattern of rise, fall, and flat sections is the main information.
A line graph shows how a variable changes over time.
When to use which diagram:
- Pie chart: Showing parts of a whole; percentages
- Bar chart: Categorical data; comparing groups
- Histogram: Continuous grouped data; seeing distribution shape
- Frequency polygon: Showing distribution shape; comparing distributions
- Line graph: Data over time; showing trends