Mean means the normal average or arithmetic mean of a dataset. The mean of grouped data formula includes finding the midpoints of class intervals and their frequencies. On the other hand mean of ungrouped data, definition says that if you divide the sum of the total number of observations by the number of observations, then you will get the mean of ungrouped data.
CAT 2024: 20 Free Mock Test | 10 Year PYQs | 60 Day Study Material | Most Scoring Concepts
XAT 2025: Section-wise Preparation Tips | Sample Paper
Don't Miss: SNAP 2024 Sample Papers | NMAT 2024 Sample Papers | MAT 2024 Sample Papers
We will also discuss “mean of grouped data calculator”, “mean of grouped data with frequency”, “mean of grouped data questions”, “mean of grouped data example”, “mean of grouped data python”, “mean of ungrouped data calculator”, “mean of ungrouped data example”, “mean of ungrouped data class 10” etc.
Also read: mean of grouped data formula, mean of grouped data worksheet, mean of grouped data formula example, mean of grouped data and ungrouped data, mean of ungrouped data with frequency, mean of ungrouped data definition, mean of ungrouped data questions, mean of ungrouped data in statistics, mean of ungrouped data worksheet, mean of ungrouped data ppt
When a data set is large, a frequency distribution table is often used to display the data in an organized way. A frequency distribution table lists the data values, as well as the number of times each value appears in the data set. A frequency distribution table is easy to both read and interpret and in this concept is used for ungrouped data, or data that is listed.
The numbers in a frequency distribution table do not have to be put in order. To make it easier to enter the values in the table, a tally column is often inserted. Inserting a tally column allows you to account for every value in the data set, without having to continually scan the numbers to find them in the list. A slash (/) is used to represent the presence of a value in the list, and the total number of slashes will be the frequency. If a tally column is inserted, the table will consist of 3 columns, and if no tally column is inserted, the table will consist of 2 columns.
The mean of ungrouped data often means the arithmetic mean or normal average of a given set of values.
The formula is:
Mean = $\frac{\text{Sum of all the Observations}} {\text{Total number of Observations}}$
Determine the mean of the given data.
5, 7, 9, 7, 3, 7
Sum of all observations = 5 + 7 + 9 + 8 + 3 + 10 = 42
Total number of observations = 6
Hence, the mean = $\frac{42}{7}$ = 6
Consider the following frequency distribution table.
Age interval | Calories intake |
0-10 | 40 |
10-30 | 90 |
30-60 | 100 |
60-90 | 80 |
To calculate the mean we have to use the following formula.
Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$
Age interval | Midpoint | Calories intake | Midpoint × Calories intake |
0-10 | 5 | 40 | 200 |
10-30 | 20 | 90 | 1800 |
30-60 | 45 | 100 | 4500 |
60-90 | 75 | 80 | 6000 |
Required mean = $\frac{200+1800+4500+6000}{40+90+100+80}=\frac{12500}{310}$ = 40.32
Data that has been organized into groups like classes or intervals is called Grouped data. This type of data presentation is useful for large datasets, making it easier to analyze and interpret the data. Grouping data helps in identifying patterns and trends more effectively than working with raw, unorganized data.
Grouped data can be represented in many different ways like
Bar graph
Histogram
Frequency polygon
Bar graph:
A histogram is a type of bar graph that represents the frequency distribution of continuous data. Unlike bar graphs, histograms display data in intervals (bins) with no gaps between the bars. Each bar represents the frequency of data within that interval.
Histogram:
A histogram is a type of bar graph that represents the frequency distribution of continuous data. Unlike bar graphs, histograms display data in intervals (bins) with no gaps between the bars. Each bar represents the frequency of data within that interval.
Frequency polygon:
A frequency polygon is a graphical representation of the distribution of a dataset. It is created by plotting points corresponding to the frequency of each class interval and then connecting these points with straight lines. Frequency polygons are useful for comparing multiple distributions on the same graph.
A frequency distribution table organizes data into intervals (classes) and shows the number of observations (frequency) in each interval. It provides a summary of the dataset and helps in understanding the distribution of data.
Class interval | Frequency |
0-10 | 5 |
10-20 | 8 |
30-40 | 12 |
40-50 | 10 |
There are some key terms used in or can be derived from the frequency table.
Class size:
Class size is the difference between the upper and lower boundaries of a class interval.
For example, in the class interval 0-10, the class size is 10 - 0 = 10
Classmark(Midpoint):
Classmark is the average of the upper and lower boundaries of a class interval.
It is calculated by dividing the sum of upper and lower intervals by 2.
For example, in the class interval 0-10, the classmark is $\frac{0+10}{2}$ = 5
Class intervals:
Class intervals are the ranges into which data is grouped. There are two types of class intervals:
Continuous Class Interval: There are no gaps between intervals, and each interval includes the upper boundary of the previous interval. For example, 0-10, 10-20, 20-30, etc.
Discontinuous Class Interval: There are gaps between intervals, and each interval does not include the upper boundary of the previous interval. For example, 0-9, 10-19, 20-29, etc.
Frequency:
Frequency is the number of observations that fall within a particular class interval. It indicates how often a particular value or range of values occurs in the dataset.
There are three basic methods to find the mean of grouped data
Direct Method
Assumed mean method
Step deviation method
In the Direct method to calculate the mean, we will use the midpoint of the class intervals and frequencies.
The formula is:
Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$
In notation, it can be written as follows:
Mean, $\bar{x}= \frac{ \sum f_i x_i}{\sum f_i}$,
Where
fi is the frequency of the ith class
xi is the midpoint of the ith class
In the Assumed mean method, we have to assume a mean and proceed with it to calculate the original mean.
The formula is:
Mean, $\bar{x}=a + \frac{ \sum f_i d_i}{\sum f_i}$,
Where
di = xi – A, is a derivation of the ith class
A is the assumed mean
fi is the frequency of the ith class
xi is the midpoint of the ith class
Assumed Mean Method = $\bar{x}=a+\frac{\sum f_i d_i}{\sum f_i}$
The Step deviation method is used when all the class intervals are equal. It simplifies the calculation further.
The formula is:
Mean = $A + \frac{h \sum u_if_i}{\sum f_i}$,
Where
$u_i=\frac{x_i-A}{h}$, is a derivation of the ith class
A is the assumed mean
h is the class size
fi is the frequency of the ith class
xi is the midpoint of the ith class
Step Deviation Method $
=A+\frac{h\left(\sum u_{i}f_i\right)}{\sum f_i}
$
To learn more about Average or Arithmetic mean, read the below article.
To learn more about the mode of grouped and ungrouped data, read the below article.
To learn more about the median of grouped and ungrouped data, read the below article.
Mean = $\frac{\text{Sum of all the Observations}} {\text{Total number of Observations}}$
Using the Direct method to calculate, Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$
Using the Assumed mean method to calculate,
Mean, $\bar{x}=a + \frac{ \sum f_i d_i}{\sum f_i}$,
Where
di = xi – A, is a derivation of the ith class
A is the assumed mean
fi is the frequency of the ith class
xi is the midpoint of the ith class
Using the step deviation method to calculate,
Mean = $A + \frac{h \sum u_if_i}{\sum f_i}$,
Where
$u_i=\frac{x_i-A}{h}$, is a derivation of the ith class
A is the assumed mean
h is the class size
fi is the frequency of the ith class
xi is the midpoint of the ith class
Q1. Calculate the mean from the following table.
Scores | Frequencies |
0-10 | 2 |
10-20 | 4 |
20-30 | 12 |
30-40 | 21 |
40-50 | 6 |
50-60 | 3 |
60-70 | 2 |
34.2
33.4
32.6
35.8
Hint: Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$
Use this formula to calculate the mean.
Answer:
Scores | Frequencies | Mid Point | Mid Point × Frequencies |
0-10 | 2 | 5 | 10 |
10-20 | 4 | 15 | 60 |
20-30 | 12 | 25 | 300 |
30-40 | 21 | 35 | 735 |
40-50 | 6 | 45 | 270 |
50-60 | 3 | 55 | 165 |
60-70 | 2 | 65 | 130 |
Mean = $\frac{\sum \text{ (Mid Point × Frequencies)}}{\sum \text{Frequencies}}$
= $\frac{10+60+300+735+270+165+130}{2+4+12+21+6+3+2}$
= $\frac{1670}{50}$
= 33.4
Hence, the correct answer is 33.4.
Q2. The mean of 20 observations was 42. It was found later that observation 54 was wrongly taken as 45. The corrected new mean is:
45.42
43.5
44.25
42.45
Hint: First, find the sum of initial observations. Then subtract the wrong value, add the correct value and find the final sum.
Answer:
The mean of 20 initial observations = 42
Sum of initial observations = 42 × 20 = 840
Wrong entry = 45
Correct entry = 54
Sum after correction = 840 – 45 + 54 = 849
$\therefore$ Final mean = $\frac{849}{20}$ = 42.45
Hence, the correct answer is 42.45.
Q3. Find the standard deviation of the following data (rounded off to two decimal places).
5, 3, 4, 7
1.48
3.21
4.12
3.45
Hint: Standard deviation = $\sqrt{\frac{\sum {x_i}^2}{n}-(\frac{\sum x_i}{n})^2}$
where $x_i$ are the numbers and $n$ is the count of numbers.
Answer:
Standard deviation of 5, 3, 4, 7
= $\sqrt{\frac{\sum {x_i}^2}{n}-(\frac{\sum x_i}{n})^2}$
where $x_i$ are the numbers and $n$ is the count of numbers.
= $\sqrt{\frac{5^2+3^2+4^2+7^2}{4}-(\frac{5+3+4+7)}{4})^2}$
= $\sqrt{\frac{99}{4}-\frac{361}{16}}$
= $\sqrt{\frac{35}{16}}$
= 1.48
Hence, the correct answer is 1.48.
Q4. In data analysis, which of the following statements about the mean (average) is true?
Outliers are not sensitive to the mean.
Categorical data can be analyzed using the mean.
The mean is the best way to describe the variability of data.
The mean reflects the dataset's central value.
Answer:
The core value of a dataset is represented by the mean.
The mean is a metric of central tendency that offers a single value that serves as the "typical" value to sum up a dataset.
D) The core value of a dataset is represented by the mean: This claim is true. The mean is a metric of central tendency that offers a single value that serves as the "typical" value to sum up a dataset.
A) The statement "The mean is not sensitive to outliers" is false. Extreme values have a considerable impact on the mean and are very sensitive to outliers.
B) The mean shouldn't be utilized for categorical data because it is normally used for numerical data.
C) Although the mean helps determine central tendency, it is not a good way to represent data variability.
Hence, the correct answer is "The mean reflects the dataset's central value."
Q5. The mean of the following numbers is 12, 15, 18, 21, 24, and 27.
15.789
17.83
18.26
17.65
Answer:
The Mean of ungrouped data is given by $\sum$ total values/Total no of values
Here,
The sum of $2,15,18,21,24$, and $27$ is $107$.
Total no of values $=6$
Therefore, the mean $=\frac{107 }{ 6}=17.83$
Hence, the correct answer is 17.83.
Q6. The table below shows the number of people belonging to specific age groups. Find the mean of the given data.
The class marks $x_i$ can be found by adding the upper and lower bounds and dividing them by $2 \cdot(20+0 / 2=10,20+40 / 2=30$ etc $\ldots)$
Age group | No of people |
0-20 | 150 |
20-40 | 200 |
40-60 | 175 |
60-80 | 90 |
34.78
29.46
36.59
40.01
Answer:
Class Interval | Frequency fi | Class Marks xi | Xifi |
0-20 | 150 | 10 | 1500 |
20-40 | 200 | 30 | 6000 |
40-60 | 175 | 50 | 8750 |
60-80 | 90 | 70 | 6300 |
Total | 615 | 22,550 |
$\begin{aligned} \text { Mean } & =\Sigma \mathrm{X}_{\mathrm{i}} \mathrm{f}_{\mathrm{i}} / \underline{\boldsymbol{\Sigma}} \mathrm{f}_{\mathrm{i}} \\ & =\frac{22550 }{615} \\ & =36.59\end{aligned}$
Hence, the correct answer is 36.59.
Q7. The following table contains the number of patients admitted for cholera in St. Johns Hospital according to their age group. Find the mean of the following data.
Age group | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 |
No of patients | 9 | 13 | 8 | 15 | 10 |
28.90
31.45
26.81
25.73
Answer:
The class marks can be found by adding the upper and lower bounds and dividing them by 2. (10 + 0 / 2 = 5, 20 + 10 / 2 = 15 etc...)
Class Interval | Frequency fi | Class Marks xi | Xifi |
0-10 | 9 | 5 | 45 |
10-20 | 13 | 15 | 195 |
20-30 | 8 | 25 | 200 |
30-40 | 15 | 35 | 525 |
40-50 | 10 | 45 | 450 |
Total | 55 | 1415 |
$\therefore$ The Mean of a grouped data is given by $\Sigma \mathrm{X}_{\mathrm{i}} \mathrm{f}_{\mathrm{i}} / \underline{\underline{\Sigma}} \mathrm{f}_{\mathrm{i}}$
Applying this, we get,
$
\begin{aligned}
& =\frac{1415 }{ 55} \\
& =25.73
\end{aligned}
$
Hence, the correct answer is 25.73.
The mean of ungrouped data often means the arithmetic mean or normal average of a given set of values.
Class size is the difference between the upper and lower boundaries of a class interval.
For example, in the class interval 0-10, the class size is 10 - 0 = 10
Mean are three types.
Arithmetic mean
Geometric mean
Harmonic mean
We can find the mean of a dataset by using three different methods.
These are:
Direct method
Assumed mean method
Step deviation method