Mean of grouped and Ungrouped Data: Formula, Examples, Questions, Calculator

Quantitative Aptitude
Mean of grouped and Ungrouped Data: Formula, Examples, Questions, Calculator

Mean of grouped and Ungrouped Data: Formula, Examples, Questions, Calculator

Team Careers360Updated on 24 Sep 2024, 10:21 AM IST

Mean means the normal average or arithmetic mean of a dataset. The mean of grouped data formula includes finding the midpoints of class intervals and their frequencies. On the other hand mean of ungrouped data, definition says that if you divide the sum of the total number of observations by the number of observations, then you will get the mean of ungrouped data.

We will also discuss “mean of grouped data calculator”, “mean of grouped data with frequency”, “mean of grouped data questions”, “mean of grouped data example”, “mean of grouped data python”, “mean of ungrouped data calculator”, “mean of ungrouped data example”, “mean of ungrouped data class 10” etc.

Also read: mean of grouped data formula, mean of grouped data worksheet, mean of grouped data formula example, mean of grouped data and ungrouped data, mean of ungrouped data with frequency, mean of ungrouped data definition, mean of ungrouped data questions, mean of ungrouped data in statistics, mean of ungrouped data worksheet, mean of ungrouped data ppt

What are Ungrouped data?

When a data set is large, a frequency distribution table is often used to display the data in an organized way. A frequency distribution table lists the data values, as well as the number of times each value appears in the data set. A frequency distribution table is easy to both read and interpret and in this concept is used for ungrouped data, or data that is listed.

What is a frequency distribution table and tally column?

The numbers in a frequency distribution table do not have to be put in order. To make it easier to enter the values in the table, a tally column is often inserted. Inserting a tally column allows you to account for every value in the data set, without having to continually scan the numbers to find them in the list. A slash (/) is used to represent the presence of a value in the list, and the total number of slashes will be the frequency. If a tally column is inserted, the table will consist of 3 columns, and if no tally column is inserted, the table will consist of 2 columns.

How to find the mean of Ungrouped data?

The mean of ungrouped data often means the arithmetic mean or normal average of a given set of values.

The formula is:

Mean = $\frac{\text{Sum of all the Observations}} {\text{Total number of Observations}}$

Determining mean from a given list

Determine the mean of the given data.

5, 7, 9, 7, 3, 7

Sum of all observations = 5 + 7 + 9 + 8 + 3 + 10 = 42

Total number of observations = 6

Hence, the mean = $\frac{42}{7}$ = 6

Determining mean using frequency distribution table

Consider the following frequency distribution table.

Age interval	Calories intake
0-10	40
10-30	90
30-60	100
60-90	80

To calculate the mean we have to use the following formula.

Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$

Age interval	Midpoint	Calories intake	Midpoint × Calories intake
0-10	5	40	200
10-30	20	90	1800
30-60	45	100	4500
60-90	75	80	6000

Required mean = $\frac{200+1800+4500+6000}{40+90+100+80}=\frac{12500}{310}$ = 40.32

What is grouped data?

Data that has been organized into groups like classes or intervals is called Grouped data. This type of data presentation is useful for large datasets, making it easier to analyze and interpret the data. Grouping data helps in identifying patterns and trends more effectively than working with raw, unorganized data.

Representation of Grouped Data

Grouped data can be represented in many different ways like

Bar graph
Histogram
Frequency polygon

Bar graph:

A histogram is a type of bar graph that represents the frequency distribution of continuous data. Unlike bar graphs, histograms display data in intervals (bins) with no gaps between the bars. Each bar represents the frequency of data within that interval.

Histogram:

Frequency polygon:

A frequency polygon is a graphical representation of the distribution of a dataset. It is created by plotting points corresponding to the frequency of each class interval and then connecting these points with straight lines. Frequency polygons are useful for comparing multiple distributions on the same graph.

Frequency distribution table for Grouped data

A frequency distribution table organizes data into intervals (classes) and shows the number of observations (frequency) in each interval. It provides a summary of the dataset and helps in understanding the distribution of data.

Class interval	Frequency
0-10	5
10-20	8
30-40	12
40-50	10

Terms used in or derived from the frequency table

There are some key terms used in or can be derived from the frequency table.

Class size:

Class size is the difference between the upper and lower boundaries of a class interval.

For example, in the class interval 0-10, the class size is 10 - 0 = 10

Classmark(Midpoint):

Classmark is the average of the upper and lower boundaries of a class interval.

It is calculated by dividing the sum of upper and lower intervals by 2.

For example, in the class interval 0-10, the classmark is $\frac{0+10}{2}$ = 5

Class intervals:

Class intervals are the ranges into which data is grouped. There are two types of class intervals:

Continuous Class Interval: There are no gaps between intervals, and each interval includes the upper boundary of the previous interval. For example, 0-10, 10-20, 20-30, etc.
Discontinuous Class Interval: There are gaps between intervals, and each interval does not include the upper boundary of the previous interval. For example, 0-9, 10-19, 20-29, etc.

Frequency:

Frequency is the number of observations that fall within a particular class interval. It indicates how often a particular value or range of values occurs in the dataset.

Calculation of Mean for Grouped data using frequency distribution table

There are three basic methods to find the mean of grouped data

Direct Method
Assumed mean method
Step deviation method

Direct Method

In the Direct method to calculate the mean, we will use the midpoint of the class intervals and frequencies.

The formula is:

Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$

In notation, it can be written as follows:

Mean, $\bar{x}= \frac{ \sum f_i x_i}{\sum f_i}$,

Where

fi is the frequency of the ith class

xi is the midpoint of the ith class

Assumed mean method

In the Assumed mean method, we have to assume a mean and proceed with it to calculate the original mean.

The formula is:

Mean, $\bar{x}=a + \frac{ \sum f_i d_i}{\sum f_i}$,

Where

di = xi – A, is a derivation of the ith class

A is the assumed mean

fi is the frequency of the ith class

xi is the midpoint of the ith class

Assumed Mean Method = $\bar{x}=a+\frac{\sum f_i d_i}{\sum f_i}$

Step deviation method

The Step deviation method is used when all the class intervals are equal. It simplifies the calculation further.

The formula is:

Mean = $A + \frac{h \sum u_if_i}{\sum f_i}$,

Where

$u_i=\frac{x_i-A}{h}$, is a derivation of the ith class

A is the assumed mean

h is the class size

fi is the frequency of the ith class

xi is the midpoint of the ith class

Step Deviation Method $

=A+\frac{h\left(\sum u_{i}f_i\right)}{\sum f_i}

More Detailed Information

To learn more about Average or Arithmetic mean, read the below article.

Average

To learn more about the mode of grouped and ungrouped data, read the below article.

Mode

To learn more about the median of grouped and ungrouped data, read the below article.

Median

Important points

Mean = $\frac{\text{Sum of all the Observations}} {\text{Total number of Observations}}$
Using the Direct method to calculate, Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$
Using the Assumed mean method to calculate,
Mean, $\bar{x}=a + \frac{ \sum f_i d_i}{\sum f_i}$,

Where

di = xi – A, is a derivation of the ith class

A is the assumed mean

fi is the frequency of the ith class

xi is the midpoint of the ith class

Using the step deviation method to calculate,

Mean = $A + \frac{h \sum u_if_i}{\sum f_i}$,

Where

$u_i=\frac{x_i-A}{h}$, is a derivation of the ith class

A is the assumed mean

h is the class size

fi is the frequency of the ith class

xi is the midpoint of the ith class

Practice Questions

Q1. Calculate the mean from the following table.

Scores	Frequencies
0-10	2
10-20	4
20-30	12
30-40	21
40-50	6
50-60	3
60-70	2

34.2
33.4
32.6
35.8

Hint: Mean = $\frac{\sum\text{(Mid Point × Frequencies)}}{\sum \text{Frequencies}}$

Use this formula to calculate the mean.

Answer:

Scores	Frequencies	Mid Point	Mid Point × Frequencies
0-10	2	5	10
10-20	4	15	60
20-30	12	25	300
30-40	21	35	735
40-50	6	45	270
50-60	3	55	165
60-70	2	65	130

Mean = $\frac{\sum \text{ (Mid Point × Frequencies)}}{\sum \text{Frequencies}}$

= $\frac{10+60+300+735+270+165+130}{2+4+12+21+6+3+2}$

= $\frac{1670}{50}$

= 33.4

Hence, the correct answer is 33.4.

Q2. The mean of 20 observations was 42. It was found later that observation 54 was wrongly taken as 45. The corrected new mean is:

45.42
43.5
44.25
42.45

Hint: First, find the sum of initial observations. Then subtract the wrong value, add the correct value and find the final sum.

Answer:

The mean of 20 initial observations = 42

Sum of initial observations = 42 × 20 = 840

Wrong entry = 45

Correct entry = 54

Sum after correction = 840 – 45 + 54 = 849

$\therefore$ Final mean = $\frac{849}{20}$ = 42.45

Hence, the correct answer is 42.45.

Q3. Find the standard deviation of the following data (rounded off to two decimal places).

5, 3, 4, 7

1.48
3.21
4.12
3.45

Hint: Standard deviation = $\sqrt{\frac{\sum {x_i}^2}{n}-(\frac{\sum x_i}{n})^2}$

where $x_i$ are the numbers and $n$ is the count of numbers.

Answer:

Standard deviation of 5, 3, 4, 7

= $\sqrt{\frac{\sum {x_i}^2}{n}-(\frac{\sum x_i}{n})^2}$

where $x_i$ are the numbers and $n$ is the count of numbers.

= $\sqrt{\frac{5^2+3^2+4^2+7^2}{4}-(\frac{5+3+4+7)}{4})^2}$

= $\sqrt{\frac{99}{4}-\frac{361}{16}}$

= $\sqrt{\frac{35}{16}}$

= 1.48

Hence, the correct answer is 1.48.

Q4. In data analysis, which of the following statements about the mean (average) is true?

Outliers are not sensitive to the mean.
Categorical data can be analyzed using the mean.
The mean is the best way to describe the variability of data.
The mean reflects the dataset's central value.

Answer:

The core value of a dataset is represented by the mean.
The mean is a metric of central tendency that offers a single value that serves as the "typical" value to sum up a dataset.
D) The core value of a dataset is represented by the mean: This claim is true. The mean is a metric of central tendency that offers a single value that serves as the "typical" value to sum up a dataset.

A) The statement "The mean is not sensitive to outliers" is false. Extreme values have a considerable impact on the mean and are very sensitive to outliers.

B) The mean shouldn't be utilized for categorical data because it is normally used for numerical data.

C) Although the mean helps determine central tendency, it is not a good way to represent data variability.

Hence, the correct answer is "The mean reflects the dataset's central value."

Q5. The mean of the following numbers is 12, 15, 18, 21, 24, and 27.

15.789
17.83
18.26
17.65

Answer:

The Mean of ungrouped data is given by $\sum$ total values/Total no of values

Here,

The sum of $2,15,18,21,24$, and $27$ is $107$.

Total no of values $=6$

Therefore, the mean $=\frac{107 }{ 6}=17.83$

Hence, the correct answer is 17.83.

Q6. The table below shows the number of people belonging to specific age groups. Find the mean of the given data.

The class marks $x_i$ can be found by adding the upper and lower bounds and dividing them by $2 \cdot(20+0 / 2=10,20+40 / 2=30$ etc $\ldots)$

Age group	No of people
0-20	150
20-40	200
40-60	175
60-80	90

34.78
29.46
36.59
40.01

Answer:

Class Interval	Frequency fi	Class Marks xi	Xifi
0-20	150	10	1500
20-40	200	30	6000
40-60	175	50	8750
60-80	90	70	6300
Total	615		22,550

$\begin{aligned} \text { Mean } & =\Sigma \mathrm{X}_{\mathrm{i}} \mathrm{f}_{\mathrm{i}} / \underline{\boldsymbol{\Sigma}} \mathrm{f}_{\mathrm{i}} \\ & =\frac{22550 }{615} \\ & =36.59\end{aligned}$

Hence, the correct answer is 36.59.

Q7. The following table contains the number of patients admitted for cholera in St. Johns Hospital according to their age group. Find the mean of the following data.

Age group	0-10	10-20	20-30	30-40	40-50
No of patients	9	13	8	15	10

28.90
31.45
26.81
25.73

Answer:

The class marks can be found by adding the upper and lower bounds and dividing them by 2. (10 + 0 / 2 = 5, 20 + 10 / 2 = 15 etc...)

Class Interval	Frequency fi	Class Marks xi	Xifi
0-10	9	5	45
10-20	13	15	195
20-30	8	25	200
30-40	15	35	525
40-50	10	45	450
Total	55		1415

$\therefore$ The Mean of a grouped data is given by $\Sigma \mathrm{X}_{\mathrm{i}} \mathrm{f}_{\mathrm{i}} / \underline{\underline{\Sigma}} \mathrm{f}_{\mathrm{i}}$

Applying this, we get,
$
\begin{aligned}
& =\frac{1415 }{ 55} \\
& =25.73
\end{aligned}
$

Hence, the correct answer is 25.73.

Frequently Asked Questions (FAQs)

Q: What is mean in math?

The mean of ungrouped data often means the arithmetic mean or normal average of a given set of values.

Q: What is class size?

Class size is the difference between the upper and lower boundaries of a class interval.

For example, in the class interval 0-10, the class size is 10 - 0 = 10

Q: What are the types of mean?

Mean are three types.