Data Representation: Definition, Types and Examples

Data Representation: Definition, Types and Examples

Edited By Komal Miglani | Updated on Jul 02, 2025 07:54 PM IST

Data is the collection of facts. Representation of data effectively is an important part of making complex decisions. It helps in understanding complex decisions. The way data is represented can be helpful in proving different insights. This representation describe the data in a better way and help the analyst to analyze the data in a better way and take out the insights from it.

This Story also Contains
  1. Representation of Data
  2. Importance of Data Representations
  3. Solved Examples Based on Representations of Data
Data Representation: Definition, Types and Examples
Data Representation: Definition, Types and Examples

This article is about the concept Representation of Data. This is an important concept which falls under the broader category of Statistics. This is not only important for board exams but also for various competitive exams.

Representation of Data

Any bit of information is data. For example, the marks you obtained in your Math exam are data. Data is a collection of information, measurements or observations.

The data once collected must be arranged or organized in a way so that inferences or conclusions can be made out from it.

The following are the ways to represent data

  1. Ungrouped distribution
  2. Ungrouped frequency distribution
  3. Grouped frequency distribution

The frequency of any value is the number of times that value appears in a data set.

Ungrouped distribution

Consider the marks obtained (out of $100$ marks) by $30$ students of Class $XI$ of a school:

$\begin{equation}
\begin{array}{llllllllll}
10 & 20 & 36 & 92 & 95 & 40 & 50 & 56 & 60 & 70 \\
92 & 88 & 80 & 70 & 72 & 70 & 36 & 40 & 36 & 40 \\
92 & 40 & 50 & 50 & 56 & 60 & 70 & 60 & 60 & 88
\end{array}
\end{equation}$

This representation is called Ungrouped distribution, as all the values are simply mentioned and separated by comma.

Ungrouped Frequency Distribution

Observe that, $4$ students got $70$ marks. So the frequency of $70$ marks is $4$.

To make the data more easily understandable, we create a table,

$\begin{array}{|c|c|}\hline \mathbf { Marks } & {\mathbf { Number\;of \;students }} \\ \hline 10 & {1} \\ 20 & {1} \\ {36} & {3} \\ {40} & {4} \\ {50} & {3} \\ {56} & {2} \\ {60} & {4} \\ {70} & {4} \\ {72} & {1} \\ {80} & {1} \\ {98} & {2} \\ {92} & {3} \\ {95} & {1} \\ \hline{\mathbf { Total }} & \mathbf{30} \\ \hline\end{array}$

The above table is called a Ungrouped Frequency Distribution.

Grouped Frequency Distribution

We can show data as ranges of marks and the number of students that obtained marks in that range.

So we can represent this data as

$\begin{array}{|l|c|c|c|c|c|c|}\hline \text { Class interval } & {10-25} & {25-40} & {40-55} & {55-70} & {70-85} & {85-100} \\ \hline \text { Number of students } & {2} & {3} & {7} & {6} & {6} & {6} \\ \hline\end{array}$

Here we have taken groups (range) of marks. This is why it is called Grouped distribution.

Also, the difference in marks in each interval is $15 (25-10 = 15, 70-55=15,...).$ This number is called the width of the class interval. Here width is 15, but we can take any width as per our convenience.

The above table is called a Grouped frequency distribution.

Relative frequency distribution

Relative frequency distribution is the distribution divided by the total number of observations.

$f_r-$ Relative frequency of a data or class interval
$f-$ frequency of a data or class interval
$N-$ sum of frequencies

$
f_r=\frac{f}{N}
$

Cumulative frequency distribution

The cumulative frequency of a value is its frequency plus the frequencies of all smaller values.

Example:

$ \begin{equation}
\begin{array}{ccc}
x & f & c . f \\
0 & 2 & 2 \\
1 & 4 & 6 \\
2 & 4 & 10 \\
3 & \underline{6} & 16 \\
& 16 &
\end{array}
\end{equation}$

The cumulative relative frequency of any value is similarly defined as the relative frequency of the value plus the relative frequencies of all smaller values.

Example:

$\begin{equation}
\begin{array}{ccc}
x & 1 / N=f_r & \text { c.r.f } \\
0 & 0.08 & 0.08 \\
1 & 0.16 & 0.24 \\
2 & 0.16 & 0.40 \\
3 & 0.24 & 0.64
\end{array}
\end{equation}$

Importance of Data Representations

1. Proper data representations make data easier and clearer to understand.

2. Effectively represented data can be communicated broadly to a larger audience.

3. Decision-making relies on data representation properly.

Recommended Video Based on Representation of Data

Solved Examples Based on Representations of Data

Example 1. What is the frequency of class interval $5$ - $10$ of the raw data - $ 3,2,1,5,6,9,0,10,7,8,10,5,6,7,3 $
1) $7$
2) $8$
3) $10$
4) $6$

Solution

As we learned

Discrete frequency distribution -

A representation of data in which each outcome is paired with its frequency is called a frequency distribution.

Example:

$\begin{equation}
\begin{array}{lllllll}
x: & 0 & 1 & 2 & 3 & 4 & 5 \\
\mathrm{f:} & 2 & 4 & 4 & 6 & 4 & 5
\end{array}
\end{equation}$

Class interval $5$ - $10$ has $\begin{equation}
5,6,9,7,8,6,7
\end{equation}$

We don't include $10$.

Hence, the answer is option 1.

Example 2: If the frequencies of the first four numbers out of $1, 2, 4, 6, 8$ are $2, 3, 3 , 2$ respectively, then the frequency of $8$ if their AM is $5$, is

1) $4$

2) $5$

3) $6$

4) none of these

Solution

As we learned

Discrete frequency distribution -

A representation of data in which each outcome is paired with its frequency is called a frequency distribution.

Here mean A=$5$

Let the frequency of $8$ be $x.$ Then by the formula

$\begin{equation}
\begin{aligned}
&\begin{aligned}
& A=\frac{\sum x f}{\sum f} \\
& 5=\frac{1.2+2.3+4.3+6.2+8 . x}{2+3+3+2+x}=\frac{32+8 x}{10+x}
\end{aligned}\\
&\text { of } 18=3 x ; \ x=6 \text {. }
\end{aligned}
\end{equation}$

Hence, the answer is an option 3.

Example 3: What is the interval width of data largest value = $20$, Last Value = $3$ number of intervals to splits= $3$?

1) $5$

2) $6$

3) $7$

4) does not exist

Solution

As we learned

Group frequency distribution -

The data is grouped into intervals: ${30-40, 40-50 ......}$ It reduces the cumbersome task of representing every value in the distribution as a unit as in discrete frequency distribution.

- wherein

Interval Width: $\frac{x_i-x_s+1}{n}$

Where $x_i$ is the largest value, $x_s$ is the smallest value and $n$ is a number of intervals to split data.

Interval width = $\frac{20-3+1}{3}=\frac{18}{3}=6$

Hence, the answer is the option (2).

Example 4: What is the relative frequency distribution for the table 5 - 10 ?

Class fi
$1-5$ $6$
$5-10$ $14$
$10-15$ $3$
$15-20$ $27$

Solution

As we learned

Relative frequency distribution -

Relative frequency distribution is the distribution divided by the total number of observations.

Since relative frequency $y=\frac{14}{6+14+3+27} \times 100\%=\frac{14}{50} \times 100\% = 28\%$

Example 5: Which class interval has maximum relative frequency?

$\begin{equation}
\begin{array}{ll}
\text { class } & \text { frequency } \\
0-2 & 3 \\
2-4 & 2 \\
4-6 & 5 \\
6-8 & 8 \\
8-10 & 2 \\
10-12 & 1
\end{array}
\end{equation}$

1) $0-4$
2) $2-6$
3) $4-8$
4) $6-10$

Solution

As we learned

Relative frequency distribution -
$
f_r=\frac{f}{N}
$
wherein
$f_r-$ Relative frequency of a data or class interval
$f-$ frequency of a data or class interval
$N-$ sum of frequencies
$
f r=\frac{f}{N}
$
$4-8$ has $13 ; 6-10$ has $10$ frequencies.
Thus $4$ - $8$ is the answer.

Hence, the answer is option 3.


Frequently Asked Questions (FAQs)

1. What is data?

Data is a collection of information, measurements or observations.

2. What is representation of data?

Arranging or organizing data in such a way that a conclusion can be drawn is called the representation of data.

3. What are 3 types of data representation?

There are 3 ways to represent data. They are
1. Ungrouped distribution
2. Ungrouped frequency distribution
3. Grouped frequency distribution

4. What is the difference between frequency and cumulative frequency?

The frequency of any value is the number of times that value appears in a data set. Cumulative frequency depends on the normal frequency. The cumulative frequency of a value is its frequency plus the frequencies of all smaller values.

5. What is meant by relative frequency and cumulative frequency?

Relative frequency distribution is the distribution divided by the total number of observations while cumulative frequency of a value is its frequency plus the freqencies of all small values.

6. Why is data representation important in statistical analysis?
Data representation is crucial in statistical analysis because it helps simplify complex information, making it more accessible and interpretable. It allows researchers and analysts to quickly identify patterns, outliers, and trends that might not be apparent in raw data. Effective data representation also facilitates communication of findings to a wider audience and supports informed decision-making.
7. How do tables differ from graphs in data representation?
Tables and graphs are both forms of data representation, but they serve different purposes:
8. What is a scatter plot and when is it used?
A scatter plot is a type of graph that displays the relationship between two continuous variables. It uses dots to represent individual data points, with one variable on the x-axis and the other on the y-axis. Scatter plots are used to:
9. How does a histogram differ from a bar chart?
While both histograms and bar charts use rectangular bars to represent data, they differ in several ways:
10. What is a box plot (box-and-whisker plot) and what information does it provide?
A box plot, also known as a box-and-whisker plot, is a graphical representation of the distribution of a dataset. It provides information about:
11. What is data representation in statistics?
Data representation in statistics refers to the various methods and formats used to organize, summarize, and present data in a meaningful way. It involves converting raw data into visual or numerical forms that are easier to understand and analyze, helping to reveal patterns, trends, and relationships within the data.
12. What are the main types of data representation?
The main types of data representation include:
13. What is the difference between qualitative and quantitative data representation?
Qualitative and quantitative data representation differ in the type of data they present:
14. What is a frequency distribution table?
A frequency distribution table is a way to organize and summarize data by showing how often each value or category occurs within a dataset. It typically consists of two columns: one for the data values or categories, and another for their corresponding frequencies (counts). This type of table is useful for identifying patterns and central tendencies in the data.
15. How does a pie chart represent data?
A pie chart represents data as slices of a circular "pie," where each slice represents a category's proportion of the whole. The size of each slice is proportional to the percentage it represents. Pie charts are useful for showing:
16. What is the purpose of a stem-and-leaf plot?
A stem-and-leaf plot is a method of organizing and displaying numerical data to show its distribution. Its purposes include:
17. How does a line graph differ from other types of graphs?
A line graph is unique in that it:
18. What is a heat map and when is it useful in data representation?
A heat map is a graphical representation of data where individual values are represented as colors. It's useful for:
19. How do pictographs represent data and what are their limitations?
Pictographs use icons or symbols to represent data, with each icon representing a specific quantity. They are:
20. What is the difference between a population pyramid and a regular bar chart?
A population pyramid is a specialized type of bar chart used to display the age and sex distribution of a population. Key differences include:
21. How does a Pareto chart combine different types of data representation?
A Pareto chart combines a bar graph and a line graph to show both individual values and cumulative totals. It's used to identify the most significant factors in a dataset. Features include:
22. What is a treemap and how does it represent hierarchical data?
A treemap is a data visualization method that displays hierarchical data as nested rectangles. Key features include:
23. How do radar charts (spider charts) represent multivariable data?
Radar charts, also known as spider charts, represent multivariable data on a two-dimensional chart with three or more quantitative variables. They:
24. What is the purpose of a dot plot in data representation?
A dot plot is a simple yet effective method of displaying data distribution. Its purposes include:
25. How does a Sankey diagram represent flows and relationships in data?
A Sankey diagram is a type of flow diagram where the width of the arrows is proportional to the flow quantity. It's used to:
26. What is the difference between a stacked bar chart and a grouped bar chart?
Stacked and grouped bar charts are both used to compare multiple categories, but they differ in how they present the data:
27. How does a violin plot combine aspects of box plots and density plots?
A violin plot is a hybrid between a box plot and a density plot. It shows:
28. What is the purpose of a waffle chart in data representation?
A waffle chart, also known as a square pie chart, is used to display data in a grid of squares. Its purposes include:
29. How does a stream graph (or streamgraph) represent time-series data?
A stream graph is a type of stacked area graph that represents how multiple categories change over time. Key features include:
30. What is the difference between a choropleth map and a cartogram?
Choropleth maps and cartograms are both used to represent geographical data, but they differ in their approach:
31. How does a parallel coordinates plot represent multidimensional data?
A parallel coordinates plot is a way to visualize and analyze multivariate data. Key features include:
32. What is the purpose of a Venn diagram in data representation?
Venn diagrams are used to show logical relationships between sets or groups. Their purposes include:
33. How does a bubble chart extend the concept of a scatter plot?
A bubble chart is an extension of a scatter plot that adds a third dimension to the data representation. Key features include:
34. What is a mosaic plot and how does it represent the relationship between categorical variables?
A mosaic plot, also known as a Marimekko chart, is a graphical representation of the relationship between two or more categorical variables. Key features include:
35. How does a funnel chart represent stages in a process?
A funnel chart is used to visualize stages in a sequential process, typically showing decreasing values. Key features include:
36. What is the difference between a line plot and a step plot?
Line plots and step plots are both used to show changes over time or ordered categories, but they differ in how they represent transitions:

Articles

Back to top