Write a short notes on Histogram

Histogram

What is it?

A histogram is a bar chart/diagram showing a distribution of variable quantities or characteristics.

It is a graphical display of the frequency distribution of the numerical data.

The data are displayed as a series of rectangles of equal width and varying heights.

When do we use it?

 A histogram is used to show clearly where the most frequently occurring values are located and the data is distributed.

It is also a tool for determining the maximum process results.

It enables the analyst to quickly visualize the features of a complete set of data.

How do we construct it?

A histogram may be constructed using the following steps:

1. After the data collection, count the number of data values collected.

2. Determine the range of the data.
Range = Highest value – Lowest value

3. Divide the data values in groups or classes and count the number of values in each class. Table 1 shows the guidelines to divide the data values.

Table 1. Guidelines to form classes

Number of values Number of classes Number of values Number of classes
Less than 50 5 – 7 100 – 250 7 – 12
 50 – 100 6 – 10 More than 250 10 – 20

4. Now determine the width of the class.

Width of the class = Range / Number of classes selected from Table 1

5. Draw a frequency table for all values.

6. Construct a histogram based on the frequency table. For that, mark the class limits on the horizontal axis and the frequency on the vertical axis.

7. Finally write the title and number of values on the diagram.

Types of Histograms and their Interpretations

The various possible types of histograms are depicted in Figs.1(a) to (h).

Types of histogram

Fig. 1. Types of histograms
Each and every shape of histograms interpret some meaning/characteristic about the data values. Table 2 shows the interpretations of different types of histograms.
Table 2 . Types of histograms and their interpretations
Shape Description
Bell-shaped Symmetrical shape with a peak in middle representing a normal distribution (Fig.1(a)).
Double-peaked Two normal distributions with two peaks in middle indicating more than one distribution at work (Fig.1(b)).
Plateau Flat top, no distinct peak and tails indicating more than one distribution at work (Fig.1(c)).
Comb Alternative peaks showing possible errors in data collection and analysis (Fig.1(d)).
Isolated peak Two normal distributions suggesting two processes taking place at the same time (Fig.1(e)).
Edged peak A normal distribution curve with a large peak at one end indicating errors in data recording (Fig.1(f )).
Skewed An asymmetrical shape positively or negatively skewed – usually reflecting limits in the specification on one side (Fig.1(g)).
Truncated An asymmetrical shape with a peak at the end. Usually being a part of a normal distribution with part of it having been removed (Fig.1(h)).

 

About the author

Nandhini Sathish

View all posts

Leave a Reply