### 2.8: Histogram

The histogram is a graphical representation in the x-y form of data distribution in a data set. The horizontal x-axis is labeled with what the data represents (for instance, distance from your home to school). The vertical y-axis is labeled either frequency or relative frequency (or percent frequency or probability).

A histogram graph consists of contiguous (adjoining) boxes. The heights of the bars correspond to frequency values. The graph will have the same shape with respective labels. The histogram (like the stemplot) can give the shape, the center, and the spread of the data. One will typically use a histogram to display large, continuous, quantitative data sets. The main advantage of a histogram is that it can readily display large data sets. A rule of thumb is to use a histogram when the data set consists of 100 values or more. To construct a histogram, one can decide how many bars or intervals, also called classes, represent the data. Many histograms consist of five to 15 bars or classes for clarity, yet one can choose the number of bars that are needed.