2.8: 直方图

Histogram
JoVE Core
Statistics
A subscription to JoVE is required to view this content.  Sign in or start your free trial.
JoVE Core Statistics
Histogram

13,875 Views

01:05 min
April 30, 2023

Overview

The histogram is a graphical representation in the x-y form of data distribution in a data set. The horizontal x-axis is labeled with what the data represents (for instance, distance from your home to school). The vertical y-axis is labeled either frequency or relative frequency (or percent frequency or probability).

A histogram graph consists of contiguous (adjoining) boxes. The heights of the bars correspond to frequency values. The graph will have the same shape with respective labels. The histogram (like the stemplot) can give the shape, the center, and the spread of the data. One will typically use a histogram to display large, continuous, quantitative data sets. The main advantage of a histogram is that it can readily display large data sets. A rule of thumb is to use a histogram when the data set consists of 100 values or more. To construct a histogram, one can decide how many bars or intervals, also called classes, represent the data. Many histograms consist of five to 15 bars or classes for clarity, yet one can choose the number of bars that are needed.

Transcript

回想一下,频率分布表有助于组织具有多个类别的定量数据,例如不同价格范围的书籍数量。

这种频率分布表可以使用直方图直观地表示,直方图是一种由等宽条形组成的图形,没有间隙。

纵轴表示每个类中的频率,横轴表示类边界。

那么,什么是阶级界限?数据表中的第一个区间显示 5 到 10 美元的价格,第二个区间提供 11 到 16 美元的价格范围。

请注意,表中缺少 10 到 11 之间的价格范围。通过计算它们的中点(称为类边界)来填补此空白。

这些类边界在横轴上表示,相应的频率在纵轴上表示。竖线(称为 bin)连接类边界和频率值。

Key Terms and definitions​

Learning Objectives

Questions that this video will help you answer

This video is also useful for