Back to chapter

8.13:

Determination of Expected Frequency

JoVE Core
Statistics
A subscription to JoVE is required to view this content.  Sign in or start your free trial.
JoVE Core Statistics
Determination of Expected Frequency

Languages

Share

Consider a contingency table with two variables: alcohol consumption, and road accident fatality, with corresponding observed frequencies.

Suppose a researcher wants to conduct a test of independence, which requires both the observed and expected frequencies of the dataset.

But how will the researcher determine the expected frequency? First, the researcher assumes that the two variables are independent. So, the probability of any event involving both variables can be calculated using the multiplication rule for independent events.

The product of the probability value and the table's grand total gives the expected frequency for the first cell. From this, a general formula for the expected frequency is obtained, which can be further simplified.

This simplified formula is then used to calculate the expected frequencies of the other cells.

Once all the expected frequencies are calculated, the researcher can proceed to determine the chi-square test statistic and conduct the hypothesis test.

8.13:

Determination of Expected Frequency

Suppose one wants to test independence between the two variables of a contingency table. The values in the table constitute the observed frequencies of the dataset. But how does one determine the expected frequency of the dataset? One of the important assumptions is that the two variables are independent, which means the variables do not influence each other. For independent variables, the statistical probability of any event involving both variables is calculated by multiplying the individual probabilities in the contingency table. It is also important to note that the expected frequency for each column must be at least 5. The expected frequencies are then used to calculate the chi-square value and P-value.