Chapter 8: Distributions

Back to chapter

8.13:

Determination of Expected Frequency

JoVE Core
Statistics

A subscription to JoVE is required to view this content. Sign in or start your free trial.

JoVE Core Statistics

Determination of Expected Frequency

Previous Video
8.12: Hypothesis Test for Test of Independence

Next Video
8.14: Test for Homogeneity

Languages

Share

English العربية 中文 Nederlands français Deutsch עברית italiano 日本語 한국어 português русский español Türkçe

Consider a contingency table with two variables: alcohol consumption, and road accident fatality, with corresponding observed frequencies.

Suppose a researcher wants to conduct a test of independence, which requires both the observed and expected frequencies of the dataset.

But how will the researcher determine the expected frequency? First, the researcher assumes that the two variables are independent. So, the probability of any event involving both variables can be calculated using the multiplication rule for independent events.

The product of the probability value and the table's grand total gives the expected frequency for the first cell. From this, a general formula for the expected frequency is obtained, which can be further simplified.

This simplified formula is then used to calculate the expected frequencies of the other cells.

Once all the expected frequencies are calculated, the researcher can proceed to determine the chi-square test statistic and conduct the hypothesis test.

8.13:

Determination of Expected Frequency

Suppose one wants to test independence between the two variables of a contingency table. The values in the table constitute the observed frequencies of the dataset. But how does one determine the expected frequency of the dataset? One of the important assumptions is that the two variables are independent, which means the variables do not influence each other. For independent variables, the statistical probability of any event involving both variables is calculated by multiplying the individual probabilities in the contingency table. It is also important to note that the expected frequency for each column must be at least 5. The expected frequencies are then used to calculate the chi-square value and P-value.

Tags

Determination Expected Frequency Test Independence Contingency Table Observed Frequencies Variables Statistical Probability Event Individual Probabilities Influence Chi-square Value P-value