Chapter 1: Research Methods

Back to chapter

1.16:

Regression zum Mittelwert

JoVE Core
Social Psychology

A subscription to JoVE is required to view this content. Sign in or start your free trial.

JoVE Core Social Psychology

Regression Toward the Mean

Previous Video
1.15: Reliability and Validity

Next Video
1.17: Measures of Central Tendency

Languages

Share

English العربية 中文 Nederlands français Deutsch עברית italiano 日本語 한국어 português русский español Türkçe

Sometimes, people think that certain events are under personal control. For instance, they may assume that their favorite athlete will always perform at their best, and therefore, may be surprised when an amazing performance is followed by an average one. Notably, this trend occurs for multiple variables with normal distributions. For example, when a physician takes the patient’s blood pressure, her diastolic value measures 95 mmHg. The doctor then enters and compares this number to that of all of his patients, and notes that—when graphed—these values demonstrate a normal, bell-shaped distribution. Here, the woman’s result falls at the extreme end of the curve. Concerned, the doctor retests her blood pressure; surprisingly, he finds that the diastolic number decreased. In other words, the previously observed “extreme” pressure regressed to the mean for all the patients. It’s rare for someone’s results to be far from the average repeatedly, even at the higher end. This inclination is called regression toward the mean—the tendency for an extreme value, upon reassessment, to be followed by a less radical score, essentially moving nearer to the group mean. Consequently, if a therapy only influences individuals with initially extreme results, it’s likely an ineffective strategy. However, if a treatment lowers a group’s average—like for blood pressure—this is strong evidence of its success. Without awareness of this statistical phenomenon, the doctor might have distributed blood pressure medication when, in fact, his patient didn’t need treatment. In the end, results based on outliers shouldn’t be taken too seriously.

1.16:

Regression zum Mittelwert

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when researchers try to extrapolate results observed in a small sample to a larger population of interest.

Descriptive Statistics, Inferential Statistics and RTM

The field of statistics has two main subcategories, termed descriptive and inferential (for review, see Beins & McCarthy, 2019 and Franzoi, 2011). As the name suggests, the former seeks to “describe” data derived from a particular sample, e.g., newborns at a specific United States hospital whose mothers took a prenatal vitamin during pregnancy. Descriptive statistics typically include measures of central tendency (like the mean), gauges of variability (such as standard deviation), and graphs summarizing sample results. For example, descriptive statistics for our newborn example might include average birthweight and standard deviation, as well as a frequency distribution graph of birthweights for babies born at the hospital within the last year. Importantly, descriptive statistics only pertain to the sample being investigated, and, by themselves, can’t be used to draw conclusions about a larger population—in this case, all newborns in the United States born to mothers who took prenatal pills.

In contrast, inferential statistics are used by researchers to “infer” or draw conclusions about a population based on results calculated for a representative sample. Here, researchers might compare the average birthweight of babies born at the hospital to prenatal-taking mothers to that of newborns whose mothers did not. It might initially be observed that prenatal babies have a higher mean birthweight than non-prenatal babies. Using complex mathematical equations, inferential statistics can then determine if the difference between these two averages is significant—defined in science as having less than a 5% probability of being due to chance. If this is the case, conclusions can then be drawn about the greater population—in this instance, that throughout the country, newborns whose mothers took prenatal pills have a higher birthweight than those whose mothers did not.

Unfortunately, RTM can make it appear as if there is a significant difference between groups—due to a treatment, like prenatal medication above—when in reality no meaningful disparity exists, and any discrepancies are the result of random chance. Sometimes researchers even publish data stating that a particular regimen has been effective, when in fact their results stem from this statistical phenomenon; this has been the case for programs aimed at childhood obesity, among others (Skinner, Heymsfield, Pietrobelli, Faith, & Allison, 2015). Luckily, methods in inferential statistics have been developed that evaluate and take into account RTM, allowing researchers to be more confident in the veracity of their data, and the efficacy of any treatment (Barnett, van der Pols & Dobson, 2005).

RTM Across Different Fields

RTM has wide-reaching implications. In medical and scientific research, it has been observed for blood pressure (Bland & Altman, 1994), female bone density (Cummings, Palermo, Browner, et al., 2000), fetal heart rate (Park, Hoh, & Park, 2012), and even semen quality (Baker & Kovacs, 1985), to list a few examples. However, RTM extends beyond medical observations, and has been recorded in the stock market (Murstein, 2003), performance of flight students (Kahneman & Tversky, 1973), and has even been used as an explanation for why certain couples get divorced (as reviewed in Murstein, 2003). Thus, this statistical phenomenon affects numerous fields, from medicine to finances, and should be carefully considered by researchers using statistics to draw conclusions.

Suggested Reading

Beins, B. C. & McCarthy, M. A. (2019). Chapter 8, Basic Inferential Statistics. In Research Methods and Statistics in Psychology (2nd Ed.) (pp. 197-224). New York, NY: Cambridge University Press.

Franzoi, S. L. (2011). Chapter 2, Scientific Methods in Psychology. In Psychology: A Discovery Experience (pp. 28-59). Mason, OH: South-Western Cengage Learning.

Skinner, A. C., Heymsfield, S. B., Pietrobelli, A., Faith, M. S., & Allison, D. B. (2015). Ignoring Regression to the Mean Leads to Unsupported Conclusion About Obesity. International Journal of Behavioral Nutrition and Physical Activity, 12, 56. doi:10.1186/s12966-015-0212-6

Barnett, A. G., van der Pols, J. C., & Dobson, A. J. (2005). Regression to the Mean: What it is and How to Deal with It. International Journal of Epidemiology, 34(1), 215-220. https://doi.org/10.1093/ije/dyh299

Bland, J. M., & Altman, D. G. (1994). Some Examples of Regression Towards the Mean. BMJ (Clinical Research Ed.), 309(6957), 780. doi:10.1136/bmj.309.6957.780

Cummings, S. R., Palermo, L., Browner, W., et al. (2000). Monitoring Osteoporosis Therapy With Bone Densitometry: Misleading Changes and Regression to the Mean. JAMA; 283(10), 1318–1321. doi:https://doi.org/10.1001/jama.283.10.1318

Park, Y. S., Hoh, J. K., & Park, M. I. (2012). Fetal Heart Rate Regresses Toward the Mean in the Third Trimester. Journal of Korean Medical Science, 27(7), 794–798. doi:10.3346/jkms.2012.27.7.794

Baker, H. W. G. & Kovacs, G. T. (1985). Spontaneous Improvement in Semen Quality: Regression Towards the Mean. International Journal of Andrology, 8(6), 421-426. doi: 10.1111/j.1365-2605.1985.tb00855.x

Murstein, B. I. (2003) Regression to the Mean: One of the Most Neglected But Important Concepts in the Stock Market. Journal of Behavioral Finance, 4(4), 234-237. doi:10.1207/s15427579jpfm0404_6

Kahneman, D., & Tversky, A. (1973). On the Psychology of Prediction. Psychological Review, 80(4), 237–251. https://doi.org/10.1037/h0034747

Tags

Regression Toward The Mean Personal Control Athlete Performance Normal Distributions Blood Pressure Extreme Values Bell-shaped Distribution Therapy Effectiveness