Polychoric correlation

Updated on Apr 25, 2026

Edit

Comment

In statistics, polychoric correlation is a technique for estimating the correlation between two theorised normally distributed continuous latent variables, from two observed ordinal variables. Tetrachoric correlation is a special case of the polychoric correlation applicable when both observed variables are dichotomous. These names derive from the polychoric and tetrachoric series which are used for estimation of these correlations. These series were mathematical expansions once but not anymore.

Applications and examples

This technique is frequently applied when analysing items on self-report instruments such as personality tests and surveys that often use rating scales with a small number of response options (e.g., strongly disagree to strongly agree). The smaller the number of response categories, the more a correlation between latent continuous variables will tend to be attenuated. Lee, Poon & Bentler (1995) have recommended a two-step approach to factor analysis for assessing the factor structure of tests involving ordinally measured items. This aims to reduce the effect of statistical artifacts, such as the number of response scales or skewness of variables leading to items grouping together in factors.

Software

polycor package in R by John Fox [1]

psych package in R by William Revelle [2]

PRELIS

POLYCORR program

PROC CORR in SAS (with POLYCHORIC or OUTPLC= options) [3]

An extensive list of software for computing the polychoric correlation, by John Uebersax [4]

package polychoric in Stata by Stas Kolenikov [5]

References

Polychoric correlation Wikipedia

(Text) CC BY-SA

Contents

Applications and examples

Software

References