In multivariate statistics and probability theory, the scatter matrix is a statistic that is used to make estimates of the covariance matrix, for instance of the multivariate normal distribution.
Contents
Definition
Given n samples of m-dimensional data, represented as the m-by-n matrix,                     
where                     
The scatter matrix is the m-by-m positive semi-definite matrix
where                     
where                     
Application
The maximum likelihood estimate, given n samples, for the covariance matrix of a multivariate normal distribution can be expressed as the normalized scatter matrix
When the columns of                     
