Puneet Varma (Editor)

Self similarity matrix

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

In data analysis, the self-similarity matrix is a graphical representation of similar sequences in a data series.

Similarity can be explained by different measures, like spatial distance (distance matrix), correlation, or comparison of local histograms or spectral properties (e.g. IXEGRAM). This technique is also applied for the search of a given pattern in a long data series as in gene matching. A similarity plot can be the starting point for dot plots or recurrence plots.

Definition

To construct a self-similarity matrix, one first transforms a data series into an ordered sequences of feature vectors V = ( v 1 , v 2 , , v n ) , where each vector v i describes the relevant features of a data series in a given local interval. Then the self-similarity matrix is formed by computing the similarity of pairs of feature vectors

S ( j , k ) = s ( v j , v k ) j , k ( 1 , , n )

where s ( v j , v k ) is a function measuring the similarity of the two vectors, for instance, the inner product s ( v j , v k ) = v j v k . Then similar segments of feature vectors will show up as path of high similarity along diagonals of the matrix. Similarity plots are used for action recognition that is invariant to point of view and for audio segmentation using spectral clustering of the self-similarity matrix.

References

Self-similarity matrix Wikipedia