Supriya Ghosh (Editor)

Higher order singular value decomposition

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

A higher-order singular value decomposition (HOSVD) is a singular value decomposition (SVD). It may be referred to as M-mode SVD, or Multilinear SVD. The M-mode SVD and the Tucker Decomposition terminology was coined in the 1980s by P. Kroonenberg, but it was later referred to as Multilinear SVD/HOSVD by L. De Lathauwer. The term HOSVD is a misnomer since in multilinear algebra, there does not exist an SVD method for multi-way arrays (also known as M-way arrays, or informally data tensors) with all the properties of a matrix SVD. A matrix SVD computes both a

Contents

(a) a rank-R decomposition and (b) the orthonormal row/column matrices.

These two capabilities are embodied by two different decompositions for multi-way arrays.

Property (a) may be computed by employing the CANDECOMP/PARAFAC (CP) decomposition. CP is a linear decomposition that represents a tensor as the sum N of rank-1 tensors, for apriori user specified R. This should not be confused with the rank-R decomposition which is the minimum number of R rank-1 tensors that decomposes a tensor exactly. A rank-1 tensor is the outer product of m vectors, where m is the number of the tensor modes.

Property (b) is extended to higher order tensors by a class of methods known variably as Tucker3, M-mode SVD, "Multilinear SVD" or "Higher Order SVD". (This article will use the term "Tucker decomposition".) These methods compute the orthonormal spaces associated with the different modes (axes) of a tensor. The Tucker decomposition is used in multilinear principal component analysis.

Historically, much of the interest in multilinear SVDs was driven by the need to analyze empirical data, especially in psychometrics and chemometrics. Abstract and general mathematical theorems are rare (though see Kruskal with regard to the CP decomposition); instead, the methods are often designed for analyzing specific data types. The 2008 review article by Kolda and Bader provides a compact summary of the history of these decompositions, and many references for further reading.

Tucker decomposition / M-mode SVD / Multilinear SVD / Higher Order SVD

The Tucker Decomposition is the Multilinear SVD.

History

In 1966, L. Tucker proposed a decomposition method for three-way arrays (referred to as a 3-mode "tensors") as a multidimensional extension of factor analysis. This decomposition was further developed in the 1980s by P. Kroonenberg, who coined the terms Tucker3, Tucker3ALS (an alternating least squares dimensionality reduction algorithm), 3-Mode SVD, and 3-Mode PCA. In the intervening years, several authors developed the decomposition for N-way arrays. Most recently, this work was treated in an elegant fashion and introduced to the SIAM community by L. De Lathauwer et al. who referred to the decomposition as a multilinear SVD and HOSVD.

Definitions

Let the SVD of a real matrix be A = U S V T , then it can be written in an element-wise form as

a i 1 , i 2 = j 1 j 2 s j 1 , j 2 u i 1 , j 1 v i 2 , j 2 .

U and V give, in a certain sense optimal, orthonormal basis for the column and row space, S is diagonal with decreasing elements. The higher-order singular value decomposition (HOSVD) can be defined by the multidimensional generalization of this concept:

a i 1 , i 2 , , i N = j 1 j 2 j N s j 1 , j 2 , , j N u i 1 , j 1 ( 1 ) u i 2 , j 2 ( 2 ) u i N , j N ( N ) ,

where the U ( n ) = [ u i , j ( n ) ] I n × I n matrices and the S = [ s j 1 , , j N ] I 1 × I 2 × × I N core tensor should satisfy certain requirements (similar ones to the matrix SVD), namely

  • Each U ( n ) is an orthogonal matrix.
  • Two subtensors of the core tensor S are orthogonal i.e., S i n = p , S i n = q = 0 if p q .
  • The subtensors in the core tensor S are ordered according to their Frobenius norm, i.e. S i n = 1 S i n = 2 S i n = I n for n = 1, ..., N.
  • Notation:

    A = S × n = 1 N U ( n )

    Algorithm

    The HOSVD can be built from several SVDs, as follows:

    1. Given a tensor A R I 1 × I 2 × × I N , construct the mode-k flattening A ( k ) . That is, the I k × ( j k I j ) matrix that corresponds to A .
    2. Compute the singular value decomposition A ( k ) = U k Σ k V k T , and store the left singular vectors U k .
    3. The core tensor S is then the projection of A onto the tensor basis formed by the factor matrices { U n } n = 1 N , i.e., S = A × n = 1 N U n T .

    Applications

    The HOSVD is most commonly applied to the extraction of relevant information from multi-way arrays.

    Circa 2001, Vasilescu reframed the data analysis, recognition and synthesis problems as multilinear tensor problems based on the insight that most observed data are the result of several causal factors of data formation, and are well suited for multi-modal data tensor analysis. The power of the tensor framework was showcased in a visually and mathematically compelling manner by decomposing and representing an image in terms of its causal factors of data formation, in the context of Human Motion Signatures (CVPR 2001, ICPR 2002), face recognition - TensorFaces, (ECCV 2002, CVPR 2003, etc.) and computer graphics—TensorTextures (Siggraph 2004).

    The Multilinear SVD has been successfully applied to signal processing and big data, e.g., in genomic signal processing. These applications also inspired a higher-order GSVD (HO GSVD) and a tensor GSVD.

    A combination of HOSVD and SVD also has been applied for real-time event detection from complex data streams (multivariate data with space and time dimensions) in Disease surveillance.

    It is also used in tensor product model transformation-based controller design. In multilinear subspace learning, it was applied to modeling tensor objects ]] for gait recognition.

    In machine learning, the CP-decomposition is the central ingredient in learning probabilistic latent variables models via the technique of moment-matching. For example, consider the multi-view model which is a probabilistic latent variable model. In this model, the generation of samples are posited as follows: there exists a hidden random variable that is not observed directly, given which, there are several conditionally independent random variables known as the different "views" of the hidden variable. For simplicity, assume there are three symmetrical views x of a k -state categorical hidden variable h . Then the empirical third moment of this latent variable model can be written as: T = i = 1 k P r ( h = k ) E [ x | h = k ] 3 .

    In applications such as topic modeling, this can be interpreted as the co-occurrence of words in a document. Then the eigenvalues of this empirical moment tensor can be interpreted as the probability of choosing a specific topic and each column of the factor matrix E [ x | h = k ] corresponds to probabilities of words in the vocabulary in the corresponding topic.

    The concept of Multilinear SVD was carried over to functions by Baranyi and Yam via the TP model transformation . This extension led to the definition of the HOSVD based canonical form of tensor product functions and Linear Parameter Varying system models and to convex hull manipulation based control optimization theory, see TP model transformation in control theories.

    References

    Higher-order singular value decomposition Wikipedia