Theorem. Let
d
=
{
d
i
}
i
=
1
N
and
λ
=
{
λ
i
}
i
=
1
N
be vectors in
R
N
such that their entries are in non-increasing order. There is a Hermitian matrix with diagonal values
{
d
i
}
i
=
1
N
and eigenvalues
{
λ
i
}
i
=
1
N
if and only if
∑
i
=
1
n
d
i
≤
∑
i
=
1
n
λ
i
n
=
1
,
2
,
…
,
N
and
∑
i
=
1
N
d
i
=
∑
i
=
1
N
λ
i
.
The permutation polytope generated by
x
~
=
(
x
1
,
x
2
,
…
,
x
n
)
∈
R
n
denoted by
K
x
~
is defined as the convex hull of the set
{
(
x
π
(
1
)
,
x
π
(
2
)
,
…
,
x
π
(
n
)
)
∈
R
n
:
π
∈
S
n
}
. Here
S
n
denotes the symmetric group on
{
1
,
2
,
…
,
n
}
. The following lemma characterizes the permutation polytope of a vector in
R
n
.
Lemma. If
x
1
≥
x
2
≥
⋯
≥
x
n
,
y
1
≥
y
2
≥
⋯
≥
y
n
, and
x
1
+
x
2
+
⋯
+
x
n
=
y
1
+
y
2
+
⋯
+
y
n
,
then the following are equivalent :
(i)
(
y
1
,
y
2
,
⋯
,
y
n
)
(
=
y
~
)
∈
K
x
~
.
(ii)
y
1
≤
x
1
,
y
1
+
y
2
≤
x
1
+
x
2
,
…
,
y
1
+
y
2
+
⋯
+
y
n
−
1
≤
x
1
+
x
2
+
⋯
+
x
n
(iii) There are points
(
x
1
(
1
)
,
x
2
(
1
)
,
⋯
,
x
n
(
1
)
)
(
=
x
~
1
)
,
…
,
(
x
1
(
n
)
,
x
2
(
n
)
,
…
,
x
n
(
n
)
)
(
=
x
~
n
)
in
K
x
~
such that
x
~
1
=
x
~
,
x
~
n
=
y
~
,
and
x
~
k
+
1
=
t
x
~
k
+
(
1
−
t
)
τ
(
x
k
~
)
for each
k
in
{
1
,
2
,
…
,
n
−
1
}
, some transposition
τ
in
S
n
, and some
t
in
[
0
,
1
]
, depending on
k
.
In view of the equivalence of (i) and (ii) in the lemma mentioned above, one may reformulate the theorem in the following manner.
Theorem. Let
d
=
{
d
i
}
i
=
1
N
and
λ
=
{
λ
i
}
i
=
1
N
be real vectors. There is a Hermitian matrix with diagonal entries
{
d
i
}
i
=
1
N
and eigenvalues
{
λ
i
}
i
=
1
N
if and only if the vector
(
d
1
,
d
2
,
…
,
d
n
)
is in the permutation polytope generated by
(
λ
1
,
λ
2
,
…
,
λ
n
)
.
Note that in this formulation, one does not need to impose any ordering on the entries of the vectors
d
and
λ
.
Let
A
(
=
a
j
k
)
be a
n
×
n
Hermitian matrix with eigenvalues
{
λ
i
}
i
=
1
n
, counted with multiplicity. Denote the diagonal of
A
by
a
~
, thought of as a vector in
R
n
, and the vector
(
λ
1
,
λ
2
,
…
,
λ
n
)
by
λ
~
. Let
Λ
be the diagonal matrix having
λ
1
,
λ
2
,
…
,
λ
n
on its diagonal.
(
⇒
)
A
may be written in the form
U
Λ
U
−
1
, where
U
is a unitary matrix. Then
a
i
i
=
∑
j
=
1
n
λ
j
|
u
i
j
|
2
,
i
=
1
,
2
,
…
,
n
Let
S
=
(
s
i
j
)
be the matrix defined by
s
i
j
=
|
u
i
j
|
2
. Since
U
is a unitary matrix,
S
is a doubly stochastic matrix and we have
a
~
=
S
λ
~
. By the Birkhoff–von Neumann theorem,
S
can be written as a convex combination of permutation matrices. Thus
a
~
is in the permutation polytope generated by
λ
~
. This proves Schur's theorem.
(
⇐
) If
a
~
occurs as the diagonal of a Hermitian matrix with eigenvalues
{
λ
i
}
i
=
1
n
, then
t
a
~
+
(
1
−
t
)
τ
(
a
~
)
also occurs as the diagonal of some Hermitian matrix with the same set of eigenvalues, for any transposition
τ
in
S
n
. One may prove that in the following manner.
Let
ξ
be a complex number of modulus
1
such that
ξ
a
j
k
¯
=
−
ξ
a
j
k
and
U
be a unitary matrix with
ξ
t
,
t
in the
j
,
j
and
k
,
k
entries, respectively,
−
1
−
t
2
,
ξ
1
−
t
2
at the
j
,
k
and
k
,
j
entries, respectively,
1
at all diagonal entries other than
j
,
j
and
k
,
k
, and
0
at all other entries. Then
U
A
U
−
1
has
t
a
j
j
+
(
1
−
t
)
a
k
k
at the
j
,
j
entry,
(
1
−
t
)
a
j
j
+
t
a
k
k
at the
k
,
k
entry, and
a
l
l
at the
l
,
l
entry where
l
≠
j
,
k
. Let
τ
be the transposition of
{
1
,
2
,
…
,
n
}
that interchanges
j
and
k
.
Then the diagonal of
U
A
U
−
1
is
t
a
~
+
(
1
−
t
)
τ
(
a
~
)
.
Λ
is a Hermitian matrix with eigenvalues
{
λ
i
}
i
=
1
n
. Using the equivalence of (i) and (iii) in the lemma mentioned above, we see that any vector in the permutation polytope generated by
(
λ
1
,
λ
2
,
…
,
λ
n
)
, occurs as the diagonal of a Hermitian matrix with the prescribed eigenvalues. This proves Horn's theorem.
The Schur–Horn theorem may be viewed as a corollary of the Atiyah–Guillemin–Sternberg convexity theorem in the following manner. Let
U
(
n
)
denote the group of
n
×
n
unitary matrices. Its Lie algebra, denoted by
u
(
n
)
, is the set of skew-Hermitian matrices. One may identify the dual space
u
(
n
)
∗
with the set of Hermitian matrices
H
(
n
)
via the linear isomorphism
Ψ
:
H
(
n
)
→
u
(
n
)
∗
defined by
Ψ
(
A
)
(
B
)
=
t
r
(
i
A
B
)
for
A
∈
H
(
n
)
,
B
∈
u
(
n
)
. The unitary group
U
(
n
)
acts on
H
(
n
)
by conjugation and acts on
u
(
n
)
∗
by the coadjoint action. Under these actions,
Ψ
is an
U
(
n
)
-equivariant map i.e. for every
U
∈
U
(
n
)
the following diagram commutes,
Let
λ
~
=
(
λ
1
,
λ
2
,
…
,
λ
n
)
∈
R
n
and
Λ
∈
H
(
n
)
denote the diagonal matrix with entries given by
λ
~
. Let
O
λ
~
denote the orbit of
Λ
under the
U
(
n
)
-action i.e. conjugation. Under the
U
(
n
)
-equivariant isomorphism
Ψ
, the symplectic structure on the corresponding coadjoint orbit may be brought onto
O
λ
~
. Thus
O
λ
~
is a Hamiltonian
U
(
n
)
-manifold.
Let
T
denote the Cartan subgroup of
U
(
n
)
which consists of diagonal complex matrices with diagonal entries of modulus
1
. The Lie algebra
t
of
T
consists of diagonal skew-Hermitian matrices and the dual space
t
∗
consists of diagonal Hermitian matrices, under the isomorphism
Ψ
. In other words,
t
consists of diagonal matrices with purely imaginary entries and
t
∗
consists of diagonal matrices with real entries. The inclusion map
t
↪
u
(
n
)
induces a map
Φ
:
H
(
n
)
≅
u
(
n
)
∗
→
t
∗
, which projects a matrix
A
to the diagonal matrix with the same diagonal entries as
A
. The set
O
λ
~
is a Hamiltonian
T
-manifold, and the restriction of
Φ
to this set is a moment map for this action.
By the Atiyah–Guillemin–Sternberg theorem,
Φ
(
O
λ
~
)
is a convex polytope. A matrix
A
∈
H
(
n
)
is fixed under conjugation by every element of
T
if and only if
A
is diagonal. The only diagonal matrices in
O
λ
~
are the ones with diagonal entries
λ
1
,
λ
2
,
…
,
λ
n
in some order. Thus, these matrices generate the convex polytope
Φ
(
O
λ
~
)
. This is exactly the statement of the Schur–Horn theorem.