The theory of STAP was first published by Lawrence E. Brennan and Irving S. Reed in the early 1970s. At the time of publication, both Brennan and Reed were at Technology Service Corporation (TSC). While it was formally introduced in 1973, it has theoretical roots dating back to 1959.
For groundbased radar, cluttered returns tend to be at DC, making them easily discriminated by Moving Target Indication (MTI). Thus, a notch filter at the zeroDoppler bin can be used. Airborne platforms with ownship motion experience relative ground clutter motion dependent on the angle, resulting in angleDoppler coupling at the input. In this case, 1D filtering is not sufficient, since clutter can overlap the desired target's Doppler from multiple directions. The resulting interference is typically called a "clutter ridge," since it forms a line in the angleDoppler domain. Narrowband jamming signals are also a source of interference, and exhibit significant spatial correlation. Thus receiver noise and interference must be considered, and detection processors must attempt to maximize the signaltointerference and noise ratio (SINR).
While primarily developed for radar, STAP techniques have applications for communications systems.
STAP is essentially filtering in the spacetime domain. This means that we are filtering over multiple dimensions, and multidimensional signal processing techniques must be employed. The goal is to find the optimal spacetime weights in
N
M
dimensional space, where
N
is the number of antenna elements (our spatial degrees of freedom) and
M
is the number of pulserepetition interval (PRI) taps (our time degrees of freedom), to maximize the signaltointerference and noise ratio (SINR). Thus, the goal is to suppress noise, clutter, jammers, etc., while keeping the desired radar return. It can be thought of as a 2D finiteimpulse response (FIR) filter, with a standard 1D FIR filter for each channel (steered spatial channels from an electronically steered array or individual elements), and the taps of these 1D FIR filters corresponding to multiple returns (spaced at PRI time). Having degrees of freedom in both the spatial domain and time domain is crucial, as clutter can be correlated in time and space, while jammers tend to be correlated spatially (along a specific bearing).
A simple, trivial example of STAP is shown in the first figure, for
N
=
M
=
10
. This is an idealized example of a steering pattern, where the response of the array has been steered to the ideal target response,
s
. Unfortunately, in practice, this is oversimplified, as the interference to be overcome by steering the nulls shown is not deterministic, but statistical in nature. This is what requires STAP to be an adaptive technique. Note that even in this idealized example, in general, we must steer over the 2D angleDoppler plane at discrete points to detect potential targets (moving the location of the 2D sinc main lobe shown in the figure), and do so for each of the range bins in our system.
The basic functional diagram is shown to the right. For each antenna, a down conversion and analogtodigital conversion step is typically completed. Then, a 1D FIR filter with PRI length delay elements is used for each steered antenna channel. The lexicographically ordered weights
W
1
to
W
N
M
are the degrees of freedom to be solved in the STAP problem. That is, STAP aims to find the optimal weights for the antenna array. It can be shown, that for a given
M
N
×
M
N
interference covariance matrix,
R
, the optimal weights maximizing the SINR are calculated as
W
=
κ
R
−
1
s
where
κ
is a scalar that does not affect the SINR. The optimal detector input is given by:
y
=
W
x
where
x
is a spacetime snapshot of the input data. The main difficulty of STAP is solving for and inverting the typically unknown interference covariance matrix,
R
. Other difficulties arise when the interference covariance matrix is illconditioned, making the inversion numerically unstable. In general, this adaptive filtering must be performed for each of the unambiguous range bins in the system, for each target of interest (angleDoppler coordinates), making for a massive computational burden. Steering losses can occur when true target returns do not fall exactly on one of the points in our 2D angleDoppler plane that we've sampled with our steering vector
s
.
The various approaches can be broken down by processing taxonomy, or by simplifying the data space / data sources.
The optimum solution is using all degrees of freedom by processing the adaptive filter on the antenna elements. For adaptive direct methods, Sample Matrix Inversion (SMI) uses the estimated (sample) interference covariance matrix in place of the actual interference covariance matrix. This is because the actual interference covariance matrix is not known in practice. If it is known by some means, then it need not be estimated, and the optimal weights are fixed. This is sometimes called the dataindependent variation. The datadependent variation estimates the interference covariance matrix from the data. In MIMO communications systems, this can be done via a training sequence. The clairvoyant detector is given when the covariance matrix is known perfectly and defined as:
R
k
=
E
[
x
k
x
k
H
]

H
0
where
x
k
is the spacetime snapshot statistic for the
k
t
h
range cell under the interference only hypothesis,
H
0
. For SMI, the interference covariance matrix for the
k
t
h
range cell consisting of the statistics from interfering noise, clutter, and jammers is estimated as follows:
R
k
^
=
1
P
∑
m
=
0
P
−
1
x
m
x
m
H
where
x
m
is the training data obtained from the input processor for the
m
t
h
range cell. Therefore, spacetime snapshots surrounding the desired range cell are averaged. Note that the desired range cell spacetime snapshot is typically excluded (as well as a number of additional cells, or "guard cells") to prevent whitening of the statistics.
The main problem with direct methods is the great computational complexity associated with the estimation and inversion of matrices formed from many degrees of freedom (large number of elements and or pulses). In addition, for methods where
R
must be estimated using data samples, the number of samples required to achieve a particular error is heavily dependent on the dimensionality of the interference covariance matrix. As a result, for high dimensional systems, this may require an unachievable number of unambiguous range cells. Further, these adjacent data cells must contain stationary statistics as a function of range which is rarely a good assumption for the large number of cells required (
2
N
M
for 3 dB SINR degradation from optimal, clairvoyant STAP).
Reduced rank methods aim to overcome the computational burdens of the direct method by reducing the dimensionality of the data or the rank of the interference covariance matrix. This can be accomplished by forming beams and performing STAP on the beamspace. Both pre and post Doppler methods can be used in the beamspace. Post Doppler methods may also be used on the full antenna element input as well to reduce the data in this dimension only. A popular example is displaced phase center antenna (DPCA), which is a form of dataindependent STAP in the beamspace, preDoppler. The goal is to perform beamforming such that the beam appears stationary as the airborne radar is in motion over discrete time periods so the clutter appears without Doppler. However, phase errors can cause significant degradation since the algorithm is not adaptive to the returned data. Many other methods may be used to reduce the rank of the interference covariance matrix, and so all methods in the reduced rank category can be thought of as simplifying the covariance matrix to be inverted:
R
⇒
R
~
PostDoppler methods decompose the STAP problem from an
M
N
×
M
N
adaptive filtering problem to
M
individual adaptive filters of length
N
(an
N
×
N
adaptive filter problem). By performing fixed Doppler processing, the adaptive filters become spatial only. Since the target response is already steered to a specified angleDoppler location, the dimensionality can be reduced by preprocessing multiple Doppler bins and angles surrounding this point. In addition to reducing the dimensionality of the adaptive processor, this in turn reduces the number of required training data frames when estimating the interference covariance matrix since this quantity is dimension dependent.
Since these methods reduce the data dimensionality, they are inherently suboptimal. There are a number of techniques to compare the performance of reducedrank methods and estimated direct methods to clairvoyant STAP (direct with perfect knowledge of interference covariance matrix and target steering vector), mostly based around SINR loss. One such example is
L
s
,
2
=
SINR

W
=
W
^
SINR

W
=
W
o
p
t
where we've taken the ratio of the SINR evaluated with the suboptimal weights
W
^
and the SINR evaluated with the optimal weights
W
o
p
t
. Note in general this quantity is statistical and the expectation must be taken to find the average SINR loss. The clairvoyant SINR loss may also be calculated by taking the ratio of the optimal SINR to the system SNR, indicating the loss due to interference.
There are also model based methods that attempt to force or exploit the structure of the covariance interference matrix. The more generally applicable of these methods is the covariance taper matrix structure. The goal is to compactly model the interference, at which point it can then be processed using principal component techniques or diagonalloading SMI (where a small magnitude, random diagonal matrix is added to attempt to stabilize the matrix prior to inverting). This modeling has an added benefit of decorrelating interference subspace leakage (ISL), and is resistant to internal clutter motion (ICM). The principal component method firsts applies principal component analysis to estimate the dominate eigenvalues and eigenvectors, and then applies a covariance taper and adds an estimated noise floor:
R
~
P
C
−
C
M
T
=
(
∑
m
=
0
P
−
1
λ
m
v
m
v
m
H
)
∘
T
+
σ
n
2
where
λ
m
is the
m
t
h
eigenvalue estimated using PCA,
v
m
is the associated
m
t
h
eigenvector estimated using PCA,
A
∘
B
implies elementbyelement multiplication of matrices
A
and
B
,
T
is the estimated covariance matrix taper, and
σ
n
2
is the estimated noise floor. The estimation of the covariance taper
T
can be complicated, depending on the complexity of the underlying model attempting to emulate the interference environment. The reader is encouraged to see for more information on this particular subject. Once this taper is sufficiently modeled, it may also be applied to the more simple SMI adaptation of CMT as follows:
R
~
S
M
I
−
C
M
T
=
R
~
S
M
I
∘
T
+
δ
I
where
R
~
S
M
I
is the typical SMI estimated matrix seen in the approximate direct method,
δ
is the diagonal loading factor, and
I
is the identity matrix of the appropriate size. It should be seen that this is meant to improve the standard SMI method where SMI uses a smaller number of range bins in its average than the standard SMI technique. Since fewer samples are used in the training data, the matrix often requires stabilization in the form of diagonal loading.
More restrictive examples involve modeling the interference to force Toeplitz structures, and can greatly reduce the computational complexity associated with the processing by exploiting this structure. However, these methods can suffer due to modelmismatch, or the computational savings may be undone by the problem of model fitting (such as the nonlinear problem of fitting to a Toeplitz or blockToeplitz matrix) and order estimation.
Despite nearly 40 years of existence, STAP has modern applications.
For dispersive channels, multipleinput multipleoutput communications can formulate STAP solutions. Frequencyselective channel compensation can be used to extend traditional equalization techniques for SISO systems using STAP. To estimate the transmitted signal
S
^
at a MIMO receiver, we can linearly weight our spacetime input
Z
~
with weighting matrix
W
~
as follows
S
^
=
W
~
T
Z
~
to minimize the mean squared error (MSE). Using STAP with a training sequence
S
~
, the estimated optimal weighting matrix (STAP coefficients) is given by:
W
~
≈
(
Z
~
Z
~
T
)
Z
~
S
~
T
STAP has been extended for MIMO radar to improve spatial resolution for clutter, using modified SIMO radar STAP techniques. New algorithms and formulations are required that depart from the standard technique due to the large rank of the jammerclutter subspace created by MIMO radar virtual arrays, which typically involving exploiting the block diagonal structure of the MIMO interference covariance matrix to break the large matrix inversion problem into smaller ones. In comparison with SIMO radar systems, which will have
M
transmit degrees of freedom, and
N
L
receive degrees of freedom, for a total of
M
+
N
L
, MIMO radar systems have
M
N
L
degrees of freedom, allowing for much greater adaptive spatial resolution for clutter mitigation.