Fourier–Motzkin elimination - Alchetron, the free social encyclopedia

Fourier–Motzkin elimination, also known as the FME method, is a mathematical algorithm for eliminating variables from a system of linear inequalities. It can output real solutions.

Elimination

The elimination of a set of variables, say V, from a system of relations (here linear inequalities) refers to the creation of another system of the same sort, but without the variables in V, such that both systems have the same solutions over the remaining variables.

If all variables are eliminated from a system of linear inequalities, then one obtains a system of constant inequalities. It is then trivial to decide whether the resulting system is true or false. It is true if and only if the original system has solutions. As a consequence, elimination of all variables can be used to detect whether a system of inequalities has solutions or not.

Consider a system S of n inequalities with r variables x 1 to x r , with x r the variable to be eliminated. The linear inequalities in the system can be grouped into three classes depending on the sign (positive, negative or null) of the coefficient for x r .

those inequalities that are of the form x r ≥ b i − ∑ k = 1 r − 1 a i k x k ; denote these by x r ≥ A j ( x 1 , … , x r − 1 ) , for j ranging from 1 to n A where n A is the number of such inequalities;

those inequalities that are of the form x r ≤ b i − ∑ k = 1 r − 1 a i k x k ; denote these by x r ≤ B j ( x 1 , … , x r − 1 ) , for j ranging from 1 to n B where n B is the number of such inequalities;

those inequalities in which x r plays no role, grouped into a single conjunction ϕ .

The original system is thus equivalent to

max ( A 1 ( x 1 , … , x r − 1 ) , … , A n A ( x 1 , … , x r − 1 ) ) ≤ x r ≤ min ( B 1 ( x 1 , … , x r − 1 ) , … , B n B ( x 1 , … , x r − 1 ) ) ∧ ϕ .

Elimination consists in producing a system equivalent to ∃ x r S . Obviously, this formula is equivalent to

max ( A 1 ( x 1 , … , x r − 1 ) , … , A n A ( x 1 , … , x r − 1 ) ) ≤ min ( B 1 ( x 1 , … , x r − 1 ) , … , B n B ( x 1 , … , x r − 1 ) ) ∧ ϕ .

The inequality

max ( A 1 ( x 1 , … , x r − 1 ) , … , A n A ( x 1 , … , x r − 1 ) ) ≤ min ( B 1 ( x 1 , … , x r − 1 ) , … , B n B ( x 1 , … , x r − 1 ) )

is equivalent to n A n B inequalities A i ( x 1 , … , x r − 1 ) ≤ B j ( x 1 , … , x r − 1 ) , for 1 ≤ i ≤ n A and 1 ≤ j ≤ n B .

We have therefore transformed the original system into another system where x r is eliminated. Note that the output system has ( n − n A − n B ) + n A n B inequalities. In particular, if n A = n B = n / 2 , then the number of output inequalities is n 2 / 4 .

Complexity

Running an elimination step over n inequalities can result in at most n 2 / 4 inequalities in the output, thus running d successive steps can result in at most 4 ( n / 4 ) 2 d , a double exponential complexity. This is due to the algorithm producing many unnecessary constraints (constraints that are implied by other constraints). The number of necessary constraints grows as a single exponential. Unnecessary constraints may be detected using linear programming.

Imbert's acceleration theorems

Two "acceleration" theorems due to Imbert permit the elimination of redundant inequalities based solely on syntactic properties of the formula derivation tree, thus curtailing the need to solve linear programs or compute matrix ranks.

Define the history H i of an inequality i as the set of indexes of inequalities from the initial system S used to produce i . Thus, H i = { i } for inequalities i ∈ S of the initial system. When adding a new inequality k : A i ( x 1 , … , x r − 1 ) ≤ B j ( x 1 , … , x r − 1 ) (by eliminating x r ), the new history H k is constructed as H k = H i ∪ H j .

Suppose that the variables O k = { x r , … , x r − k + 1 } have been eliminated. Each inequality i partitions the set O k into:

E i , the set of effectively eliminated variables. A variable x j is in the set as soon as at least of inequality in the history H i of i results from the elimination of x_j.

I i , the set of implicitly eliminated variables. A variable is implicitly eliminated when it appears in at least one inequality of H i , but appears neither in i nor E i

all remaining variables.

A non-redundant inequality has the property that its history is minimal.

Theorem (Imbert's first acceleration theorem). If the history H i of an inequality i is minimal, then 1 + | E i | ≤ | H i | ≤ 1 + | E i ∪ ( I i ∩ O k ) | .

An inequality that does not satisfy these bounds is necessarily redundant, and can be removed from the system without changing its solution set.

The second acceleration theorem detects minimal history sets:

Theorem (Imbert's second acceleration theorem). If the inequality i is such that 1 + | E i | = | H i | , then H i is minimal.

This theorem provides a quick detection criterion and is used in practice to avoid more costly checks, such as those based on matrix ranks. See the reference for implementation details.

Applications in Information Theory

Information-theoretic achievability proofs result in conditions under which the existence of a well-preforming coding scheme is guaranteed. These conditions are often described by linear system of inequalities. The variables of the system include both the transmission rates (that are part of the problem's formulation) and additional auxiliary rates used for the design of the scheme. Commonly, one aims to describe the fundamental limits of communication in terms of the problem's parameters only. This gives rise to the need of eliminating the aforementioned auxiliary rates, which is executed via Fourier–Motzkin elimination. However, the elimination process results in a new system that possibly contains more inequalities than the original. Yet, often some of the inequalities in the reduced system are redundant. Redundancy may be implied by other inequalities or by inequalities in information theory (a.k.a. Shannon type inequalities). A recently developed open-source software for MATLAB performs the elimination, while identifying and removing redundant inequalities. Consequently, the software's outputs a simplified system (without redundancies) that involves the communication rates only.

Redundant constraint can be identified by solving a linear program as follows. Given a linear constraints system, if the i -th inequality is satisfied for any solution of all other inequalities, then it is redundant. Similarly, STIs refers to inequalities that are implied by the non-negativity of information theoretic measures and basic identities they satisfy. For instance, the STI I ( X 1 ; X 2 ) ≤ H ( X 1 ) is a consequence of the identity I ( X 1 ; X 2 ) = H ( X 1 ) − H ( X 1 | X 2 ) and the non-negativity of conditional entropy, i.e., H ( X 1 | X 2 ) ≥ 0 . Shannon-type inequalities define a cone in R 2 n − 1 , where n is the number of random variables appearing in the involved information measures. Consequently, any STI can be proven via linear programming by checking if it is implied by the basic identities and non-negativity constraints. The described algorithm first preforms Fourier–Motzkin elimination to remove the auxiliary rates. Then, it imposes the information theoretic non-negativity constraints on the reduced output system and removes redundant inequalities.

References

Fourier–Motzkin elimination Wikipedia

(Text) CC BY-SA

Contents

Elimination

Complexity

Imbert's acceleration theorems

Applications in Information Theory

References