Recent from talks
Nothing was collected or created yet.
Farkas' lemma
View on WikipediaIn mathematics, Farkas' lemma is a solvability theorem for a finite system of linear inequalities. It was originally proven by the Hungarian mathematician Gyula Farkas.[1] Farkas' lemma is the key result underpinning the linear programming duality and has played a central role in the development of mathematical optimization (alternatively, mathematical programming).[citation needed] Remarkably, in the area of the foundations of quantum theory, the lemma also underlies the complete set of Bell inequalities in the form of necessary and sufficient conditions for the existence of a local hidden-variable theory, given data from any specific set of measurements.[2]
Generalizations of the Farkas' lemma are about the solvability theorem for convex inequalities,[3] i.e., infinite system of linear inequalities. Farkas' lemma belongs to a class of statements called "theorems of the alternative": a theorem stating that exactly one of two systems has a solution.[4]
Statement of the lemma
[edit]There are a number of slightly different (but equivalent) formulations of the lemma in the literature. The one given here is due to Gale, Kuhn and Tucker (1951).[5]
Farkas' lemma— Let and Then exactly one of the following two assertions is true:
- There exists an such that and
- There exists a such that and
Here, the notation means that all components of the vector are nonnegative.
Example
[edit]Let m, n = 2, and The lemma says that exactly one of the following two statements must be true (depending on b1 and b2):
- There exist x1 ≥ 0, x2 ≥ 0 such that 6x1 + 4x2 = b1 and 3x1 = b2, or
- There exist y1, y2 such that 6y1 + 3y2 ≥ 0, 4y1 ≥ 0, and b1 y1 + b2 y2 < 0.
Here is a proof of the lemma in this special case:
- If b2 ≥ 0 and b1 − 2b2 ≥ 0, then option 1 is true, since the solution of the linear equations is and Option 2 is false, since so if the right-hand side is positive, the left-hand side must be positive too.
- Otherwise, option 1 is false, since the unique solution of the linear equations is not weakly positive. But in this case, option 2 is true:
- If b2 < 0, then we can take e.g. y1 = 0 and y2 = 1.
- If b1 − 2b2 < 0, then, for some number B > 0, b1 = 2b2 − B, so: Thus we can take, for example, y1 = 1, y2 = −2.
Geometric interpretation
[edit]Consider the closed convex cone spanned by the columns of A; that is,
Observe that is the set of the vectors b for which the first assertion in the statement of Farkas' lemma holds. On the other hand, the vector y in the second assertion is orthogonal to a hyperplane that separates b and The lemma follows from the observation that b belongs to if and only if there is no hyperplane that separates it from
More precisely, let denote the columns of A. In terms of these vectors, Farkas' lemma states that exactly one of the following two statements is true:
- There exist non-negative coefficients such that
- There exists a vector such that for and
The sums with nonnegative coefficients form the cone spanned by the columns of A. Therefore, the first statement tells that b belongs to
The second statement tells that there exists a vector y such that the angle of y with the vectors ai is at most 90°, while the angle of y with the vector b is more than 90°. The hyperplane normal to this vector has the vectors ai on one side and the vector b on the other side. Hence, this hyperplane separates the cone spanned by from the vector b.
For example, let n, m = 2, a1 = (1, 0)T, and a2 = (1, 1)T. The convex cone spanned by a1 and a2 can be seen as a wedge-shaped slice of the first quadrant in the xy plane. Now, suppose b = (0, 1). Certainly, b is not in the convex cone a1x1 + a2x2. Hence, there must be a separating hyperplane. Let y = (1, −1)T. We can see that a1 · y = 1, a2 · y = 0, and b · y = −1. Hence, the hyperplane with normal y indeed separates the convex cone a1x1 + a2x2 from b.
Logic interpretation
[edit]A particularly suggestive and easy-to-remember version is the following: if a set of linear inequalities has no solution, then a contradiction can be produced from it by linear combination with nonnegative coefficients. In formulas: if is unsolvable then has a solution.[6] Note that is a combination of the left-hand sides, a combination of the right-hand side of the inequalities. Since the positive combination produces a zero vector on the left and a −1 on the right, the contradiction is apparent.
Thus, Farkas' lemma can be viewed as a theorem of logical completeness: is a set of "axioms", the linear combinations are the "derivation rules", and the lemma says that, if the set of axioms is inconsistent, then it can be refuted using the derivation rules.[7]: 92–94
Implications in complexity theory
[edit]Farkas' lemma implies that the decision problem "Given a system of linear equations, does it have a non-negative solution?" is in the intersection of NP and co-NP. This is because, according to the lemma, both a "yes" answer and a "no" answer have a proof that can be verified in polynomial time. The problems in the intersection are also called well-characterized problems. It is a long-standing open question whether is equal to P. In particular, the question of whether a system of linear equations has a non-negative solution was not known to be in P, until it was proved using the ellipsoid method.[8]: 25
Variants
[edit]The Farkas Lemma has several variants with different sign constraints (the first one is the original version):[7]: 92
- Either has a solution or has a solution with
- Either has a solution or has a solution with
- Either has a solution or has a solution with .
- Either has a solution or has a solution with
The latter variant is mentioned for completeness; it is not actually a "Farkas lemma" since it contains only equalities. Its proof is an exercise in linear algebra.
There are also Farkas-like lemmas for integer programs.[8]: 12--14 For systems of equations, the lemma is simple:
- Assume that A and b have rational coefficients. Then either has an integral solution or there exists such that is integral and is not integral.
For system of inequalities, the lemma is much more complicated. It is based on the following two rules of inference:
- Given inequalities and coefficients , infer the inequality .
- Given an inequality , infer the inequality .
The lemma says that:
- Assume that A and b have rational coefficients. Then either has an integral solution , or it is possible to infer from using finitely many applications of inference rules 1,2 the inequality .
The variants are summarized in the table below.
| System | Constraints on x | Alternative system | Constraints on y | |
|---|---|---|---|---|
| , | ||||
| , | , | |||
| , | , | |||
| , | ||||
| integral, not integral | ||||
| can be inferred from |
Generalizations
[edit]Generalized Farkas' lemma— Let is a closed convex cone in and the dual cone of is If convex cone is closed, then exactly one of the following two statements is true:
- There exists an such that and
- There exists a such that and
Generalized Farkas' lemma can be interpreted geometrically as follows: either a vector is in a given closed convex cone, or there exists a hyperplane separating the vector from the cone; there are no other possibilities. The closedness condition is necessary, see Separation theorem I in Hyperplane separation theorem. For original Farkas' lemma, is the nonnegative orthant hence the closedness condition holds automatically. Indeed, for polyhedral convex cone, i.e., there exists a such that the closedness condition holds automatically. In convex optimization, various kinds of constraint qualification, e.g. Slater's condition, are responsible for closedness of the underlying convex cone
By setting and in generalized Farkas' lemma, we obtain the following corollary about the solvability for a finite system of linear equalities:
Corollary— Let and Then exactly one of the following two statements is true:
- There exists an such that
- There exists a such that and
Further implications
[edit]Farkas' lemma can be varied to many further theorems of alternative by simple modifications,[4] such as Gordan's theorem: Either has a solution x, or has a nonzero solution y with y ≥ 0.
Common applications of Farkas' lemma include proving the strong duality theorem associated with linear programming and the Karush–Kuhn–Tucker conditions. An extension of Farkas' lemma can be used to analyze the strong duality conditions for and construct the dual of a semidefinite program. It is sufficient to prove the existence of the Karush–Kuhn–Tucker conditions using the Fredholm alternative but for the condition to be necessary, one must apply von Neumann's minimax theorem to show the equations derived by Cauchy are not violated.
This is used for Dill's Reluplex method for verifying deep neural networks.
See also
[edit]- Dual linear program
- Fourier–Motzkin elimination – can be used to prove Farkas' lemma.
Notes
[edit]- ^ Farkas, Julius (Gyula) (1902), "Theorie der Einfachen Ungleichungen", Journal für die reine und angewandte Mathematik, 1902 (124): 1–27, doi:10.1515/crll.1902.124.1, S2CID 115770124
- ^ Garg, Anupam; Mermin, N. D. (1984), "Farkas's Lemma and the Nature of Reality: Statistical Implications of Quantum Correlations", Foundations of Physics, 14: 1–39, doi:10.1007/BF00741645, S2CID 123622613
- ^ Dinh, N.; Jeyakumar, V. (2014), "Farkas' lemma three decades of generalizations for mathematical optimization", TOP, 22 (1): 1–22, doi:10.1007/s11750-014-0319-y, S2CID 119858980
- ^ a b Border, KC (2013). "Alternative Linear Inequalities" (PDF). Retrieved 2021-11-29.
- ^ Gale, David; Kuhn, Harold; Tucker, Albert W. (1951), "Linear Programming and the Theory of Games - Chapter XII" (PDF), in Koopmans (ed.), Activity Analysis of Production and Allocation, Wiley. See Lemma 1 on page 318.
- ^ Boyd, Stephen P.; Vandenberghe, Lieven (2004), "Section 5.8.3" (pdf), Convex Optimization, Cambridge University Press, ISBN 978-0-521-83378-3, retrieved October 15, 2011.
- ^ a b Gärtner, Bernd; Matoušek, Jiří (2006). Understanding and Using Linear Programming. Berlin: Springer. ISBN 3-540-30697-8. Pages 81–104.
- ^ a b Grötschel, Martin; Lovász, László; Schrijver, Alexander (1993), Geometric algorithms and combinatorial optimization, Algorithms and Combinatorics, vol. 2 (2nd ed.), Springer-Verlag, Berlin, doi:10.1007/978-3-642-78240-4, ISBN 978-3-642-78242-8, MR 1261419
Further reading
[edit]- Goldman, A. J.; Tucker, A. W. (1956). "Polyhedral Convex Cones". In Kuhn, H. W.; Tucker, A. W. (eds.). Linear Inequalities and Related Systems. Princeton: Princeton University Press. pp. 19–40. ISBN 0691079994.
{{citation}}: ISBN / Date incompatibility (help) - Rockafellar, R. T. (1979). Convex Analysis. Princeton University Press. p. 200.
- Kutateladze S.S. The Farkas lemma revisited. Siberian Mathematical Journal, 2010, Vol. 51, No. 1, 78–87.
Farkas' lemma
View on GrokipediaIntroduction and Statement
Historical Background
Gyula Farkas (1847–1930) was a Hungarian mathematician and physicist whose work bridged analytical mechanics, thermodynamics, and linear algebra. Born on March 28, 1847, in Sárosd, Fejér County, he initially studied law and philosophy in Pest before turning to engineering and natural sciences, where he studied physics and chemistry and earning a Ph.D. in mathematics. Farkas held teaching positions, including as a dozent in function theory at the University of Budapest in 1880, and later became an ordinary professor of mathematics and physics at the University of Kolozsvár (now Cluj-Napoca, Romania), where he served as dean and rector. His career also involved practical engineering, such as hydraulic projects, and he retired in 1915 due to deteriorating eyesight, spending his later years in Budapest.[8] Farkas' lemma originated in his efforts to formalize principles of mechanical equilibrium under inequality constraints, building on 19th-century foundations in statics. The result first appeared in his 1902 paper "Theorie der einfachen Ungleichungen," published in the Journal für die reine und angewandte Mathematik (volume 124, pages 1–27), where he proved a key theorem on the consistency of finite systems of linear inequalities. This work was directly inspired by Joseph Fourier's 1826 paper "Solution d’une question particulière du calcul des inégalités," which introduced methods for handling linear inequalities in the context of mechanics and probability, though Fourier's treatment remained incomplete and algorithmic rather than theorem-based. Earlier 19th-century contributions, such as those by Cauchy and others on polyhedral geometry, also influenced the evolving understanding of feasible regions defined by inequalities.[9][10][11] Initially applied to problems in analytical mechanics and statics—such as decomposing reaction forces in rigid body systems or ensuring equilibrium conditions—Farkas' theorem provided a mathematical foundation for inequality-based stability analysis that extended Fourier's qualitative principles. For instance, it enabled precise derivations of necessary and sufficient conditions for mechanical systems with friction or unilateral constraints. The lemma's broader significance emerged post-1940s with the rise of optimization theory; it became central to linear programming duality after George Dantzig's 1947 simplex method and John von Neumann's recognition of its role in proving strong duality theorems, marking its transition from mechanics to operations research.[4][12][13]Formal Statement
Farkas' lemma provides a fundamental condition for the solvability of systems of linear equations with non-negativity constraints. Consider an matrix and a vector . The system , has a solution if and only if for every satisfying , it holds that . Here, means that the vector belongs to the non-negative orthant , defined as , and similarly for vectors in . The transpose is the matrix obtained by interchanging rows and columns of . This formulation ensures the consistency of the system, meaning it has at least one feasible solution in the non-negative orthant.[14][9] Equivalently, the two alternatives are mutually exclusive and exhaustive: either there exists with and , or there exists with and , but not both. The mutual exclusivity follows from the fact that if both held, then (since and ), contradicting . Exhaustiveness is established via the separating hyperplane theorem applied to the convex cone generated by the columns of .[14][5] An alternative equivalent form concerns systems of linear inequalities without non-negativity constraints on the variables. The system (with unrestricted) has no solution if and only if there exists such that and . In this context, again refers to the non-negative orthant , and the equality indicates that lies in the orthogonal complement of the range of . This version characterizes inconsistency by the existence of a non-trivial non-negative certificate violating the right-hand side. The alternatives are also mutually exclusive and exhaustive, with proofs relying on duality in linear programming or Farkas' lemma in its primary form.[14][15]Interpretations
Geometric Interpretation
Farkas' lemma provides a geometric perspective on the feasibility of the system , , where is an matrix and . The feasible set is a polyhedron, defined as the intersection of the half-spaces corresponding to the inequalities. This polyhedron is nonempty precisely when lies in the Minkowski sum of the convex cone generated by the columns of and the nonnegative orthant, that is, .[14] The lemma asserts that the system is inconsistent if and only if there exists satisfying and . Geometrically, such a serves as the normal vector to a separating hyperplane , which strictly separates from the convex set : , while for all , since for and for .[5][14] This certificate lies in the dual cone of , given by , highlighting the role of cone duality in certifying infeasibility. In terms of recession cones, the recession cone of the polyhedron is ; inconsistency occurs when no recession direction or bounded shift allows the polyhedron to contain a point, with the separating hyperplane providing the geometric proof of emptiness.[16][17] Farkas' lemma is a finite-dimensional specialization of Minkowski's separating hyperplane theorem, which states that for two disjoint nonempty convex sets in Euclidean space, with at least one compact, there exists a hyperplane strictly separating them; here, the polyhedral structure of and the singleton ensures the existence of the linear certificate . The lemma further connects to the duality of polyhedral cones, where the dual cone of plays a central role: the alternative condition equates to being separated from the cone by an element of the dual, underscoring the self-duality of polyhedral cones under Farkas' framework.[18][16]Logical Interpretation
Farkas' lemma serves as a completeness result for the first-order theory of linear arithmetic over the real numbers, asserting that a system of linear inequalities (with and ) is inconsistent if and only if there exists a vector with , , and . This vector constitutes a Farkas certificate, a refuting witness that certifies the unsatisfiability of the system by deriving a strict contradiction through a non-negative linear combination of the inequalities. Over the rationals, the lemma is complete and decidable in polynomial time via linear programming techniques, such as the simplex method, which generate such certificates efficiently. In contrast, over the integers, the lemma is incomplete due to the discrete nature of the domain, as some systems solvable over the rationals lack integer solutions.[19] The lemma's statement embodies a proof by cases in first-order logic with linear arithmetic, establishing an exhaustive disjunction: either the primal system has a solution, or the dual certificate system admits a non-negative refutation. This dichotomy ensures that every formula in the theory—expressed as a quantifier-free conjunction of linear inequalities—is either satisfiable or has a short refuting proof via the Farkas certificate, mirroring completeness in proof systems where every inconsistency yields a verifiable derivation. The two alternatives are logically complementary, as their simultaneous satisfaction leads to a contradiction: if solves the primal and the dual, then , contradicting .[20] Farkas' lemma contributes to the decidability of linear arithmetic over the reals, providing a foundational tool for algorithms that resolve satisfiability of linear inequalities, in line with Hilbert's program on determining the solvability of mathematical problems. While Hilbert's tenth problem highlighted the undecidability of general Diophantine equations, the linear case over ordered fields like the reals or rationals is decidable, with the lemma enabling certificate-based verification of infeasibility to support automated theorem proving.[19] The lemma extends to quantifier elimination procedures for linear real arithmetic by facilitating the certification of emptiness in projected solution sets. In algorithms like Fourier-Motzkin elimination or SMT solvers, Farkas' lemma is invoked to check unsatisfiability after variable elimination: if a quantified system reduces to an inconsistent quantifier-free formula, the certificate confirms the non-existence of solutions, allowing refinement of the propositional skeleton. This integration supports efficient decision procedures, such as those combining SAT solving with linear programming, where Farkas certificates propagate contradictions back through quantifiers.[21]Examples and Illustrations
Simple Example
To illustrate Farkas' lemma, consider the system of linear inequalities in , where This yields the inequalities and .Adding these inequalities produces , which is impossible, confirming that no solution exists. According to Farkas' lemma, since the system is inconsistent, there must exist a certificate such that and . One such vector is .
Verify the conditions: , and holds componentwise. Moreover, . This certificate proves inconsistency because if a solution existed, then would imply , or , i.e., , a contradiction. Thus, represents nonnegative weights whose linear combination of the inequalities yields the absurd strict inequality . In contrast, adjust to . The system now has a solution, such as , since and . No such certificate exists in this feasible case, aligning with the lemma's alternatives.
