Hubbry Logo
Model theoryModel theoryMain
Open search
Model theory
Community hub
Model theory
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Model theory
Model theory
from Wikipedia

In mathematical logic, model theory is the study of the relationship between formal theories (a collection of sentences in a formal language expressing statements about a mathematical structure), and their models (those structures in which the statements of the theory hold).[1] The aspects investigated include the number and size of models of a theory, the relationship of different models to each other, and their interaction with the formal language itself. In particular, model theorists also investigate the sets that can be defined in a model of a theory, and the relationship of such definable sets to each other. As a separate discipline, model theory goes back to Alfred Tarski, who first used the term "Theory of Models" in publication in 1954.[2] Since the 1970s, the subject has been shaped decisively by Saharon Shelah's stability theory.

Compared to other areas of mathematical logic such as proof theory, model theory is often less concerned with formal rigour and closer in spirit to classical mathematics. This has prompted the comment that "if proof theory is about the sacred, then model theory is about the profane".[3] The applications of model theory to algebraic and Diophantine geometry reflect this proximity to classical mathematics, as they often involve an integration of algebraic and model-theoretic results and techniques. Consequently, proof theory is syntactic in nature, in contrast to model theory, which is semantic in nature.

The most prominent scholarly organization in the field of model theory is the Association for Symbolic Logic.

Overview

[edit]

This page focuses on finitary first order model theory of infinite structures.

The relative emphasis placed on the class of models of a theory as opposed to the class of definable sets within a model fluctuated in the history of the subject, and the two directions are summarised by the pithy characterisations from 1973 and 1997 respectively:

model theory = universal algebra + logic[1]

where universal algebra stands for mathematical structures and logic for logical theories; and

model theory = algebraic geometryfields.

where logical formulas are to definable sets what equations are to varieties over a field.[4]

Nonetheless, the interplay of classes of models and the sets definable in them has been crucial to the development of model theory throughout its history. For instance, while stability was originally introduced to classify theories by their numbers of models in a given cardinality, stability theory proved crucial to understanding the geometry of definable sets.

Fundamental notions of first-order model theory

[edit]

First-order logic

[edit]

A first-order formula is built out of atomic formulas such as or by means of the Boolean connectives and prefixing of quantifiers or . A sentence is a formula in which each occurrence of a variable is in the scope of a corresponding quantifier. Examples for formulas are (or to indicate is the unbound variable in ) and (or ), defined as follows:

(Note that the equality symbol has a double meaning here.) It is intuitively clear how to translate such formulas into mathematical meaning. In the semiring of natural numbers , viewed as a structure with binary functions for addition and multiplication and constants for 0 and 1 of the natural numbers, for example, an element satisfies the formula if and only if is a prime number. The formula similarly defines irreducibility. Tarski gave a rigorous definition, sometimes called "Tarski's definition of truth", for the satisfaction relation , so that one easily proves:

is a prime number.
is irreducible.

A set of sentences is called a (first-order) theory, which takes the sentences in the set as its axioms. A theory is satisfiable if it has a model , i.e. a structure (of the appropriate signature) which satisfies all the sentences in the set . A complete theory is a theory that contains every sentence or its negation. The complete theory of all sentences satisfied by a structure is also called the theory of that structure.

It's a consequence of Gödel's completeness theorem (not to be confused with his incompleteness theorems) that a theory has a model if and only if it is consistent, i.e. no contradiction is proved by the theory. Therefore, model theorists often use "consistent" as a synonym for "satisfiable".

Basic model-theoretic concepts

[edit]

A signature or language is a set of non-logical symbols such that each symbol is either a constant symbol, or a function or relation symbol with a specified arity. Note that in some literature, constant symbols are considered as function symbols with zero arity, and hence are omitted. A structure is a set together with interpretations of each of the symbols of the signature as relations and functions on (not to be confused with the formal notion of an "interpretation" of one structure in another).

Example: A common signature for ordered rings is , where and are 0-ary function symbols (also known as constant symbols), and are binary (= 2-ary) function symbols, is a unary (= 1-ary) function symbol, and is a binary relation symbol. Then, when these symbols are interpreted to correspond with their usual meaning on (so that e.g. is a function from to and is a subset of ), one obtains a structure .

A structure is said to model a set of first-order sentences in the given language if each sentence in is true in with respect to the interpretation of the signature previously specified for . (Again, not to be confused with the formal notion of an "interpretation" of one structure in another) A model of is a structure that models .

A substructure of a σ-structure is a subset of its domain, closed under all functions in its signature σ, which is regarded as a σ-structure by restricting all functions and relations in σ to the subset. This generalises the analogous concepts from algebra; for instance, a subgroup is a substructure in the signature with multiplication and inverse.

A substructure is said to be elementary if for any first-order formula and any elements a1, ..., an of ,

if and only if .

In particular, if is a sentence and an elementary substructure of , then if and only if . Thus, an elementary substructure is a model of a theory exactly when the superstructure is a model.

Example: While the field of algebraic numbers is an elementary substructure of the field of complex numbers , the rational field is not, as we can express "There is a square root of 2" as a first-order sentence satisfied by but not by .

An embedding of a σ-structure into another σ-structure is a map f: AB between the domains which can be written as an isomorphism of with a substructure of . If it can be written as an isomorphism with an elementary substructure, it is called an elementary embedding. Every embedding is an injective homomorphism, but the converse holds only if the signature contains no relation symbols, such as in groups or fields.

A field or a vector space can be regarded as a (commutative) group by simply ignoring some of its structure. The corresponding notion in model theory is that of a reduct of a structure to a subset of the original signature. The opposite relation is called an expansion - e.g. the (additive) group of the rational numbers, regarded as a structure in the signature {+,0} can be expanded to a field with the signature {×,+,1,0} or to an ordered group with the signature {+,0,<}.

Similarly, if σ' is a signature that extends another signature σ, then a complete σ'-theory can be restricted to σ by intersecting the set of its sentences with the set of σ-formulas. Conversely, a complete σ-theory can be regarded as a σ'-theory, and one can extend it (in more than one way) to a complete σ'-theory. The terms reduct and expansion are sometimes applied to this relation as well.

Compactness and the Löwenheim–Skolem theorem

[edit]

The compactness theorem states that a set of sentences S is satisfiable if every finite subset of S is satisfiable. The analogous statement with consistent instead of satisfiable is trivial, since every proof can have only a finite number of antecedents used in the proof. The completeness theorem allows us to transfer this to satisfiability. However, there are also several direct (semantic) proofs of the compactness theorem. As a corollary (i.e., its contrapositive), the compactness theorem says that every unsatisfiable first-order theory has a finite unsatisfiable subset. This theorem is of central importance in model theory, where the words "by compactness" are commonplace.[5]

Another cornerstone of first-order model theory is the Löwenheim–Skolem theorem. According to the theorem, every infinite structure in a countable signature has a countable elementary substructure. Conversely, for any infinite cardinal κ every infinite structure in a countable signature that is of cardinality less than κ can be elementarily embedded in another structure of cardinality κ (There is a straightforward generalisation to uncountable signatures). In particular, the Löwenheim-Skolem theorem implies that any theory in a countable signature with infinite models has a countable model as well as arbitrarily large models.[6]

In a certain sense made precise by Lindström's theorem, first-order logic is the most expressive logic for which both the Löwenheim–Skolem theorem and the compactness theorem hold.[7]

Definability

[edit]

Definable sets

[edit]

In model theory, definable sets are important objects of study. For instance, in the formula

defines the subset of prime numbers, while the formula

defines the subset of even numbers. In a similar way, formulas with n free variables define subsets of . For example, in a field, the formula

defines the curve of all such that .

Both of the definitions mentioned here are parameter-free, that is, the defining formulas don't mention any fixed domain elements. However, one can also consider definitions with parameters from the model. For instance, in , the formula

uses the parameter from to define a curve.[8]

Eliminating quantifiers

[edit]

In general, definable sets without quantifiers are easy to describe, while definable sets involving possibly nested quantifiers can be much more complicated.[9]

This makes quantifier elimination a crucial tool for analysing definable sets: A theory T has quantifier elimination if every first-order formula φ(x1, ..., xn) over its signature is equivalent modulo T to a first-order formula ψ(x1, ..., xn) without quantifiers, i.e. holds in all models of T.[10] If the theory of a structure has quantifier elimination, every set definable in a structure is definable by a quantifier-free formula over the same parameters as the original definition. For example, the theory of algebraically closed fields in the signature σring = (×,+,−,0,1) has quantifier elimination.[11] This means that in an algebraically closed field, every formula is equivalent to a Boolean combination of equations between polynomials.

If a theory does not have quantifier elimination, one can add additional symbols to its signature so that it does. Axiomatisability and quantifier elimination results for specific theories, especially in algebra, were among the early landmark results of model theory.[12] But often instead of quantifier elimination a weaker property suffices:

A theory T is called model-complete if every substructure of a model of T which is itself a model of T is an elementary substructure. There is a useful criterion for testing whether a substructure is an elementary substructure, called the Tarski–Vaught test.[6] It follows from this criterion that a theory T is model-complete if and only if every first-order formula φ(x1, ..., xn) over its signature is equivalent modulo T to an existential first-order formula, i.e. a formula of the following form:

,

where ψ is quantifier free. A theory that is not model-complete may have a model completion, which is a related model-complete theory that is not, in general, an extension of the original theory. A more general notion is that of a model companion.[13]

Minimality

[edit]

In every structure, every finite subset is definable with parameters: Simply use the formula

.

Since we can negate this formula, every cofinite subset (which includes all but finitely many elements of the domain) is also always definable.

This leads to the concept of a minimal structure. A structure is called minimal if every subset definable with parameters from is either finite or cofinite. The corresponding concept at the level of theories is called strong minimality: A theory T is called strongly minimal if every model of T is minimal. A structure is called strongly minimal if the theory of that structure is strongly minimal. Equivalently, a structure is strongly minimal if every elementary extension is minimal. Since the theory of algebraically closed fields has quantifier elimination, every definable subset of an algebraically closed field is definable by a quantifier-free formula in one variable. Quantifier-free formulas in one variable express Boolean combinations of polynomial equations in one variable, and since a nontrivial polynomial equation in one variable has only a finite number of solutions, the theory of algebraically closed fields is strongly minimal.[14]

On the other hand, the field of real numbers is not minimal: Consider, for instance, the definable set

.

This defines the subset of non-negative real numbers, which is neither finite nor cofinite. One can in fact use to define arbitrary intervals on the real number line. It turns out that these suffice to represent every definable subset of .[15] This generalisation of minimality has been very useful in the model theory of ordered structures. A densely totally ordered structure in a signature including a symbol for the order relation is called o-minimal if every subset definable with parameters from is a finite union of points and intervals.[16]

Definable and interpretable structures

[edit]

Particularly important are those definable sets that are also substructures, i. e. contain all constants and are closed under function application. For instance, one can study the definable subgroups of a certain group. However, there is no need to limit oneself to substructures in the same signature. Since formulas with n free variables define subsets of , n-ary relations can also be definable. Functions are definable if the function graph is a definable relation, and constants are definable if there is a formula such that a is the only element of such that is true. In this way, one can study definable groups and fields in general structures, for instance, which has been important in geometric stability theory.

One can even go one step further, and move beyond immediate substructures. Given a mathematical structure, there are very often associated structures which can be constructed as a quotient of part of the original structure via an equivalence relation. An important example is a quotient group of a group. One might say that to understand the full structure one must understand these quotients. When the equivalence relation is definable, we can give the previous sentence a precise meaning. We say that these structures are interpretable. A key fact is that one can translate sentences from the language of the interpreted structures to the language of the original structure. Thus one can show that if a structure interprets another whose theory is undecidable, then itself is undecidable.[17]

Types

[edit]

Basic notions

[edit]

For a sequence of elements of a structure and a subset A of , one can consider the set of all first-order formulas with parameters in A that are satisfied by . This is called the complete (n-)type realised by over A. If there is an automorphism of that is constant on A and sends to respectively, then and realise the same complete type over A.

The real number line , viewed as a structure with only the order relation {<}, will serve as a running example in this section. Every element satisfies the same 1-type over the empty set. This is clear since any two real numbers a and b are connected by the order automorphism that shifts all numbers by b-a. The complete 2-type over the empty set realised by a pair of numbers depends on their order: either , or . Over the subset of integers, the 1-type of a non-integer real number a depends on its value rounded down to the nearest integer.

More generally, whenever is a structure and A a subset of , a (partial) n-type over A is a set of formulas p with at most n free variables that are realised in an elementary extension of . If p contains every such formula or its negation, then p is complete. The set of complete n-types over A is often written as . If A is the empty set, then the type space only depends on the theory of . The notation is commonly used for the set of types over the empty set consistent with . If there is a single formula such that the theory of implies for every formula in p, then p is called isolated.

Since the real numbers are Archimedean, there is no real number larger than every integer. However, a compactness argument shows that there is an elementary extension of the real number line in which there is an element larger than any integer. Therefore, the set of formulas is a 1-type over that is not realised in the real number line .

A subset of that can be expressed as exactly those elements of realising a certain type over A is called type-definable over A. For an algebraic example, suppose is an algebraically closed field. The theory has quantifier elimination . This allows us to show that a type is determined exactly by the polynomial equations it contains. Thus the set of complete -types over a subfield corresponds to the set of prime ideals of the polynomial ring , and the type-definable sets are exactly the affine varieties.[18]

Structures and types

[edit]

While not every type is realised in every structure, every structure realises its isolated types. If the only types over the empty set that are realised in a structure are the isolated types, then the structure is called atomic.

On the other hand, no structure realises every type over every parameter set; if one takes all of as the parameter set, then every 1-type over realised in is isolated by a formula of the form a = x for an . However, any proper elementary extension of contains an element that is not in . Therefore, a weaker notion has been introduced that captures the idea of a structure realising all types it could be expected to realise. A structure is called saturated if it realises every type over a parameter set that is of smaller cardinality than itself.

While an automorphism that is constant on A will always preserve types over A, it is generally not true that any two sequences and that satisfy the same type over A can be mapped to each other by such an automorphism. A structure in which this converse does hold for all A of smaller cardinality than is called (strongly) homogeneous.

The real number line is atomic in the language that contains only the order , since all n-types over the empty set realised by in are isolated by the order relations between the . It is not saturated, however, since it does not realise any 1-type over the countable set that implies x to be larger than any integer. The rational number line is saturated, in contrast, since is itself countable and therefore only has to realise types over finite subsets to be saturated.[19]

Stone spaces

[edit]

The set of definable subsets of over some parameters is a Boolean algebra. By Stone's representation theorem for Boolean algebras there is a natural dual topological space, which consists exactly of the complete -types over . The topology generated by sets of the form for single formulas . This is called the Stone space of n-types over A.[20] This topology explains some of the terminology used in model theory: The compactness theorem says that the Stone space is a compact topological space, and a type p is isolated if and only if p is an isolated point in the Stone topology.

While types in algebraically closed fields correspond to the spectrum of the polynomial ring, the topology on the type space is the constructible topology: a set of types is basic open iff it is of the form or of the form . This is finer than the Zariski topology.[21]

Constructing models

[edit]

Realising and omitting types

[edit]

Constructing models that realise certain types and do not realise others is an important task in model theory. Not realising a type is referred to as omitting it, and is generally possible by the (Countable) Omitting types theorem:

Let be a theory in a countable signature and let be a countable set of non-isolated types over the empty set.
Then there is a model of which omits every type in .[22]

This implies that if a theory in a countable signature has only countably many types over the empty set, then this theory has an atomic model.

On the other hand, there is always an elementary extension in which any set of types over a fixed parameter set is realised:

Let be a structure and let be a set of complete types over a given parameter set
Then there is an elementary extension of which realises every type in .[23]

However, since the parameter set is fixed and there is no mention here of the cardinality of , this does not imply that every theory has a saturated model. In fact, whether every theory has a saturated model is independent of the axioms of Zermelo–Fraenkel set theory, and is true if the generalised continuum hypothesis holds.[24]

Ultraproducts

[edit]

Ultraproducts are used as a general technique for constructing models that realise certain types. An ultraproduct is obtained from the direct product of a set of structures over an index set I by identifying those tuples that agree on almost all entries, where almost all is made precise by an ultrafilter U on I. An ultraproduct of copies of the same structure is known as an ultrapower. The key to using ultraproducts in model theory is Łoś's theorem:

Let be a set of σ-structures indexed by an index set I and U an ultrafilter on I. Then any σ-formula is true in the ultraproduct of the by if the set of all for which lies in U.[25]

In particular, any ultraproduct of models of a theory is itself a model of that theory, and thus if two models have isomorphic ultrapowers, they are elementarily equivalent. The Keisler-Shelah theorem provides a converse:

If M and N are elementarily equivalent, then there is a set I and an ultrafilter U on I such that the ultrapowers by U of M and :N are isomorphic.[26]

Therefore, ultraproducts provide a way to talk about elementary equivalence that avoids mentioning first-order theories at all. Basic theorems of model theory such as the compactness theorem have alternative proofs using ultraproducts,[27] and they can be used to construct saturated elementary extensions if they exist.[24]

Categoricity

[edit]

A theory was originally called categorical if it determines a structure up to isomorphism. It turns out that this definition is not useful, due to serious restrictions in the expressivity of first-order logic. The Löwenheim–Skolem theorem implies that if a theory T has an infinite model for some infinite cardinal number, then it has a model of size κ for any sufficiently large cardinal number κ. Since two models of different sizes cannot possibly be isomorphic, only finite structures can be described by a categorical theory.

However, the weaker notion of κ-categoricity for a cardinal κ has become a key concept in model theory. A theory T is called κ-categorical if any two models of T that are of cardinality κ are isomorphic. It turns out that the question of κ-categoricity depends critically on whether κ is bigger than the cardinality of the language (i.e. , where |σ| is the cardinality of the signature). For finite or countable signatures this means that there is a fundamental difference between ω-cardinality and κ-cardinality for uncountable κ.

ω-categoricity

[edit]

ω-categorical theories can be characterised by properties of their type space:

For a complete first-order theory T in a finite or countable signature the following conditions are equivalent:
  1. T is ω-categorical.
  2. Every type in Sn(T) is isolated.
  3. For every natural number n, Sn(T) is finite.
  4. For every natural number n, the number of formulas φ(x1, ..., xn) in n free variables, up to equivalence modulo T, is finite.

The theory of , which is also the theory of , is ω-categorical, as every n-type over the empty set is isolated by the pairwise order relation between the . This means that every countable dense linear order is order-isomorphic to the rational number line. On the other hand, the theories of ℚ, ℝ and ℂ as fields are not -categorical. This follows from the fact that in all those fields, any of the infinitely many natural numbers can be defined by a formula of the form .

-categorical theories and their countable models also have strong ties with oligomorphic groups:

A complete first-order theory T in a finite or countable signature is ω-categorical if and only if its automorphism group is oligomorphic.

The equivalent characterisations of this subsection, due independently to Engeler, Ryll-Nardzewski and Svenonius, are sometimes referred to as the Ryll-Nardzewski theorem.

In combinatorial signatures, a common source of ω-categorical theories are Fraïssé limits, which are obtained as the limit of amalgamating all possible configurations of a class of finite relational structures.

Uncountable categoricity

[edit]

Michael Morley showed in 1963 that there is only one notion of uncountable categoricity for theories in countable languages.[28]

Morley's categoricity theorem
If a first-order theory T in a finite or countable signature is κ-categorical for some uncountable cardinal κ, then T is κ-categorical for all uncountable cardinals κ.

Morley's proof revealed deep connections between uncountable categoricity and the internal structure of the models, which became the starting point of classification theory and stability theory. Uncountably categorical theories are from many points of view the most well-behaved theories. In particular, complete strongly minimal theories are uncountably categorical. This shows that the theory of algebraically closed fields of a given characteristic is uncountably categorical, with the transcendence degree of the field determining its isomorphism type.

A theory that is both ω-categorical and uncountably categorical is called totally categorical.

Stability theory

[edit]

A key factor in the structure of the class of models of a first-order theory is its place in the stability hierarchy.

A complete theory T is called -stable for a cardinal if for any model of T and any parameter set of cardinality not exceeding , there are at most complete T-types over A.

A theory is called stable if it is -stable for some infinite cardinal . Traditionally, theories that are -stable are called -stable.[29]

The stability hierarchy

[edit]

A fundamental result in stability theory is the stability spectrum theorem,[30] which implies that every complete theory T in a countable signature falls in one of the following classes:

  1. There are no cardinals such that T is -stable.
  2. T is -stable if and only if (see Cardinal exponentiation for an explanation of ).
  3. T is -stable for any (where is the cardinality of the continuum).

A theory of the first type is called unstable, a theory of the second type is called strictly stable and a theory of the third type is called superstable. Furthermore, if a theory is -stable, it is stable in every infinite cardinal,[31] so -stability is stronger than superstability.

Many construction in model theory are easier when restricted to stable theories; for instance, every model of a stable theory has a saturated elementary extension, regardless of whether the generalised continuum hypothesis is true.[32]

Shelah's original motivation for studying stable theories was to decide how many models a countable theory has of any uncountable cardinality.[33] If a theory is uncountably categorical, then it is -stable. More generally, the Main gap theorem implies that if there is an uncountable cardinal such that a theory T has less than models of cardinality , then T is superstable.

Geometric stability theory

[edit]

The stability hierarchy is also crucial for analysing the geometry of definable sets within a model of a theory. In -stable theories, Morley rank is an important dimension notion for definable sets S within a model. It is defined by transfinite induction:

  • The Morley rank is at least 0 if S is non-empty.
  • For α a successor ordinal, the Morley rank is at least α if in some elementary extension N of M, the set S has infinitely many disjoint definable subsets, each of rank at least α − 1.
  • For α a non-zero limit ordinal, the Morley rank is at least α if it is at least β for all β less than α.

A theory T in which every definable set has well-defined Morley rank is called totally transcendental; if T is countable, then T is totally transcendental if and only if T is -stable. Morley Rank can be extended to types by setting the Morley rank of a type to be the minimum of the Morley ranks of the formulas in the type. Thus, one can also speak of the Morley rank of an element a over a parameter set A, defined as the Morley rank of the type of a over A. There are also analogues of Morley rank which are well-defined if and only if a theory is superstable (U-rank) or merely stable (Shelah's -rank). Those dimension notions can be used to define notions of independence and of generic extensions.

More recently, stability has been decomposed into simplicity and "not the independence property" (NIP). Simple theories are those theories in which a well-behaved notion of independence can be defined, while NIP theories generalise o-minimal structures. They are related to stability since a theory is stable if and only if it is NIP and simple,[34] and various aspects of stability theory have been generalised to theories in one of these classes.

Non-elementary model theory

[edit]

Model-theoretic results have been generalised beyond elementary classes, that is, classes axiomatisable by a first-order theory.

Model theory in higher-order logics or infinitary logics is hampered by the fact that completeness and compactness do not in general hold for these logics. This is made concrete by Lindström's theorem, stating roughly that first-order logic is essentially the strongest logic in which both the Löwenheim-Skolem theorems and compactness hold. However, model theoretic techniques have been developed extensively for these logics too.[35] It turns out, however, that much of the model theory of more expressive logical languages is independent of Zermelo–Fraenkel set theory.[36]

More recently, alongside the shift in focus to complete stable and categorical theories, there has been work on classes of models defined semantically rather than axiomatised by a logical theory. One example is homogeneous model theory, which studies the class of substructures of arbitrarily large homogeneous models. Fundamental results of stability theory and geometric stability theory generalise to this setting.[37] As a generalisation of strongly minimal theories, quasiminimally excellent classes are those in which every definable set is either countable or co-countable. They are key to the model theory of the complex exponential function.[38] The most general semantic framework in which stability is studied are abstract elementary classes, which are defined by a strong substructure relation generalising that of an elementary substructure. Even though its definition is purely semantic, every abstract elementary class can be presented as the models of a first-order theory which omit certain types. Generalising stability-theoretic notions to abstract elementary classes is an ongoing research program.[39]

Selected applications

[edit]

Among the early successes of model theory are Tarski's proofs of quantifier elimination for various algebraically interesting classes, such as the real closed fields, Boolean algebras and algebraically closed fields of a given characteristic. Quantifier elimination allowed Tarski to show that the first-order theories of real-closed and algebraically closed fields as well as the first-order theory of Boolean algebras are decidable, classify the Boolean algebras up to elementary equivalence and show that the theories of real-closed fields and algebraically closed fields of a given characteristic are unique. Furthermore, quantifier elimination provided a precise description of definable relations on algebraically closed fields as algebraic varieties and of the definable relations on real-closed fields as semialgebraic sets[40][41]

In the 1960s, the introduction of the ultraproduct construction led to new applications in algebra. This includes Ax's work on pseudofinite fields, proving that the theory of finite fields is decidable,[42] and Ax and Kochen's proof of as special case of Artin's conjecture on diophantine equations, the Ax–Kochen theorem.[43] The ultraproduct construction also led to Abraham Robinson's development of nonstandard analysis, which aims to provide a rigorous calculus of infinitesimals.[44]

More recently, the connection between stability and the geometry of definable sets led to several applications from algebraic and diophantine geometry, including Ehud Hrushovski's 1996 proof of the geometric Mordell–Lang conjecture in all characteristics[45] In 2001, similar methods were used to prove a generalisation of the Manin-Mumford conjecture. In 2011, Jonathan Pila applied techniques around o-minimality to prove the André–Oort conjecture for products of Modular curves.[46]

In a separate strand of inquiries that also grew around stable theories, Laskowski showed in 1992 that NIP theories describe exactly those definable classes that are PAC-learnable in machine learning theory. This has led to several interactions between these separate areas. In 2018, the correspondence was extended as Hunter and Chase showed that stable theories correspond to online learnable classes.[47]

History

[edit]

Model theory as a subject has existed since approximately the middle of the 20th century, and the name was coined by Alfred Tarski, a member of the Lwów–Warsaw school, in 1954.[48] However some earlier research, especially in mathematical logic, is often regarded as being of a model-theoretical nature in retrospect.[49] The first significant result in what is now model theory was a special case of the downward Löwenheim–Skolem theorem, published by Leopold Löwenheim in 1915. The compactness theorem was implicit in work by Thoralf Skolem,[50] but it was first published in 1930, as a lemma in Kurt Gödel's proof of his completeness theorem. The Löwenheim–Skolem theorem and the compactness theorem received their respective general forms in 1936 and 1941 from Anatoly Maltsev. The development of model theory as an independent discipline was brought on by Alfred Tarski during the interbellum. Tarski's work included logical consequence, deductive systems, the algebra of logic, the theory of definability, and the semantic definition of truth, among other topics. His semantic methods culminated in the model theory he and a number of his Berkeley students developed in the 1950s and '60s.

In the further history of the discipline, different strands began to emerge, and the focus of the subject shifted. In the 1960s, techniques around ultraproducts became a popular tool in model theory.[51] At the same time, researchers such as James Ax were investigating the first-order model theory of various algebraic classes, and others such as H. Jerome Keisler were extending the concepts and results of first-order model theory to other logical systems. Then, inspired by Morley's problem, Shelah developed stability theory. His work around stability changed the complexion of model theory, giving rise to a whole new class of concepts. This is known as the paradigm shift.[52] Over the next decades, it became clear that the resulting stability hierarchy is closely connected to the geometry of sets that are definable in those models; this gave rise to the subdiscipline now known as geometric stability theory. An example of an influential proof from geometric model theory is Hrushovski's proof of the Mordell–Lang conjecture for function fields.[53]

[edit]

Finite model theory

[edit]

Finite model theory, which concentrates on finite structures, diverges significantly from the study of infinite structures in both the problems studied and the techniques used.[54] In particular, many central results of classical model theory fail when restricted to finite structures. This includes the compactness theorem, Gödel's completeness theorem, and the method of ultraproducts for first-order logic. At the interface of finite and infinite model theory are algorithmic or computable model theory and the study of 0-1 laws, where the infinite models of a generic theory of a class of structures provide information on the distribution of finite models.[55] Prominent application areas of FMT are descriptive complexity theory, database theory and formal language theory.[56]

Set theory

[edit]

Any set theory (which is expressed in a countable language), if it is consistent, has a countable model; this is known as Skolem's paradox, since there are sentences in set theory which postulate the existence of uncountable sets and yet these sentences are true in our countable model. Particularly the proof of the independence of the continuum hypothesis requires considering sets in models which appear to be uncountable when viewed from within the model, but are countable to someone outside the model.[57]

The model-theoretic viewpoint has been useful in set theory; for example in Kurt Gödel's work on the constructible universe, which, along with the method of forcing developed by Paul Cohen can be shown to prove the (again philosophically interesting) independence of the axiom of choice and the continuum hypothesis from the other axioms of set theory.[58]

In the other direction, model theory is itself formalised within Zermelo–Fraenkel set theory. For instance, the development of the fundamentals of model theory (such as the compactness theorem) rely on the axiom of choice, and is in fact equivalent over Zermelo–Fraenkel set theory without choice to the Boolean prime ideal theorem.[59] Other results in model theory depend on set-theoretic axioms beyond the standard ZFC framework. For example, if the Continuum Hypothesis holds then every countable model has an ultrapower which is saturated (in its own cardinality). Similarly, if the Generalized Continuum Hypothesis holds then every model has a saturated elementary extension. Neither of these results are provable in ZFC alone. Finally, some questions arising from model theory (such as compactness for infinitary logics) have been shown to be equivalent to large cardinal axioms.[60]

See also

[edit]

Notes

[edit]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Model theory is a branch of that studies mathematical structures by means of logical languages, focusing on the interpretations (models) of formal theories and the properties definable within them. It examines how sentences in a are satisfied by structures, providing tools to classify theories up to and explore their definability and decidability. The origins of model theory trace back to the early , with foundational work by Leopold Löwenheim in 1915, who proved the first theorem linking the of sentences to the existence of a countable model. This was followed by key developments in the 1920s and 1930s, including Alfred Tarski's contributions to truth definitions and , as well as Kurt Gödel's in 1930, which established that every valid sentence has a . The field matured in the mid-20th century through Abraham Robinson's non-standard in the 1960s and Michael Morley's 1965 categoricity theorem, which initiated modern model theory by showing that certain theories have unique models up to in specific cardinalities. Central concepts in model theory include the , which states that a set of first-order sentences has a model if and only if every finite subset does, and the Löwenheim-Skolem Theorem, guaranteeing models of any infinite cardinality for consistent theories. Other key ideas encompass types, saturation, stability, and forking, which classify theories based on their combinatorial properties and enable applications in algebra, geometry, and . Notably, Saharon Shelah's in the 1970s provided a classification of theories based on their complexity, influencing much of modern model theory. For instance, model-theoretic techniques have contributed to undecidability results in various algebraic theories and advanced o-minimality in real .

Introduction

Overview

Model theory is a branch of that studies the relationship between , particularly , and their interpretations as mathematical structures known as models. It examines how logical theories—sets of sentences in a formal language—are satisfied by these models, focusing on semantic consequences and the isomorphism types of models that realize the same theory. In this framework, a model consists of a domain equipped with interpretations for the language's constants, functions, and relations, allowing one to evaluate the truth of sentences within that structure. First-order logic serves as the primary language in model theory, enabling the expression of properties using quantifiers over individual elements, logical connectives, and predicates. For instance, the theory of dense linear orders without endpoints, axiomatized by sentences ensuring totality, irreflexivity, transitivity, density, and lack of endpoints, has models like the rational numbers under the usual order. Similarly, the theory of algebraically closed fields of a fixed characteristic captures structures such as the complex numbers, where every non-constant has a . These examples illustrate how model theory connects abstract syntax to concrete mathematical objects. Key motivations in model theory include classifying structures up to elementary equivalence—where two models satisfy exactly the same sentences—and exploring definability, which concerns the sets and functions expressible by formulas within a model. These pursuits reveal deep interconnections across , from to , by providing tools to understand when structures behave similarly from a logical perspective. Such classifications often leverage results like the , which allows infinite models to be approximated by finite subtheories, and types, which describe consistent collections of properties for elements. Model theory originated in the early as a means to bridge the syntax of formal languages with their semantics, with foundational contributions from Leopold Löwenheim's 1915 theorem on the existence of models for satisfiable theories and Alfred Tarski's work in on truth definitions and undecidability. This development marked a shift toward systematically studying interpretations of logical systems, evolving into a vibrant field by the mid-century.

Importance and Scope

Model theory has played a pivotal role in advancing problems across various branches of by providing logical tools to analyze structures and their properties. In , it facilitates proofs of deep results such as the Ax-Grothendieck theorem, which asserts that every injective polynomial from Cn\mathbb{C}^n to itself is surjective; this is established using model-theoretic methods, such as ultraproducts of finite fields, to transfer the property that injectivity implies surjectivity from finite fields to algebraically closed fields of characteristic zero, leveraging in the theory of ACF_0. In , model-theoretic techniques have resolved conjectures like the function field analogue of the Mordell-Lang conjecture, using notions such as stable groups to bound solutions to Diophantine equations in positive characteristic. Similarly, in , model theory contributes to o-minimal structures, enabling the study of definable sets in real and yielding theorems on the topology of semi-algebraic sets. Despite these successes, , the primary framework of model theory, has inherent limitations in capturing all mathematical truths. For instance, Cantor's , which posits that there is no cardinal between that of the integers and the reals, is independent of the first-order Zermelo-Fraenkel with (ZFC); Gödel constructed a model where it holds (the constructible universe), while Cohen used forcing to build a model where it fails, demonstrating that no first-order axiomatization can decide it. This independence highlights how first-order theories may admit non-standard models that diverge from intuitive mathematical structures, underscoring the boundaries of expressiveness in formal languages. The scope of model theory centers on , where key results like and Löwenheim-Skolem theorems ensure the existence of models of varying cardinalities, providing a robust foundation for classifying theories. Extensions to higher-order logics or infinitary logics, such as L,ωL_{\infty,\omega}, expand this scope but often lead to non-elementary theories, where decidability fails and complexity grows beyond recursive bounds. Briefly, serves as a classification tool for theories based on the behavior of types, while ultraproducts construct models preserving elementary equivalence, aiding applications in these extended settings. Philosophically, model theory offers a model-theoretic approach to semantics, exemplified by Tarski's definition of truth, which formalizes truth for sentences in a language relative to a model as satisfaction of the sentence by all assignments in that structure, resolving paradoxes like the through hierarchical languages. This framework bridges logic and by grounding semantic notions in set-theoretic models, influencing understandings of meaning and interpretation in .

Fundamentals of First-Order Model Theory

Languages and Structures

A first-order language, or simply language, in model theory is defined by a signature σ\sigma, which consists of a set of constant symbols CC, a set of function symbols FF each equipped with a positive integer arity, and a set of relation symbols RR each with a non-negative integer arity (where arity zero for relations corresponds to propositions). Terms in the language are built inductively from variables, constants, and function applications, such as f(t1,,tn)f(t_1, \dots, t_n) where fFf \in F has arity nn and tit_i are terms. Atomic formulas are equations t1=t2t_1 = t_2 between terms or atomic relations r(t1,,tn)r(t_1, \dots, t_n) for rRr \in R, and full formulas are constructed from these using logical connectives (¬,,,,\neg, \land, \lor, \to, \leftrightarrow) and quantifiers (,\forall, \exists) applied to variables. A structure M\mathcal{M} for a language with signature σ\sigma, often denoted M=(M,M)\mathcal{M} = (M, \cdot^\mathcal{M}), interprets the language semantically: it comprises a non-empty universe set MM (the domain), an interpretation cMMc^\mathcal{M} \in M for each constant cCc \in C, a function fM:MnMf^\mathcal{M}: M^n \to M for each nn-ary fFf \in F, and a relation RMMnR^\mathcal{M} \subseteq M^n for each nn-ary rRr \in R. Structures provide the concrete realizations where formulas can be evaluated for truth: for example, the integers Z\mathbb{Z} with binary operations +Z+^\mathbb{Z} and Z\cdot^\mathbb{Z} form a structure for the signature of rings σ={0,1,+,}\sigma = \{0, 1, +, \cdot\}. A canonical example is the class of groups, where the signature σG={,1,e}\sigma_G = \{\cdot, ^{-1}, e\} (binary multiplication, unary inverse, constant identity) is interpreted in a set GG as a binary operation G:G×GG\cdot^G: G \times G \to G, unary (1)G:GG( \cdot^{-1} )^G: G \to G, and eGGe^G \in G, satisfying the group axioms when formulas are true in the structure. Homomorphisms between structures A\mathcal{A} and B\mathcal{B} for the same preserve the interpretations: a function h:ABh: A \to B such that h(fA(a1,,an))=fB(h(a1),,h(an))h(f^\mathcal{A}(a_1, \dots, a_n)) = f^\mathcal{B}(h(a_1), \dots, h(a_n)) for all fFf \in F, h(cA)=cBh(c^\mathcal{A}) = c^\mathcal{B} for cCc \in C, and (a1,,an)RA(a_1, \dots, a_n) \in R^\mathcal{A} implies (h(a1),,h(an))RB(h(a_1), \dots, h(a_n)) \in R^\mathcal{B} for rRr \in R. An embedding is an injective homomorphism that additionally preserves the relations in both directions (i.e., the preimage under hh of any relation tuple in B\mathcal{B} is in the relation in A\mathcal{A}), effectively viewing A\mathcal{A} as a substructure of B\mathcal{B}. Isomorphisms are bijective homomorphisms with bijective inverses that are also homomorphisms, meaning A\mathcal{A} and B\mathcal{B} are essentially the same up to relabeling of elements. The natural numbers N={0,1,2,}\mathbb{N} = \{0, 1, 2, \dots \} with interpretations 0N=00^\mathbb{N} = 0, successor SN(n)=n+1S^\mathbb{N}(n) = n+1, addition +N+^\mathbb{N}, and multiplication N\cdot^\mathbb{N} form the standard model of Peano arithmetic in the signature σPA={0,S,+,}\sigma_{PA} = \{0, S, +, \cdot\}. However, by the Löwenheim-Skolem theorem, there exist non-standard models of Peano arithmetic, which include N\mathbb{N} as an initial segment but extend to "infinite" elements larger than any standard natural number, yet still satisfying the same first-order axioms. These non-isomorphic models illustrate how structures can realize the same language interpretations differently while preserving formula truth.

Theories, Models, and Satisfiability

In first-order model theory, a theory TT over a language L\mathcal{L} is a set of L\mathcal{L}-sentences that is closed under logical deduction, meaning that if ϕ\phi is an L\mathcal{L}-sentence and TϕT \vdash \phi, then ϕT\phi \in T. Theories are often specified by a (possibly infinite) set of axioms Γ\Gamma, with T=Th(Γ)T = \mathrm{Th}(\Gamma) denoting the deductive closure of Γ\Gamma under the rules of first-order logic. A theory TT is consistent if it does not prove both a sentence ϕ\phi and its negation ¬ϕ\neg \phi for any L\mathcal{L}-sentence ϕ\phi. Consistent theories can be incomplete, meaning there exist L\mathcal{L}-sentences ϕ\phi such that neither ϕ\phi nor ¬ϕ\neg \phi is in TT; examples include the theory of linear orders (which does not decide the existence of a least element) and Peano arithmetic (which, by Gödel's second incompleteness theorem, does not prove its own consistency). In contrast, a complete theory TT decides every L\mathcal{L}-sentence, so for every ϕ\phi, exactly one of ϕ\phi or ¬ϕ\neg \phi belongs to TT. A model of a TT is an L\mathcal{L}- M\mathcal{M} such that Mϕ\mathcal{M} \models \phi for every sentence ϕT\phi \in T. The set of all models of TT, denoted Mod(T)\mathrm{Mod}(T), consists of those that satisfy every and of TT. Two L\mathcal{L}- M\mathcal{M} and N\mathcal{N} are elementarily equivalent, written MN\mathcal{M} \equiv \mathcal{N}, if they satisfy exactly the same L\mathcal{L}-sentences, or equivalently, if Th(M)=Th(N)\mathrm{Th}(\mathcal{M}) = \mathrm{Th}(\mathcal{N}), where Th(A)\mathrm{Th}(\mathcal{A}) is the of A\mathcal{A} (the set of all L\mathcal{L}-sentences true in A\mathcal{A}). Thus, all models of a TT are elementarily equivalent to one another, though they may differ in or other non-first-order properties; for instance, the of algebraically closed fields of characteristic zero has models of every infinite , all elementarily equivalent but non-isomorphic. Satisfiability concerns the existence of models for formulas or sets of sentences. A formula ψ\psi (with free variables) is satisfiable if there exists an L\mathcal{L}-structure M\mathcal{M} and an assignment ss to the free variables such that M,sψ\mathcal{M}, s \models \psi; for sentences (closed formulas), satisfiability simplifies to the existence of a model M\mathcal{M} with Mψ\mathcal{M} \models \psi. A set of sentences Γ\Gamma (or the theory it generates) is satisfiable if it has a model. In first-order logic, consistency and satisfiability coincide: a theory TT is consistent if and only if it is satisfiable. This equivalence follows from Gödel's completeness theorem, which states that every consistent first-order theory TT has a model (in fact, models of every infinite cardinality, by the Löwenheim-Skolem theorems). The theorem establishes deductive completeness for first-order logic, meaning that semantic entailment Γϕ\Gamma \models \phi (every model of Γ\Gamma satisfies ϕ\phi) holds if and only if syntactic entailment Γϕ\Gamma \vdash \phi holds.

Compactness Theorem

The compactness theorem is a cornerstone of first-order model theory, asserting that a theory TT in a first-order language has a model if and only if every finite subset of TT has a model. Formally, for any set TT of LL-sentences, TT is satisfiable precisely when, for every finite T0TT_0 \subseteq T, there exists a structure M\mathcal{M} such that MT0\mathcal{M} \models T_0. This equivalence underscores the local, finite nature of first-order satisfiability, where global consistency reduces to checking finitely many axioms at a time. The theorem, first derived as a consequence of Kurt Gödel's 1930 completeness theorem for first-order logic, plays a pivotal role in constructing models and analyzing the limitations of axiomatic systems. A standard proof of the compactness theorem relies on , which states that every consistent has a model. To see this, suppose every finite subset of TT is satisfiable; by the soundness theorem, each such subset is consistent. If TT were inconsistent, then there would exist a finite proof of a contradiction \bot from TT, implying some finite T0TT_0 \subseteq T proves \bot, contradicting the consistency of T0T_0. Thus, TT is consistent and, by completeness, satisfiable. The converse direction is immediate: if TT has a model M\mathcal{M}, then any finite T0TT_0 \subseteq T is also satisfied in M\mathcal{M}. An alternative proof sketch uses a Henkin-style , extending partial models by adding witnesses for existentials in a countable expansion of the language, ensuring consistency at each finite stage and yielding a model for the whole via the . Key consequences of the include the existence of non-standard models for theories like Peano arithmetic (PA). Consider the theory T=PA{cn>nnN}T = \mathrm{PA} \cup \{c_n > \overline{n} \mid n \in \mathbb{N}\}, where cnc_n are new constants and n\overline{n} denotes the standard numeral for nn. Every finite subset of TT is satisfiable in the standard model N\mathbb{N}, as one can choose sufficiently large values for the finitely many cnc_n involved. By , TT has a model M\mathcal{M}, which extends N\mathbb{N} but includes "infinite" elements satisfying c1<c2<c_1 < c_2 < \cdots, demonstrating that PA admits non-standard models beyond the natural numbers. This illustrates how enables infinite approximations from finite data, revealing the incompleteness of first-order theories in capturing unique structures. The theorem also implies the finitary character of first-order logic: unlike infinitary logics, first-order satisfiability depends only on finite fragments, facilitating model constructions from local properties.

Löwenheim-Skolem Theorems

The Löwenheim–Skolem theorems form a cornerstone of model theory, establishing control over the possible cardinalities of models for first-order theories and structures. Named after Leopold Löwenheim, who proved an initial version in 1915, and , who provided key generalizations in 1920 and 1922, these results demonstrate that the existence of a model in one infinite cardinality implies the existence of models in many others, often through elementary embeddings or extensions. The downward Löwenheim–Skolem theorem states that every infinite first-order structure has a countable elementary substructure. In a countable language, if A\mathfrak{A} is an infinite L\mathcal{L}-structure, then there exists a countable subset AAA \subseteq |\mathfrak{A}| such that the substructure AA\mathfrak{A}|_A is elementary in A\mathfrak{A}, meaning that for every first-order formula ϕ(xˉ)\phi(\bar{x}) in L\mathcal{L} and every aˉAxˉ\bar{a} \in A^{|\bar{x}|}, Aϕ(aˉ)\mathfrak{A} \models \phi(\bar{a}) if and only if AAϕ(aˉ)\mathfrak{A}|_A \models \phi(\bar{a}). This holds more generally for languages of arbitrary cardinality λ\lambda: every structure of cardinality greater than λ\lambda has an elementary substructure of cardinality at most λ+0\lambda + \aleph_0. The proof proceeds by iteratively constructing a countable (or small) set closed under witnesses for existential formulas, often via an expansion of the language with Skolem functions that define choice functions for existentially quantified variables in the theory of the structure. The upward Löwenheim–Skolem theorem complements this by showing that first-order theories with infinite models admit models of arbitrarily large cardinality. Specifically, if TT is a first-order theory in a language L\mathcal{L} with L=λ|\mathcal{L}| = \lambda and TT has an infinite model, then for every cardinal κλ\kappa \geq \lambda, TT has a model of cardinality κ\kappa. The proof outline involves expanding the language with κ\kappa many new constant symbols and using Skolem functions in an elementary extension of an existing model to ensure the new elements satisfy the theory while maintaining elementarity; compactness of first-order logic serves as a key lemma to guarantee consistency of the expanded theory. This construction yields an elementary extension of the original model with the desired cardinality. These theorems have profound implications for the expressive power of first-order logic, revealing that no first-order theory can uniquely characterize uncountable structures up to isomorphism. For instance, the theory of the real numbers as an ordered field has uncountable models like R\mathbb{R}, but by the downward theorem, it also admits countable elementary submodels that are elementarily equivalent yet not isomorphic, underscoring the limitations of first-order axiomatizations in capturing specific infinite cardinalities.

Definability

Definable Sets and Functions

In model theory, a definable set in an LL-structure M\mathcal{M} is a subset XMnX \subseteq M^n for which there exists a first-order formula ϕ(x,b)\phi(\mathbf{x}, \mathbf{b}) in the language LL, with parameters bMm\mathbf{b} \in M^m, such that X={aMnMϕ(a,b)}X = \{\mathbf{a} \in M^n \mid \mathcal{M} \models \phi(\mathbf{a}, \mathbf{b})\}. More generally, XX is AA-definable for a subset AMA \subseteq M if the parameters b\mathbf{b} can be chosen from AA. A definable function in M\mathcal{M} from MkM^k to MnM^n is a function ff whose graph {(a,f(a))aMk}\{( \mathbf{a}, f(\mathbf{a}) ) \mid \mathbf{a} \in M^k\} is a definable set in M\mathcal{M}. Equivalently, there exists a formula ψ(x,y,b)\psi(\mathbf{x}, \mathbf{y}, \mathbf{b}) such that for each aMk\mathbf{a} \in M^k, there is a unique yMn\mathbf{y} \in M^n satisfying Mψ(a,y,b)\mathcal{M} \models \psi(\mathbf{a}, \mathbf{y}, \mathbf{b}), and this y=f(a)\mathbf{y} = f(\mathbf{a}). In the structure (R,+,×)(\mathbb{R}, +, \times), the definable sets are precisely the semialgebraic sets, which in one dimension consist of finite unions of points and intervals. For example, the set {xRx>2}\{x \in \mathbb{R} \mid x > \sqrt{2}\}
Add your contribution
Related Hubs
User Avatar
No comments yet.