Monthly Archives: April 2012

Pila-Wilkie Theorem and Its Consequences

Recent years have witnessed some very exciting applications of model theory in various areas of mathematics; among which is the (nth) proof of Manin-Mumford Conjecture (I will refer to it as MMC) by Pila and Zannier ([7]). I will not get into why MMC is important as it is the subject of an article in a number theory blog. I will, however, get into details of how o-minimality helps proving MMC. In vague terms, MMC states that there aren’t too many torsion points on a subvariety of an abelian variety, unless that subvariety itself is close to being an abelian (sub)variety.

Recall that an abelian variety A is a complex torus \mathbb C^n/\Lambda that can be embedded into a projective space \mathbb P^N(\mathbb C) (hence it has the structure of a projective complex variety). With this definition, an abelian variety is obviously an algebraic group which, in turn, is abelian.

There are various ways to state MMC with various strengths. Here is the version I’d like to discuss as it appears in [7].

Theorem 1. Let A be an abelian variety and X a subvariety, both defined over a number field. Suppose that X does not contain a translate of an infinite abelian subvariety of A.  ThenX contains only finitely many torsions of A.

This has been proven first by Reynaud, and later came the proofs many others like Hindry, Buium and Hrushovski (this last one is a model theoretic proof as well). The proof I’ll discuss is due to Pila and Zannier and is quite different in vein, as uses o-minimality.

Without further ado, I’ll start explaining the main tool in that proof.

Pila-Wilkie Theorem

In a nutshell, Pila-Wilkie Theorem ([6]) gives an upper bound on the number of rational points of a set definable in an o-minimal expansion \mathcal R of the real field, in terms of their heights. Well, this is very vague; anything this vague has to be correct. So where is the hardness? Let’s make things more precise.

First of all, the height of a rational number \frac{a}{b} is the maximum of |a| or |b|, provided that \gcd(a,b)=1 and the height of a tuple of rational numbers is the maximum of the heights of its components. This is the most naive notion of height, but it works fine in the case in hand.

We wish to state that given a definable set Y\subseteq\mathbb R^m, the number of rational points in Y of height less than T is bounded by a linear function of  T. But of course this cannot be the case, taking Y to be the whole \mathbb R^m; besides such a result would finish a huge part of number theory.  For this reason we need to take the (semi)algebraic part   Y^{alg} of Y out; which is defined to be the union of all connected infinite semialgebraic sets contained in Y.

With all this notation in hand, here is the precise statement of Pila-Wilkie Theorem (I’ll refer to it as PW).

Theorem 2.  Let Y\subseteq\mathbb R^n be definable and \epsilon>0. There exists a constant c=c(Y,\epsilon)>0 such that for any T>0 we have 

|\{\alpha\in (Y\setminus Y^{alg})\cap\mathbb Q^n: H(\alpha)<T\}|\leq c T^{\epsilon}.

The proof of this result is quite involved, but is a nice combination of the earlier individual works of the authors.  Some years ago, Pila has proven a similar results in dimensions one and two ([1] (with Bombieri)  and [5]) and Wilkie has a result on the integer points of one dimensional sets definable in o-minimal expansions of the real field ([8]).

Proof of MMC

The main difficulty in getting MMC from PW is to determine the algebraic part of a definable set. In general, we don’t really have any way to do that, but in our situation we deal with special kind of definable sets, definable in certain o-minimal structures.

Remember that we want to understand the torsions in a subvariety X of an abelian variety A=\mathbb C^n/\Lambda.  In this case we have the universal covering map:

\pi: \mathbb C^n\to A.

Let \lambda_1,\dots,\lambda_{2n}\in\mathbb C^n be a basis of the lattice \Lambda. If we represent \mathbb C^n with respect this basis, we see that the torsion points of A correspond to the rational points of \mathbb C^n.

In particular, the torsion points of X correspond to the rational points of the set Y'=\pi^{-1}(X). The main virtue of this set Y' is that it is periodic with respect to \Lambda. Therefore we could restrict our attention to its rational points in [0,1]^n.  Now this new set Y:=Y'\cap [0,1]^n is definable in the expansion \mathbb R_{an} of the real field by restricted analytic functions. From now on we play with this set. Before forgetting X, note that the assumption that it doesn’t contain any translate of any infinite abelian subvariety of A translates to Y as it doesn’t contain any translate of any \mathbb C-linear subspace V of \mathbb C^n such that dim V= rank (V\cap \Lambda). Such translates are called torus cosets.

A large part of the paper ([7]) is devoted to the following.

Theorem 3.  Let Z\subseteq\mathbb C^n be an analytic set which is periodic with respect to a full-lattice. Suppose that Z does not contain any torus coset. Then Z does not contain any infinite connected semialgebraic set. This is to say that Z^{alg}=\emptyset.

Note that this result holds for any complex torus, rather than just the abelian varieties. In short, the proof of this is just local (and tedious) analysis.

Now using this we get that (Y')^{alg}=\emptyset and hence Y^{alg}=\emptyset. So applying Theorem 2, we get that for a given \epsilon>0, there is a constant c=c(Y,\epsilon)>0 such that there are at most cT^{\epsilon} many rational points in Y with common denominator T, hence at most that many torsions in X of exponent at most T. This only gives an upper bound and the following result by Masser ([3]) gives the lower bound, finishing the proof of MMC.

Theorem 4. Let A be an abelian variety defined over a number field K. Then there is d>0 and \rho depending only on the dimension of A such that for every torsion point P\in A of order T we have

[K(P):\mathbb Q]\geq d T^{\rho}.

Final remarks

Recently this strategy played the role of a template for the proofs of even more complicated results such as the proof of certain cases of Andre-Oort conjecture by  Habbegger and Pila ([2]). Other than finding the right setting, the main obstacles in such proofs are determining the algebraic part of the appropriate set and finding the substitute for Theorem 4.  Of course, finding the right kind of o-minimal structure is another important step; as done by Peterzil and Starchenko in ([4]).

References

  1. E. Bombieri, J. Pila. The number of integral points on arcs and ovals. Duke Math. J. 59 (1989), no. 2, 337–357.
  2. P. Habegger, J. Pila. Some Unlikely Intersections Beyond André-Oort. Compositio Math. 148 (2012).
  3. D. W. Masser. Small values of the quadratic part of the Néron-Tate height on an abelian variety. Compositio Math., 53(2):153–170, 1984.
  4. Y. Peterzil and S. Starchenko. Definablity of restricted theta functions and families of abelian varieties. arXiv:1103.3110.
  5. J. Pila. Rational points on a subanalytic surface.  Ann. Inst. Fourier (Grenoble) 55 (2005), no. 5, 1501–1516.
  6. J. Pila and A. Wilkie. The rational points of a definable set. Duke Math. J. 133 (2006), no. 3, 591–616.
  7. Jonathan Pila and Umberto Zannier. Rational points in periodic analytic sets and the Manin-Mumford conjecture. Atti Accad. Naz. Lincei Cl. Sci. Fis. Mat. Natur. Rend. Lincei (9) Mat. Appl., 19(2):149–162, 2008.
  8. A. J. Wilkie. Diophantine properties of sets definable in o-minimal structures. J. Symbolic Logic 69 (2004), no. 3, 851–861.

When are bi-embeddable structures isomorphic?

Let (\textbf{K}, \textbf{Mor}) be a class of algebraic structures \textbf{K} with a distinguished class of morphisms \textbf{Mor} between them. We say that the class has the Schröder-Bernstein (SB) property if any two elements of \mathbf{K} which are bi-embeddable with respect to \mathbf{Mor} are isomorphic.

Question 1: How can we tell when (\textbf{K}, \textbf{Mor}) has the SB property?

So, for instance, when \textbf{K} = \textbf{Sets} and \textbf{Mor} is all injective functions, then this class will have the SB property by the usual Cantor-Schröder-Bernstein theorem. The reader should try to think of examples which do not have the SB property, and check whether or not his or her favorite categories have SB.

The model-theoretic angle here is that we will concentrate on the case where \textbf{K} is all of the models of some first-order theory T and \textbf{Mor} is the class of all elementary embeddings between them (that is, the maps which preserve the truth of all first-order formulas). If this is true, then we will say that the theory T has the SB property.

But why is the Schröder-Bernstein property interesting? One could easily come up with a list of other strange, never-before-studied properties that some theories have and others do not, so what makes the Schröder-Bernstein property worth studying?

In this post I’ll try to explain a bit about why I am interested in this property as well as give some partial answers to Question 1 in model theory.

The naturality of considering the Schröder-Bernstein property

When many people first hear about it, the interest in studying the Schröder-Bernstein property seems evident. Others are skeptical. Partly for the skeptics, I’ll explain what made me first start thinking about it (within model theory).

A prime model of a theory T is a model M of T which is elementarily embeddable into every other model of T. Some theories have prime models, others do not. Naturally if one has a prime model of a theory T, one would like to know whether it is unique (up to isomorphism).

“Theorem 2” If T has a prime model, then it this prime model is unique.

Here is the “proof” of this statement that occurred to me as a grad student, and must have occurred to hundreds of other students before me: let M and N be two prime models of T. Then, by definition, M is elementarily embeddable into N, and also N is elementarily embeddable into M. So they must be isomorphic, right?

But this argument only works if we somehow know that T has the SB property, and (to me, as I write this) it seems no easier to prove that a given theory has the SB property than to show that it has a unique prime model. I now know about the famous, famously difficult result of Shelah that totally transcendental theories have unique prime models, as well as his examples showing that there exist theories with nonisomorphic prime models (see Shelah, “On uniqueness of prime models,” Journal of Symbolic Logic 44 (1979) no. 2, 215-220). For me, trying to repair this too-hasty argument for “Theorem 2” is what got me thinking about SB for theories.

Schröder-Bernstein and classifiability

Probably the very first time I thought about the Schröder-Bernstein property (in a totally different context than logic) was while reading Irving Kaplansky’s book Infinite Abelian Groups many years ago. In this book, Kaplansky at one point asks the following question (to paraphrase): say we have a system for classifying all objects of a certain type — say, abelian groups — up to isomorphism, using some sorts of “invariants.” How do we know if this system is really useful or enlightening? After all, we always have the “trivial classification system” of associating each object to its isomorphism class! This is silly, but the interesting question is: how do we know that our classification system is really better than the trivial classification system?

In a neat bit of parallel evolution (I doubt Kaplansky was influenced much by Shelah, or vice versa), Kaplansky basically answers this by saying that we should come up with test questions to which any good classification scheme should give a definite “Yes” or “No” answer. Crucially, these test questions should be phrased in terminology which is independent of our supposed classification system, so that we know that the system can answer “real questions” about the classified objects themselves as opposed to merely solving problems that arise from the internal complexity of our classifying scheme.

One of Kaplansky’s test questions was:

Question 3: If G and H are two abelian groups such that each one is isomorphic to a direct summand of the other, then are G and H isomorphic?

(Clearly this is a case of Schröder-Bernstein with \textbf{Mor} being all injections whose images are direct summands. If we instead considered all injective group maps, then Question 3 would have many simple counterexamples, as the reader should verify.)

Somewhat surprisingly, the answer to Question 3 is “No,” with a counterexample so complex that Kaplansky himself could not find it. But the details of this do not matter to us here. What’s important to note is that we can know that Question 3 is true for certain classes of abelian groups using classification schemes.

For instance, as Kaplansky explains in his book, the answer to Question 3 is “Yes” if G and H are both countable abelian torsion groups (that is, for each element g \in G, there is some n \in \mathbb{N} such that n g = 0, but there is not necessarily a uniform bound on n for all g in G). This is immediate using Ulm invariants. Another example, much simpler to explain in a blog post, is if G and H are finitely-generated abelian groups: then each one is a direct sum of indecomposable summands that are isomorphic to either \mathbb{Z} or \mathbb{Z} / p^k \mathbb{Z} for some prime power p^k, and we just have to count the number of summands of each type to prove the result.

Thinking about examples like the ones above leads to the following intuition, partly justifying the study of the SB property (and I won’t try to make this too precise here):

Idea 4: If the objects in \mathbf{K} can be classified by a bounded set of cardinal-number invariants which are preserved by embeddings in \mathbf{Mor}, then we should expect (\mathbf{K}, \mathbf{Mor}) to have the SB property.

These cardinal-number invariants could be things like the number of direct summands belonging to a certain isomorphism class, or they could be dimensions (as for a vector space).

Famously, Shelah’s primary test question (which he used to help justify the development of his “classification theory,” now usually called stability theory) was to compute all possible spectrum functions I(T, \kappa) for a complete theory T, where I(T, \kappa) is the number of isomorphism types of models of T with cardinality \kappa. But we could ask whether Shelahian stability theory is also useful for answering other test questions.

I’ll end this section with a couple of very naïve test questions of my own which I think that the classification theory for models should eventually be able to give definite answers to. But even though I have been thinking off and on about them for years now, I still cannot answer them except in certain special cases.

Question 5: Suppose T is a first-order theory without the Schröder-Bernstein property. Is there an infinite collection of models of T which are pairwise elementarily bi-embeddable but pairwise non-isomorphic?

Question 6: Suppose T is a complete first-order theory with the Schröder-Bernstein property. Is it true that any expansion of T by new constant symbols also has the Schröder-Bernstein property?

Schröder-Bernstein and stability theory

Now I’ll explain a few cases where I know that the Schröder-Bernstein property does not hold. Before giving precise statements, here is another vague generality to motivate the discussion:

Idea 7: If a class of structures is not easily classifiable, then often it is because it contains two objects A and B which “look very similar” and yet are not isomorphic. If A and B “look sufficiently similar,” then often they will be bi-embeddable, thus giving a counterexample to the Schröder-Bernstein property.

So we might expect that those theories which Shelah calls unclassifiable to not have the SB property. And it turns out that this is true! One result from my thesis was:

Theorem 8: Suppose that T is a countable first-order theory. If T is not classifiable (either unstable, or not superstable, or DOP, or else OTOP), then T does not have the SB property. If T is unstable, DOP, or OTOP, then T has an infinite collection of pairwise-bi-embeddable, pairwise nonisomorphic models.

Maybe I will try to explain what this “classifiability” is all about in a future post; it is a fascinating and complicated subject. But a simple example of an unclassifiable (unstable) theory is the theory of all dense linear orderings without endpoints, and it is a fun exercise to construct counterexamples by hand (HINT: by quantifier elimination, any injective order-preserving map will be elementary).

A simple example of a theory which is classifiable yet still does not have the SB property is the theory of all equivalence relations with infinitely many classes, each of which is infinite. For this example, the word “classifiable” naïvely makes sense because any model is “classified” simply by a list of cardinal numbers (that is, the cardinalities of each of the equivalence classes). Again, it is interesting to construct one’s own counterexamples to the SB property here, and to compare this with Idea 4 above (note I was very careful to add the word “bounded,” and in this example there is no fixed prior bound on the length of the list of cardinals classifying a model).

Further reading

I’ll wrap up this post now since it’s already longer than I planned, and I could try to develop various of the ideas here in more detail in future posts. But I should at least give some references to precise theorems and proofs for those who are interested.

Before my thesis (finished in 2007), I am not aware of anyone else attempting a systematic study of the SB property within model theory other than T. A. Nurmagambetov in a pair of articles from the 1980’s (“The mutual embeddability of models” (in Russian), from The Theory of
Algebraic Structures, Karagand. Gos. Univ., 1985, pp. 109–
115, and “Characterization of \omega-stable theories with a bounded number of dimensions,” Algebra i Logika 28 (1989), no. 5, 388–396.) In these articles, Nurmagambetov only considered totally transcendental theories, and he showed that in this case the SB property is equivalent to nonmultidimensionality.
An interesting variation on Idea 7 comes from Shelah’s paper “Existence of many L_{\infty,\lambda}-equivalent, non- isomorphic models of T of power \lambda” (Annals of Pure and Applied Logic 34 (1987), no. 3, 291-310). Note that L_{\infty,\lambda} is a very powerful non-first-order logic (allowing infinitely long conjuctions and disjunctions, among other things), so two models that are L_{\infty,\lambda}-equivalent really do look quite similar. Shelah establishes the property in the title for any theory which is not superstable or which has DOP or OTOP. However, this does not, as far as I know, have obvious consequences for bi-embeddability.

In joint work with Chris Laskowski, we treated special cases of the SB property for theories in the papers “The Schröder-Bernstein property for weakly minimal theories” and “The Schröder-Bernstein property for a-saturated models.” The first paper only considers weakly minimal theories, but we give a complete characterization of the SB property in this context in terms of weak orthogonality of types, and we show that given a failure of SB for one of these theories, one can construct and infinite family of pairwise bi-embeddable, pairwise non-isomorphic structures. The second paper linked above was written with a broader audience in mind (especially the first section) and it gives a complete answer to when a theory has an expansion by constants with the SB property as well as a partial answer to when the sufficiently saturated models have the SB property.

For me, the way in which one can manage to prove such results is much more interesting than merely listing the statements of the theorems themselves… but maybe more about this later!

A new blog in Model Theory and Set Theory?

Inspired by other such initiatives (in different parts of Mathematics), a group of us Model Theorists and Set Theorists have decided to start a fresh blog on that hard to define area that is perhaps 80% Model Theory and 20% Set Theory