We all know about Pythagoras and his obsession with triangles. Usually when we first learn about the rational and real numbers, we are also told about the Pythagoreans, a 6th century BCE cult that started with the followers of Pythagoras and which dealt with mysticism and Mathematics (and if it still exists and you are a member in it, please send me a message – I would love to join). The legend says that they loved ratios between natural numbers, i.e. rational numbers, and didn’t believe that there exist any other type of numbers. Unfortunately, Hippasus which was one of their own “betrayed” them and proved that is not rational. When he died by drowning, the Pythagoreans said that this was a punishment by the gods themselves for showing that the world of Mathematics is not perfectly rational. Well, it was either this, or because he built a dodecahedron inside a sphere. Anyway, these days we know that there are irrational numbers, of course, but it is nice to know that if you insult the gods of Mathematics you will be punished.
The moral from this story is that rational numbers are divine, but there are real numbers which are not rational. While there are irrational numbers, there are so many rational numbers so that we can get as close as we want to any real number. In this post we will try to investigate when should we say that an approximation is good, how good of an approximation can we hope to find for a given real number, and finish with a bit about the connection to lattices (see the post about lattices for the definitions and some examples).
What is a rational approximation
To start off, given a real number , a rational approximation is simply a rational number where is “small”. Whether it is a “good” rational approximation will be determined by how “small” this is.
As it is right now, this starting point can be very confusing to anyone who knows a little bit about rational numbers. Indeed, we can find such rational numbers which make this absolute value as small as we want, or more formally, for any there is a rational number which solves .
To get a better intuition of the “goodness” of approximation, think about the problem of storing in your computer’s memory. Usually, we store numbers in memory using their binary presentation, however in general might have an infinite such presentation. One solution is instead to store an approximation of instead which has a finite presentation, and keep in mind that we have a computation error which depends on the approximation. The smaller the distance , the smaller our computation error.
For example, if is rational, then we only need to store and which of course have finite presentation. The problem is that if we want a very small computation error, we might end up using very big and , so they would require a lot of memory to store. As we are quite greedy mathematicians – can we find rationals which have very small error, but also the numerator and denominator are small? This is a much more interesting question, and the smaller the sizes of the numerator and denominator are with respect to the error, the better we think of the approximation.
What can we say about the comparison between these sizes and the error? As a starting point, we can always find such integers such that , in which case
Moreover, if we approximate with the rational or closer to it, then the error is at most . For example, if we want a error, we can always do it with a denominator at most .
As for bounding the numerator, for simplicity, lets assume that , so that This way bounding the denominator also bounds the numerator . In particular, in the example above to get the error , we look for which are at most .
What is a GOOD rational approximation (according to Dirichlet)
At this point Dirichlet steps in, and shows us how to do real approximation with one of the most simplest and elegant proofs there is in Mathematics that uses nothing more than the pigeon hole principle and just with it shows that there are infinitely many solutions to . As this is clearly true for rational, we shall prove it under the assumption that is irrational.
Before giving the proof, let’s reformulate it in a way which will be much easier to work with. The main idea of this reformulation is that we prefer to work with integers, rather than rational numbers (or more specifically, we want to work with which is the real parameter of our problem and not ).
As a first step, we simply move from trying to solve , to trying to solve which doesn’t have any rationals inside the absolute value. We can multiply again by , but then we will move from the simple linear function to the more complicated function . To avoid this, we use an interesting trick where we break our inequality to two inequalities as follows:
where is an integer. Note that for any fixed , the inequalities are linear in both and . Also, if we find a solution to both of these inequalities, then we have a solution to our original inequality, since
Finally, our new setting is much more natural when looking for better and better approximations. It basically says that for an approximation which is at most we allow ourselves to have the denominator at most . This formulation also easily implies that if we have such an approximation for every , then we will get infinity many solutions to (prove it!).
This new formulation has a very nice visualization attached to it. First, note that in a sense is the only parameter of the inequalities. Once we choose it, will always be either or in order to minimize . In particular, we can assume that this absolute value is always in . Equivalently, we can look on the fractional part of , which is in , and hope to find such where is either very small, or very close to . Or, if we glue together and (e.g. we live in ) to get a circle, then the question is whether is close to the gluing point on the circle.
Let’s look on the example with and :
In this picture we have the red points of the form rotate around the circle. The circle is also divided to equal parts, and our problem was finding a point in one of the parts touching the gluing point . In this example we can see one such point which is . The question is if we just had a very good luck by choosing and or maybe there is something else at work.
What would happen if there are no red points in the two blue parts near the gluing point? In this case we will need to distribute 7 points in the rest of the 5 blue regions, in which case one of them will contain at least two points by the pigeon hole principle (in the example above, we have in the same blue region). In particular, the distance between these two points will be small, and this is what Dirichlet used in his proof.
Theorem (Dirichlet): Let be any real number. Then for any integer there exists a solution to the inequalities
Proof: Consider the fractionsl parts and their locations in the cells of the form for . If one of them is in , then we are done. Otherwise, each of them falls in one of the cells where . By the pigeon hole principle we can find two distinct integers such that fall in the same cell, and hence
Setting and , we get a solution as required.
Exercise: Try to reprove the claim, but with instead of . Can you also prove it for general and not just integers?
The lattice of approximations
We are now ready to an even more geometric interpretation to Diophantine approximation. Consider again the inequalities
To make things a little more symmetric, we can change the second inequality to . The problem here is that then we will have a trivial solution, namely . But if we exclude this solution, then we get that same problem as above (i.e. we don’t care if is a negative integer, because we can always multiply by to get a solution to the original problem). So, rewriting our problem we see that we are looking for a nonzero vector such that , or an even better notation
In other words, we consider all the element in the lattice
and we ask whether this lattice contains a nontrivial (i.e. nonzero) point inside the rectangle . For example, for and as above and for , the picture we see is:
The interpretation of what we see is as follows: The height of a point is the integer and once we choose the height, then the different points on the same horizontal line correspond to choosing different ‘s. What we want to do is to choose such , so that the point would be as close as we can to the -axis. For we already saw that choosing is enough, since , so that its fractional part is which is smaller that . When we continue to , this fractional part is not enough since , but as Dirichlet’s theorem tells us, we can find another point which is in the box corresponding to . In the picture above we see that taking will do the job, and indeed and its distance from is .
To summarize, what we saw is that finding good rational approximations to a real number is equivalent to finding nontrivial points in a suitable lattice that are contained in certain boxes. We already know that this is possible, since this is exactly what Dirichlet showed us, but is there a geometric intuition for this result? Well, if is any lattice, and is a symmetric box around the origin, then of course it will contain a lattice point, namely the origin. But if we also assume that the box is “big enough” we would expect that it would contain at least one more lattice point, and the question that we are left with is what is “big enough” (in the example above, it had area equals to 4). This intuition is indeed true, and this result is called Minkowski’s theorem which started the field of what is now known as the geometry of numbers, and which I will write about in a future post.