We all know about Pythagoras and his obsession with triangles. Usually when we first learn about the rational and real numbers, we are also told about the Pythagoreans, a 6th century BCE cult that started with the followers of Pythagoras and which dealt with mysticism and Mathematics (and if it still exists and you are a member in it, please send me a message – I would love to join). The legend says that they loved ratios between natural numbers, i.e. rational numbers, and didn’t believe that there exist any other type of numbers. Unfortunately, Hippasus which was one of their own “betrayed” them and proved that is not rational. When he died by drowning, the Pythagoreans said that this was a punishment by the gods themselves for showing that the world of Mathematics is not perfectly rational. Well, it was either this, or because he built a dodecahedron inside a sphere. Anyway, these days we know that there are irrational numbers, of course, but it is nice to know that if you insult the gods of Mathematics you will be punished.
The moral from this story is that rational numbers are divine, but there are real numbers which are not rational. While there are irrational numbers, there are so many rational numbers so that we can get as close as we want to any real number. In this post we will try to investigate when should we say that an approximation is good, how good of an approximation can we hope to find for a given real number, and finish with a bit about the connection to lattices (see the post about lattices for the definitions and some examples).
What is a rational approximation
To start off, given a real number , a rational approximation is simply a rational number
where
is “small”. Whether it is a “good” rational approximation will be determined by how “small” this is.
As it is right now, this starting point can be very confusing to anyone who knows a little bit about rational numbers. Indeed, we can find such rational numbers which make this absolute value as small as we want, or more formally, for any there is a rational number which solves
.
To get a better intuition of the “goodness” of approximation, think about the problem of storing in your computer’s memory. Usually, we store numbers in memory using their binary presentation, however in general
might have an infinite such presentation. One solution is instead to store an approximation
of
instead which has a finite presentation, and keep in mind that we have a computation error which depends on the approximation. The smaller the distance
, the smaller our computation error.
For example, if is rational, then we only need to store
and
which of course have finite presentation. The problem is that if we want a very small computation error, we might end up using very big
and
, so they would require a lot of memory to store. As we are quite greedy mathematicians – can we find rationals which have very small error, but also the numerator and denominator are small? This is a much more interesting question, and the smaller the sizes of the numerator and denominator are with respect to the error, the better we think of the approximation.
What can we say about the comparison between these sizes and the error? As a starting point, we can always find such integers such that
, in which case
Moreover, if we approximate with the rational
or
closer to it, then the error is at most
. For example, if we want a
error, we can always do it with a denominator at most
.
As for bounding the numerator, for simplicity, lets assume that , so that
This way bounding the denominator
also bounds the numerator
. In particular, in the example above to get the error
, we look for
which are at most
.
What is a GOOD rational approximation (according to Dirichlet)
At this point Dirichlet steps in, and shows us how to do real approximation with one of the most simplest and elegant proofs there is in Mathematics that uses nothing more than the pigeon hole principle and just with it shows that there are infinitely many solutions to . As this is clearly true for
rational, we shall prove it under the assumption that
is irrational.
Before giving the proof, let’s reformulate it in a way which will be much easier to work with. The main idea of this reformulation is that we prefer to work with integers, rather than rational numbers (or more specifically, we want to work with which is the real parameter of our problem and not
).
As a first step, we simply move from trying to solve , to trying to solve
which doesn’t have any rationals inside the absolute value. We can multiply again by
, but then we will move from the simple linear function
to the more complicated function
. To avoid this, we use an interesting trick where we break our inequality to two inequalities as follows:
where is an integer. Note that for any fixed
, the inequalities are linear in both
and
. Also, if we find a solution to both of these inequalities, then we have a solution to our original inequality, since
Finally, our new setting is much more natural when looking for better and better approximations. It basically says that for an approximation which is at most
we allow ourselves to have the denominator
at most
. This formulation also easily implies that if we have such an approximation for every
, then we will get infinity many solutions to
(prove it!).
This new formulation has a very nice visualization attached to it. First, note that in a sense is the only parameter of the inequalities. Once we choose it,
will always be either
or
in order to minimize
. In particular, we can assume that this absolute value is always in
. Equivalently, we can look on the fractional part
of
, which is in
, and hope to find such
where
is either very small, or very close to
. Or, if we glue together
and
(e.g. we live in
) to get a circle, then the question is whether
is close to the gluing point on the circle.
Let’s look on the example with and
:

In this picture we have the red points of the form
rotate around the circle. The circle is also divided to
equal parts, and our problem was finding a point in one of the parts touching the gluing point
. In this example we can see one such point which is
. The question is if we just had a very good luck by choosing
and
or maybe there is something else at work.
What would happen if there are no red points in the two blue parts near the gluing point? In this case we will need to distribute 7 points in the rest of the 5 blue regions, in which case one of them will contain at least two points by the pigeon hole principle (in the example above, we have in the same blue region). In particular, the distance between these two points will be small, and this is what Dirichlet used in his proof.
Theorem (Dirichlet): Let be any real number. Then for any integer
there exists a solution
to the inequalities
Proof: Consider the fractionsl parts and their locations in the cells of the form
for
. If one of them is in
, then we are done. Otherwise, each of them falls in one of the
cells
where
. By the pigeon hole principle we can find two distinct integers
such that
fall in the same cell, and hence
Setting and
, we get a solution as required.
Exercise: Try to reprove the claim, but with instead of
. Can you also prove it for general
and not just integers?
The lattice of approximations
We are now ready to an even more geometric interpretation to Diophantine approximation. Consider again the inequalities
To make things a little more symmetric, we can change the second inequality to . The problem here is that then we will have a trivial solution, namely
. But if we exclude this solution, then we get that same problem as above (i.e. we don’t care if
is a negative integer, because we can always multiply
by
to get a solution to the original problem). So, rewriting our problem we see that we are looking for a nonzero vector
such that
, or an even better notation
In other words, we consider all the element in the lattice
and we ask whether this lattice contains a nontrivial (i.e. nonzero) point inside the rectangle . For example, for
and
as above and for
, the picture we see is:

The interpretation of what we see is as follows: The height of a point is the integer and once we choose the height, then the different points on the same horizontal line correspond to choosing different
‘s. What we want to do is to choose such
, so that the point would be as close as we can to the
-axis. For
we already saw that choosing
is enough, since
, so that its fractional part is
which is smaller that
. When we continue to
, this fractional part is not enough since
, but as Dirichlet’s theorem tells us, we can find another point which is in the box corresponding to
. In the picture above we see that taking
will do the job, and indeed
and its distance from
is
.
To summarize, what we saw is that finding good rational approximations to a real number is equivalent to finding nontrivial points in a suitable lattice that are contained in certain boxes. We already know that this is possible, since this is exactly what Dirichlet showed us, but is there a geometric intuition for this result? Well, if
is any lattice, and
is a symmetric box around the origin, then of course it will contain a lattice point, namely the origin. But if we also assume that the box is “big enough” we would expect that it would contain at least one more lattice point, and the question that we are left with is what is “big enough” (in the example above, it had area equals to 4). This intuition is indeed true, and this result is called Minkowski’s theorem which started the field of what is now known as the geometry of numbers, and which I will write about in a future post.