Hilbert's Tenth Problem. Diophantine Equations. Part 2

4.3. Investigation of Fermat's Equation

We will investigate only a special (the simplest!) case of Fermat's equation - where D=a²-1 for some natural number a>1:

x² - (a²-1)y² = 1.

It is easy to prove the existence of non-trivial solutions for this equation: simply take x=a, y=1. After this, all the other natural solutions can be calculated by using the following smart idea. Let us note that

x² - (a²-1)y² = (x+y*sqrt(a²-1)) * (x-y*sqrt(a²-1)) = 1.

Take our first non-trivial solution x=a, y=1:

a² - (a²-1) = (a+sqrt(a²-1)) * (a-sqrt(a²-1)) = 1.

Consider the n-th power:

(a+sqrt(a²-1))ⁿ * (a-sqrt(a²-1))ⁿ = 1.

Now let us apply Newton's binomial formula to the expression (a+sqrt(a²-1))ⁿ. For example, if n=2, then

(a+sqrt(a²-1))² = a² + 2a*sqrt(a²-1) + (a²-1).

I.e. some of the items contain sqrt(a²-1), and some do not. Let us sum up each kind of items separately:

(a+sqrt(a²-1))ⁿ = x_n(a) + y_n (a)sqrt(a²-1), -----------(1)

where x_n (a), y_n (a) are natural numbers. For example, x₂(a)=2a²-1, y₂ (a)=2a. Still, in this way we can obtain also

(a-sqrt(a²-1))ⁿ = x_n (a) - y_n (a)sqrt(a²-1) ----------(2)

with the same x_n (a) and y_n (a) (verify!). Now multiply (1) by (2):

(a²-a²+1)ⁿ = x_n² - (a²-1)y_n²,

x_n² - (a²-1)y_n² =1.

Hence, for any number n>=0 the pair

x = x_n(a),

y = y_n (a)

is a solution of the equation x²-(a²-1)y²=1. The values n=0, 1 yield the solutions that we already know: x=1, y=0, and x=a, y=1. Still, n=2 yields a new solution: x=2a²-1, y=2a.

From our definition of the numbers x_n (a), y_n (a) the following recurrent identities can be derived (m, n>=0):

x_m+n(a) = x_m(a)*x_n(a) + y_m(a)*y_n(a)*(a²-1),

y_m+n(a) = x_m(a)*y_n(a) + y_m(a)*x_n(a).

For m=1 this means:

x_n+1(a) = a*x_n(a) + (a²-1)*y_n(a),

y_n+1(a) = x_n(a) + a*y_n(a).
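The recurrent identities are easy to try out numerically. The following is a minimal sketch, not part of the original argument: the helper name pell is our own, and it simply iterates the m=1 step starting from (x₀, y₀) = (1, 0), checking the equation and the general identities for small indices.

```python
def pell(a, n):
    """Return (x_n(a), y_n(a)) by iterating the step recurrence n times."""
    x, y = 1, 0                      # n = 0: the trivial solution
    for _ in range(n):
        x, y = a * x + (a * a - 1) * y, x + a * y
    return x, y

a = 3
D = a * a - 1
for n in range(6):
    x, y = pell(a, n)
    assert x * x - D * y * y == 1    # every pair solves x^2 - (a^2-1)y^2 = 1
    print(n, x, y)

# The general identities for x_(m+n), y_(m+n), checked for small m, n.
for m in range(5):
    for n in range(5):
        xm, ym = pell(a, m)
        xn, yn = pell(a, n)
        assert pell(a, m + n) == (xm * xn + D * ym * yn, xm * yn + ym * xn)
```

Python integers are unbounded, so the exponential growth of x_n(a) causes no overflow here.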

Exercise 4.8. Prove these identities. Verify also that x_n (a) and y_n (a) are increasing functions of n (i.e. that they really yield an infinite set of solutions of the equation x²-(a²-1)y²=1).

It turns out that the sequence {(x_n(a), y_n(a)) | n>=0} covers all natural solutions of Fermat's equation.

Lemma 1. If a>1, then

x²-(a²-1)y²=1 <-> En(x=x_n(a) & y=y_n(a)).

Proof. 1) Leftwards. This we already have proved.

2) Rightwards. Let x, y be a solution of our equation. If x<=1, then x=1 and y=0, i.e. x=x₀(a), y=y₀(a). Now let x>1. Then y>0. If x, y would be x_n (a), y_n (a), and u, v would be x_n-1(a), y_n-1(a), then we would have:

x = au+(a²-1)v, -----------(3)

y = u+av.

Let us express u, v from these equations:

u = ax-(a²-1)y, -----------(3a)

v = -x+ay.

Now forget about x_n, y_n, x_n-1, y_n-1: the numbers u, v are simply calculated from x, y by formulas (3a).

Exercise 4.9. Verify that u²-(a²-1)v²=1, i.e. that (u, v) is a solution. Verify also that 0<u<x and v>=0.

Thus, if (x, y) is a solution of our equation, x>1, then these numbers can be expressed by formulas (3) through another solution (u, v) such that u<x. If u>1, again, we can express (u, v) through another solution (u', v') such that u'<u, etc. until we reach the solution (1, 0). If n is the number of these downward steps, then x=x_n(a) and y=y_n(a). Q.E.D.
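The downward steps of this proof can be sketched as a small program. This is only an illustration under our own naming (descend is a hypothetical helper): it applies formulas (3a) until the solution (1, 0) is reached and returns the number of steps.

```python
def descend(a, x, y):
    """Return n such that (x, y) = (x_n(a), y_n(a)), using formulas (3a)."""
    assert a > 1 and x >= 1 and y >= 0
    assert x * x - (a * a - 1) * y * y == 1, "(x, y) must be a solution"
    n = 0
    while x > 1:
        # One downward step: (x, y) -> (u, v) with 0 < u < x and v >= 0.
        x, y = a * x - (a * a - 1) * y, -x + a * y
        n += 1
    return n

print(descend(3, 99, 35))
```

For a=3 the solution (99, 35) descends through (17, 6) and (3, 1) to (1, 0).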

Thus we have an elegant (more than 300 years old) algorithm that allows us to calculate the sequence of all natural solutions of the equation x²-(a²-1)y²=1. What makes this algorithm important in the context of Hilbert's 10th problem?

Lemma 2. If a>1 and n>=0, then

aⁿ <= x_n (a) <= (a+sqrt(a²-1))ⁿ.

Proof.

x_n(a)+y_n(a)*sqrt(a²-1) = (a+sqrt(a²-1))ⁿ,

x_n(a) = a*x_n-1(a)+(a²-1)y_n-1(a) >= a*x_n-1(a).

Q.E.D.

Hence, as a function of n, x_n(a) grows exponentially. And this is achieved by a Diophantine condition F on x:

F(x) <-> Ey(x²-(a²-1)y²=1).

Not bad as the first step - if we wish to find, among others, a polynomial P(x, z₁, ..., z_m) such that

Ey(x=2^y) <-> Ez₁...Ez_m P(x, z₁, ..., z_m).

(These considerations were proposed by J.Robinson in her 1952 paper.)

Now let us follow the idea due to Matiyasevich: let us investigate the remainders from dividing the numbers x_n(a), y_n(a) by each other.

First, let n be fixed, n>0, and let us observe the remainders from dividing x_N(a) and y_N(a) by x_n(a) as N = 0, 1, 2, .... For this purpose we will consider mod x_n(a) the above recurrent identities for x_m+n, y_m+n. I.e. we will ignore items divisible by x_n (a):

x_m+n(a) = x_m(a)*x_n(a) + y_m(a)*y_n(a)*(a²-1) = y_my_n*(a²-1) mod x_n,

y_m+n(a) = x_m(a)*y_n(a) + y_m(a)*x_n(a) = x_my_n mod x_n.

Substitute m+n for m:

x_m+2n = (a²-1)y_m+ny_n = (a²-1)x_my_n² mod x_n,

y_m+2n = x_m+ny_n = (a²-1)y_my_n² mod x_n.

Now let us note that x_n²-(a²-1)y_n²=1, hence (a²-1)y_n² = x_n²-1 = -1 mod x_n. Thus, we can replace (a²-1)y_n² by -1:

x_m+2n = -x_m mod x_n, -----------(4)

y_m+2n = -y_m mod x_n.

Substitute m+2n for m:

x_m+4n = -x_m+2n = x_m mod x_n,

y_m+4n = -y_m+2n = y_m mod x_n.

Thus, the remainders of x_N(a) and y_N(a) mod x_n(a) are changing with the period 4n, and we can concentrate on investigating these remainders for N = 0, 1, 2, ..., 4n-1.

According to (4) we have (mod x_n):

x₀= x₀, x₁= x₁, ..., x_2n-1= x_2n-1,

x_2n= - x₀, x_2n+1= - x₁, ..., x_4n-1= - x_2n-1,

y₀= y₀, y₁= y₁, ..., y_2n-1= y_2n-1,

y_2n = - y₀, y_2n+1= - y₁, ..., y_4n-1= - y_2n-1.

Since the numbers x_n+1, ..., x_2n-1 exceed the divisor x_n, our analysis is not yet complete. To complete it, let us consider the recurrent identities expressing x_2n, y_2n through x_2n-m, y_2n-m and x_m, y_m:

x_2n = x_2n-m x_m + (a²-1)y_2n-m y_m,

y_2n = x_2n-m y_m + y_2n-m x_m.

Let us express x_2n-m, y_2n-m from these equations:

x_2n-m = x_2n x_m - (a²-1)y_2n y_m,

y_2n-m = y_2n x_m - x_2n y_m.

By (4), mod x_n: x_2n = -x₀ = -1 and y_2n = -y₀ = 0, thus we obtain:

x_2n-m = -x_m mod x_n,

y_2n-m = y_m mod x_n.

Now we can complete our analysis:

x₀= x₀, x₁= x₁, ..., x_n-1= x_n-1,

x_n= - x_n, x_n+1= - x_n-1, ..., x_2n-1= - x₁,

x_2n= - x₀, x_2n+1= - x₁, ..., x_3n-1= - x_n-1,

x_3n= x_n, x_3n+1= x_n-1, ..., x_4n-1= x₁,

y₀= y₀, y₁= y₁, ..., y_n-1= y_n-1,

y_n= y_n, y_n+1= y_n-1, ..., y_2n-1= y₁,

y_2n = - y₀, y_2n+1= - y₁, ..., y_3n-1= - y_n-1,

y_3n = - y_n, y_3n+1= - y_n-1, ..., y_4n-1= - y₁.
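The congruences behind this table can be checked mechanically for small parameters. The sketch below is only a sanity check, not a proof; the function names are our own. It tests the shift by 2n from (4), the reflection around 2n, and the period 4n, all mod x_n.

```python
def pell(a, n):
    """Return (x_n(a), y_n(a)) via the step recurrence."""
    x, y = 1, 0
    for _ in range(n):
        x, y = a * x + (a * a - 1) * y, x + a * y
    return x, y

def check_table(a, n):
    """Check the congruences mod x_n(a) for m = 0, ..., 2n."""
    xn = pell(a, n)[0]
    for m in range(2 * n + 1):
        xm, ym = pell(a, m)
        x1, y1 = pell(a, m + 2 * n)   # shift by 2n: both signs flip
        x2, y2 = pell(a, 2 * n - m)   # reflection: x flips sign, y does not
        x4, y4 = pell(a, m + 4 * n)   # shift by 4n: nothing changes
        if (x1 + xm) % xn or (y1 + ym) % xn:
            return False
        if (x2 + xm) % xn or (y2 - ym) % xn:
            return False
        if (x4 - xm) % xn or (y4 - ym) % xn:
            return False
    return True

print(all(check_table(a, n) for a in range(2, 6) for n in range(1, 6)))
```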

This result allows us to prove the following lemma (due to Matiyasevich):

Lemma 3. Let a>=3, n>=1, 0<=m<=n. Then for all N:

x_N(a) = x_m(a) mod x_n(a) <-> (N=+m mod 4n) v (N=-m mod 4n).

Proof. 1) Leftwards. If N=4kn+m or N=4kn-m, then x_N=x_m mod x_n follows from the results of the above analysis.

2) Rightwards. Let x_N=x_m mod x_n, where 0<=m<=n. Let us divide N by 4n: N=4kn+m', where 0<=m'<4n. If (accidentally) m'<=n, then (according to the results of the above analysis) m'=m, and N=4kn+m - Q.E.D.

Similarly, if 3n<=m', then m'=4n-m, and N=4(k+1)n-m - Q.E.D.

Exercise 4.10. Verify that the third alternative n<m'<3n is impossible. (Hint: if a>2, then i<j -> x_i(a) < x_j(a)/2.)

End of proof.
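Lemma 3 can also be tested by brute force for small values. The following is a minimal sketch under our own naming (pell_xs and lemma3_holds are hypothetical helpers, and limit is an arbitrary search bound that must exceed n); it compares both sides of the equivalence for all N below the bound.

```python
def pell_xs(a, count):
    """Return [x_0(a), ..., x_(count-1)(a)]."""
    x, y, xs = 1, 0, []
    for _ in range(count):
        xs.append(x)
        x, y = a * x + (a * a - 1) * y, x + a * y
    return xs

def lemma3_holds(a, n, limit):
    """Compare both sides of Lemma 3 for 0 <= m <= n and 0 <= N < limit."""
    xs = pell_xs(a, limit)
    xn = xs[n]
    for m in range(n + 1):
        for N in range(limit):
            left = (xs[N] - xs[m]) % xn == 0
            right = (N - m) % (4 * n) == 0 or (N + m) % (4 * n) == 0
            if left != right:
                return False
    return True

print(all(lemma3_holds(a, n, 30) for a in (3, 4, 5) for n in (1, 2, 3)))
```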

Now we must perform a similar investigation of remainders from dividing y_N(a) by y_n(a) (n is fixed, n>=1, N = 0, 1, 2, ...).

Exercise 4.11. Perform this investigation yourself. You will obtain that y_N(a) mod y_n(a) is changing with the period 2n, and (mod y_n):

y₀= y₀, y₁= y₁, ..., y_n-1= y_n-1,

y_n= - y_n, y_n+1= - y_n-1, ..., y_2n-1= - y_1.

From this result we can derive another lemma (due to Matiyasevich):

Lemma 4. Let a>=2, n>=1. Then y_N(a) is divisible by y_n(a), iff N is divisible by n.

Proof. Immediately from the results of the exercise 4.11.

The following very important (see below) lemma also is due to Matiyasevich:

Lemma 5. Let a>=2, n>=1. Then y_N(a) is divisible by y_n²(a), iff N is divisible by n*y_n(a).

Proof. You can easily verify (induction by k) that:

x_kn = x_n^k mod y_n²,

y_kn = kx_n^(k-1)y_n mod y_n².

1) Rightwards. If y_N(a) is divisible by y_n²(a), then by Lemma 4: N=kn. If y_kn is divisible by y_n², then kx_n^(k-1)y_n also is divisible by y_n², i.e. kx_n^(k-1) is divisible by y_n. Since x_n²-(a²-1)y_n²=1, the number x_n cannot have common divisors with y_n, hence, k is divisible by y_n. And since N=kn, N is divisible by ny_n.

2) Leftwards. If N is divisible by ny_n, then N=kn, where k is divisible by y_n. Hence, kx_n^(k-1)y_n is divisible by y_n², i.e. y_N=y_kn also is divisible by y_n².

Q.E.D.
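Both divisibility lemmas lend themselves to a quick numerical check. The sketch below uses our own helper names and an arbitrary bound limit; it only compares the two sides of Lemma 4 and Lemma 5 for small N and is not a substitute for the proofs.

```python
def pell_ys(a, count):
    """Return [y_0(a), ..., y_(count-1)(a)]."""
    x, y, ys = 1, 0, []
    for _ in range(count):
        ys.append(y)
        x, y = a * x + (a * a - 1) * y, x + a * y
    return ys

def check_lemmas_4_5(a, n, limit):
    """Test Lemma 4 and Lemma 5 for all N < limit (requires n < limit)."""
    ys = pell_ys(a, limit)
    yn = ys[n]
    for N in range(limit):
        if (ys[N] % yn == 0) != (N % n == 0):                # Lemma 4
            return False
        if (ys[N] % (yn * yn) == 0) != (N % (n * yn) == 0):  # Lemma 5
            return False
    return True

print(all(check_lemmas_4_5(a, n, 40) for a in (2, 3, 4) for n in (1, 2, 3)))
```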

We will need also the following three lemmas (Lemma 6 is from the 1952 paper by J.Robinson):

Lemma 6. Let a>=2, n>=1. Then:

x_n (a) = 1 mod(a-1),

y_n (a) = n mod(a-1).

Lemma 7. Let a, a' >=2, b>=1. Then, if a=a' mod b, then for all n:

x_n(a) = x_n(a') mod b,

y_n(a) = y_n(a') mod b.

Lemma 8. Let a>=2, k>=0. Then:

x_2k(a) = 1 mod 2, x_2k+1(a) = a mod 2,

y_2k(a) = 0 mod 2, y_2k+1(a) = 1 mod 2.

Exercise 4.12. Prove these lemmas by induction.
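Before proving the three lemmas, one may wish to test them on small numbers. This is a sketch with hypothetical names; a2 plays the role of a', and Lemma 8 is encoded as "x_n is odd for even n and has the parity of a for odd n, while y_n has the parity of n".

```python
def pell(a, n):
    """Return (x_n(a), y_n(a)) via the step recurrence."""
    x, y = 1, 0
    for _ in range(n):
        x, y = a * x + (a * a - 1) * y, x + a * y
    return x, y

def check_lemmas_6_7_8(a, a2, b, n):
    x, y = pell(a, n)
    X, Y = pell(a2, n)
    # Lemma 6: x_n = 1 and y_n = n mod (a-1).
    ok6 = (x - 1) % (a - 1) == 0 and (y - n) % (a - 1) == 0
    # Lemma 7: if a = a2 mod b, then x_n and y_n agree mod b.
    ok7 = (a - a2) % b != 0 or ((x - X) % b == 0 and (y - Y) % b == 0)
    # Lemma 8: parities of x_n and y_n.
    ok8 = x % 2 == (1 if n % 2 == 0 else a % 2) and y % 2 == n % 2
    return ok6 and ok7 and ok8

print(all(check_lemmas_6_7_8(a, a + 2 * b, b, n)
          for a in range(2, 6) for b in range(1, 6) for n in range(8)))
```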

4.4. Diophantine Representation of Solutions of Fermat's Equation

Now, following Matiyasevich, we must build a Diophantine representation of the predicate

F(a, x, y, n) <-> a>=3 & x=x_n(a) & y = y_n(a).

I.e. we must put on x, y some "Diophantine conditions" forcing x equal to x_n(a), and y - equal to y_n(a). Of course, we will begin with the condition

F₁: x²-(a²-1)y²=1.

Hence, there is m such that x=x_m(a) and y = y_m(a), and we must force m equal to n.

By Lemma 6, y_m (a) = m mod(a-1), hence, we could try putting the second condition y=n mod(a-1), then we would have m=n mod(a-1). Unfortunately, if n>=a-1, then we will not be able to conclude that m=n.

To avoid this difficulty, a turning movement (literally!) is necessary. Let us introduce another Fermat's equation with a free parameter A:

F₂: X²-(A²-1)Y²=1.

And now we will require not y=n mod(a-1), but

F₃: Y = n mod(A-1).

(Since A is free, we may hope to ensure n<A-1). Since, for some M, X=x_M(A) and Y=y_M(A), then by Lemma 6, Y=M mod(A-1), hence,

M = n mod(A-1). -----------(1)

This conclusion will be useful only, if we will be able to connect the new numbers (X, Y) with our initial numbers (x, y). So, let us introduce an additional module U, and let us require

F₄: A = a mod U & X = x mod U.

By Lemma 7, A = a mod U implies

x = x_m(a) = x_m(A) mod U,

X = x_M(A) = x_M(a) mod U.

From F₄ we have X = x mod U, hence

x_M(a) = x_m(a) mod U. ----------(2)

We could apply here Lemma 3, yet then U must be a solution of Fermat's equation with the same parameter a. So, let us introduce another number V, and let us require

F₅: U²-(a²-1)V²=1.

Hence, for some N: U=x_N(a) and V=y_N(a), and we can rewrite (2) as

x_M(a) = x_m(a) mod x_N(a).

To apply Lemma 3, we must ensure that m<=N. This can be achieved by putting the condition

F₆: x<=U

(since x_i(a) is increasing by i, x_m (a)=x<=U=x_N(a) means m<=N). Finally, we can apply Lemma 3:

(M = m mod 4N) v (M = -m mod 4N). ----------(3)

Now we are at the end of our turning movement. Let us compare (3) with (1):

M = n mod(A-1).

Our intention was to force m=n. We would have achieved this, if 4N would exceed M and m (then (3) would yield M=m or M=-m), and if A-1 would exceed M and n (this would yield M=n, i.e. m=n). The way to ensure both "exceed's" would be to force a large common divisor of A-1 and 4N. Still, we do not know the number N, how could we find a large divisor of 4N? On the other hand, we have Lemma 5: y_N(a) is divisible by y_m²(a), iff N is divisible by my_m(a). Or simply, V is divisible by y², iff N is divisible by my. Hence, if we will put the condition

F₇: V is divisible by y²,

then 4y will be a divisor of 4N (we omit m as an unknown number that we could not force to divide A-1). Now we must put the condition

F₈: A-1 is divisible by 4y

to force 4y to be a common divisor of 4N and A-1. After this, (1) and (3) yield:

(M = n mod 4y) & ((M = m mod 4y) v (M = -m mod 4y)).

Hence,

(n = m mod 4y) v (n = -m mod 4y).

Since y=y_m(a) is increasing by m, we have y>=m. On the other hand, we may put the condition

F₉: n<=y.

Finally, we must consider two possibilities:

1) n = m mod 4y, i.e. n-m is divisible by 4y. Since |n-m|<=y, this is possible, iff n=m. Q.E.D.

2) n = -m mod 4y, i.e. n+m is divisible by 4y. Since n+m<=2y, this is possible, iff n=m=0. Q.E.D.

Thus we have established that the condition

a>=3 & EAEXEYEUEV(F₁ & F₂ & F₃ & F₄ & F₅ & F₆ & F₇ & F₈ & F₉) ----------(4)

implies that x=x_n(a) and y=y_n(a), i.e. F(a, x, y, n).

Our task will be completed, if we will show that F(a, x, y, n) also implies (4). So, having a>=3 & x=x_n(a) & y=y_n(a), we must find the numbers A, X, Y, U, V such that F_i are satisfied for all i=1, 2, ..., 9.

F₁: x²-(a²-1)y²=1 is satisfied by Lemma 1.

F₉: n<=y is satisfied, since y_n(a) is increasing by n.

The numbers U, V (a solution of the same equation as x, y) we can choose in the following way: let N be the least even (see below!) multiple of ny, such that x_N(a)>=x (see F₆!), and let U=x_N(a), V=y_N(a). Then:

F₆: x<=U is satisfied.

F₅: U²-(a²-1)V²=1 is satisfied.

And by Lemma 5, V is divisible by y², i.e. F₇ is satisfied.

It remains to determine the parameter A of our auxiliary equation and its solution X, Y. The following conditions must be satisfied:

F₂: X²-(A²-1)Y²=1,

F₃: Y = n mod(A-1),

F₄: A = a mod U & X = x mod U,

F₈: A-1 is divisible by 4y.

1) Case n=0. Then x=1, y=0. F₄ is satisfied, since U=1. F₈ will be satisfied, iff we take A=1. After this, F₂ will be satisfied, iff we take X=1, and F₃ - iff we take Y=0. Q.E.D.

2) Case n>0. Then y>0. As the first step, let us use F₄ and F₈ to choose A. If the numbers U and 4y would have no common divisors, then we could obtain A from Chinese remainder theorem (see Section 3.3) - as a number A>1 that satisfies simultaneously A = a mod U and A = 1 mod 4y. Then F₈ and the first part of F₄ would be satisfied.

So, let us prove that U and 4y have no common divisors. On the one hand, U is an odd number (by Lemma 8, since N is even number, see above). On the other hand, V is divisible by y², and U²-(a²-1)V²=1, hence, U and y have no common divisors.

It remains to choose X, Y. Let us choose X=x_n(A) and Y=y_n(A). Then F₂ is satisfied. By Lemma 6, F₃ also is satisfied. And finally, since x=x_n(a) and A = a mod U, by Lemma 7 we obtain x_n(A) = x_n(a) mod U, and X = x mod U, i.e. the second part of F₄ also is satisfied. Q.E.D.

Thus, we have established the equivalence of F(a, x, y, n) and (4).
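The construction of A, X, Y, U, V in the case n>0 can be followed step by step in code. The sketch below is an illustration under assumptions: the names witnesses and check are our own, the Chinese remainder step is done with Python's built-in modular inverse pow(U, -1, m) (Python 3.8 or later), and only the case n>=1 is covered.

```python
def pell(a, n):
    """Return (x_n(a), y_n(a)) via the step recurrence."""
    x, y = 1, 0
    for _ in range(n):
        x, y = a * x + (a * a - 1) * y, x + a * y
    return x, y

def witnesses(a, n):
    """Return (A, X, Y, U, V) for a >= 3, n >= 1, following the proof."""
    x, y = pell(a, n)
    N = n * y if (n * y) % 2 == 0 else 2 * n * y  # least even multiple of n*y
    U, V = pell(a, N)                             # N >= n, hence U >= x
    m = 4 * y
    # Chinese remainder step: A = a mod U and A = 1 mod 4y (U, 4y coprime).
    A = a + U * (((1 - a) * pow(U, -1, m)) % m)
    X, Y = pell(A, n)
    return A, X, Y, U, V

def check(a, x, y, n, A, X, Y, U, V):
    """Test the conditions F1..F9 (for y > 0 and A > 1)."""
    return all([
        a >= 3,
        x * x - (a * a - 1) * y * y == 1,        # F1
        X * X - (A * A - 1) * Y * Y == 1,        # F2
        (Y - n) % (A - 1) == 0,                  # F3
        (A - a) % U == 0 and (X - x) % U == 0,   # F4
        U * U - (a * a - 1) * V * V == 1,        # F5
        x <= U,                                  # F6
        V % (y * y) == 0,                        # F7
        (A - 1) % (4 * y) == 0,                  # F8
        n <= y,                                  # F9
    ])

print(witnesses(3, 1))
```

For a=3, n=1 this yields A=37, X=37, Y=1, U=17, V=6. The witnesses grow very fast with n, which is expected: U is x_N(a) for N of the order of n*y_n(a).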

Exercise 4.13. Transform (4) into a Diophantine representation E(P=0). Determine the number of quantifiers E, the order and the sum of coefficient modules of the polynomial P.

4.5. Diophantine Representation of the Exponential Function

Now we will use the Diophantine representation of "Fermat's" predicate F(a, x, y, n) from the previous section to obtain a Diophantine representation of the exponential function, i.e. of the predicate

E(u, v, n) <-> u=vⁿ & v>=3

(assuming that 0⁰=1).

Let us start with our fundamental equality

(a+sqrt(a²-1))ⁿ = x_n(a) + y_n(a)*sqrt(a²-1).

Let us denote v = a+sqrt(a²-1). Then we will have simply vⁿ on the left hand side. On the right hand side we can replace sqrt(a²-1) by v-a:

vⁿ-x_n(a)-y_n(a)(v-a)=0.

Hence, this equation has the solution v₁=a+sqrt(a²-1). Since all the coefficients of it are rational numbers, the number v₂=a-sqrt(a²-1) also is its solution. On the other hand, v₁, v₂ are solutions of the equation

v²-2av+1=0.

Hence, the polynomial vⁿ-x_n(a)-y_n(a)(v-a) is divisible by v²-2av+1 in the field of rational numbers. Moreover, the coefficients of the quotient polynomial are integer numbers (because the leading coefficient of the divisor is 1). Thus, if v is integer, then the number vⁿ-x_n(a)-y_n(a)(v-a) is divisible by the number v²-2av+1. This is the main lemma from the 1952 paper by Julia Robinson:

Lemma 9. If a>=1 and n>=0, then

vⁿ = x_n(a)+y_n (a)(v-a) mod(v²-2av+1).

Exercise 4.14. a) Verify Lemma 9 for n=0 and n=1 (the above argument is working only for n>=2).

b) Verify that

vⁿ-x_n(a)-y_n(a)(v-a) = (v²-2av+1) (y₁v^(n-2)+y₂v^(n-3)+...+y_n-2 v+y_n-1).

(This will be a direct proof of Lemma 9 - without the above "smart" algebraic considerations.)
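Lemma 9 is easy to test for small integer values. In the sketch below the function name is our own, and we restrict ourselves to a>=2, where v²-2av+1 is never 0 for integer v (its roots are irrational).

```python
def pell(a, n):
    """Return (x_n(a), y_n(a)) via the step recurrence."""
    x, y = 1, 0
    for _ in range(n):
        x, y = a * x + (a * a - 1) * y, x + a * y
    return x, y

def lemma9_holds(a, n, v):
    """Check v^n = x_n(a) + y_n(a)(v-a) mod (v^2 - 2av + 1) for a >= 2."""
    x, y = pell(a, n)
    modulus = v * v - 2 * a * v + 1   # may be negative; only divisibility matters
    return (v ** n - x - y * (v - a)) % modulus == 0

print(all(lemma9_holds(a, n, v)
          for a in range(2, 6) for n in range(8) for v in range(12)))
```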

Lemma 9 allows us to connect the power vⁿ with the numbers x_n(a), y_n(a) by using polynomials of restricted order (v-a and v²-2av+1). Having this result, we can easily obtain a Diophantine representation of u=vⁿ.

Indeed, having the variables u, v, n, we must put some Diophantine conditions that will force u=vⁿ. As the first step, let us take some numbers a, x, y, n under the condition

E₁: F(a, x, y, n).

Then x=x_n(a) and y=y_n(a), and by Lemma 9:

vⁿ = x+y(v-a) mod(v²-2av+1).

In order to "bind" u and vⁿ, let us put the condition

E₂: u = x+y(v-a) mod(v²-2av+1).

Then

u = vⁿ mod(v²-2av+1). --------(1)

We could derive u=vⁿ from this congruence, if the module v²-2av+1 would be greater than both u and vⁿ. This can be achieved by increasing the free parameter a - then |v²-2av+1| will grow as 2av-v²-1. Thus the condition

E₃: u < 2av-v²-1

ensures one half of the necessary. Still, how to ensure vⁿ<2av-v²-1 - by using Diophantine conditions? I.e. we must force the parameter a to grow exponentially by n. We know already from Lemma 2, that x_n(v) is growing exponentially by n: x_n(v)>=vⁿ. Hence, we can try to force x_n(v)<2av-v²-1 instead of vⁿ<2av-v²-1. So, let us introduce the numbers X, Y such that

E₄: F(v, X, Y, n),

i.e. X=x_n(v) and Y=y_n(v). If we add also

E₅: X < 2av-v²-1,

then vⁿ <= x_n(v) = X < 2av-v²-1. Having this result plus E₃ and (1) we obtain u=vⁿ.

Thus, we have succeeded in deriving u=vⁿ from the condition

EaExEyEXEY(E₁ & E₂ & E₃ & E₄ & E₅). --------(2)

Since v>=3 is included in E₄, we have derived also E(u, v, n) from (2).
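The way conditions E₁ to E₅ pin down u can be imitated numerically: choose a so large that X=x_n(v) is less than 2av-v²-1, and then read off vⁿ as a remainder. The sketch below is only an illustration; the function name and the particular choice of a are our own assumptions.

```python
def pell(a, n):
    """Return (x_n(a), y_n(a)) via the step recurrence."""
    x, y = 1, 0
    for _ in range(n):
        x, y = a * x + (a * a - 1) * y, x + a * y
    return x, y

def power_via_pell(v, n):
    """Recover v**n (v >= 3) from x_n(a), y_n(a) for a sufficiently large a."""
    X, _ = pell(v, n)                            # E4: X = x_n(v) >= v**n
    a = max(3, (X + v * v + 1) // (2 * v) + 1)   # forces E5: X < 2av - v^2 - 1
    x, y = pell(a, n)                            # E1: x = x_n(a), y = y_n(a)
    m = 2 * a * v - v * v - 1                    # |v^2 - 2av + 1|
    return (x + y * (v - a)) % m                 # E2, E3: the only u < m

print(power_via_pell(3, 5))
```

Since 0 <= vⁿ < m and the remainder is also in this range, congruence mod m forces equality.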

Exercise 4.15. a) To complete the proof, derive (2) from E(u, v, n).

b) Transform (2) into a Diophantine representation E(P=0). Determine the number of quantifiers E, the order and the sum of coefficient modules of the polynomial P.

Thus, following the work by Matiyasevich and Julia Robinson, we have obtained for the predicate u=vⁿ & v>=3 a Diophantine representation

Ez₁...Ez_k P(u, v, n, z₁, ..., z_k)=0.

If we substitute v=3 and add the quantifier En, we obtain a Diophantine representation

Ev₁...Ev_s P₁ (u, v₁, ..., v_s)=0

of the predicate "u is power of 3". Hence, the equation P₁ (u, v₁, ..., v_s)=0 has natural solutions, iff the parameter u is 3ⁿ. This result was qualified as unexpected by some (anonymous?) number-theorists.

4.6. Diophantine Representation of Binomial Coefficients and Factorial Function

C_y^z denotes the coefficient at p^z in the Newton's binomial formula for (1+p)^y.

The factorial function y! is defined as follows: 0!=1, and if y>0, then y!=1*2*...*y.

Julia Robinson showed in 1952 how the predicates z<=y & x= C_y^z and x=y! can be "Diophantine expressed" through the predicate x=y^z. Now, using these methods, we can obtain Diophantine representations of these predicates.

Matiyasevich improved the first method in the following way. Let us start with the Newton's binomial formula for (1+p)^y:

(1+p)^y = sum { C_y^z p^z | for z=0 to y}. --------(1)

For p=1 we would have

2^y = sum { C_y^z | for z=0 to y}.

Thus, C_y^z <=2^y for all z<=y.

From (1) we can obtain also:

(1+p)^y = u + (C_y^z + vp)p^z,

where

u = sum {C_yⁱ pⁱ | for i=0 to z-1},

v = sum {C_yⁱ p^(i-z-1) | for i=z+1 to y}.

If we had u<p^z, then we could compute u as (1+p)^y mod p^z. And if we had also C_y^z <p, then we could compute C_y^z as ((1+p)^y-u)/p^z mod p, i.e. we would have reduced computing of C_y^z to computing of the exponential function.

Of course, if p would be large enough (for example, p=3^y+1), then C_y^z <p would be ensured. Still, how about u<p^z? Fortunately, for such a large p:

u <= sum {2^y pⁱ | for i=0 to z-1} = 2^y sum {pⁱ | for i=0 to z-1} = 2^y (p^z-1)/(p-1) = (2/3)^y (p^z-1) < p^z.

Hence, if we wish to force x= C_y^z & z<=y by putting Diophantine conditions, we may try to put

z<=y & EpEuEv (p=3^y+1 & (1+p)^y = u + (x + vp)p^z & x<p & u<p^z). --------(2)

We have already established that x= C_y^z & z<=y implies (2). The converse also is true. Indeed, according to (2), we can compute the value of u as (1+p)^y mod p^z, and the value of x - as ((1+p)^y-u)/p^z mod p. This is the way C_y^z is computed (see above), hence x= C_y^z.
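The reduction of C_y^z to exponentiation can be executed literally. The following sketch (the function name is our own) takes p=3^y+1, extracts u and then the coefficient, and compares the result with math.comb from the Python standard library (Python 3.8 or later).

```python
from math import comb

def binom_via_powers(y, z):
    """Compute C_y^z for z <= y using only powers, division and remainders."""
    p = 3 ** y + 1
    big = (1 + p) ** y
    u = big % p ** z                  # the part below p^z
    return ((big - u) // p ** z) % p  # the "digit" at p^z

print([binom_via_powers(5, z) for z in range(6)])
print(all(binom_via_powers(y, z) == comb(y, z)
          for y in range(10) for z in range(y + 1)))
```

In effect, (1+p)^y written in base p has the binomial coefficients as its digits, because each of them is less than p.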

Exercise 4.16. Transform (2) into a Diophantine representation E(P=0). Determine the number of quantifiers E, the order and the sum of coefficient modules of the polynomial P.

Now let us follow another idea due to Julia Robinson to obtain a Diophantine representation of the predicate x=y!. As you may know:

C_w^y = w(w-1)...(w-y+1)/y!.

If w would be much greater than y, then the product w(w-1)...(w-y+1) would be "approximately" w^y, and hence, y! would be "approximately" w^y/C_w^y. Let us examine this fraction more closely:

w^y/C_w^y = y! (w/w) (w/(w-1)) ... (w/(w-y+1)).

Let us replace w, w-1, ..., w-y+1 by w-y, then we will have:

y! <= w^y/C_w^y <= y!(w/(w-y))^y = y!(1+y/(w-y))^y.

Now, take w=y+yt:

y! <= w^y/C_w^y <= y!(1+1/t)^y = y!(1+sum{ C_yⁱt ^-i | for i=1 to y}).

Since C_yⁱ <= 2^y, let us take t=u2^y, then

y! <= w^y/C_w^y <= y!(1+y/u).

And finally, by taking u=2yy^y we will have (since y!<=y^y):

y! <= w^y/C_w^y <= y!+1/2.

Hence, if w=y+2y²2^yy^y, then y! can be computed as the integer part of the fraction w^y/C_w^y, and we can represent the predicate x=y! as

Ew(w=y+2y²2^yy^y & x=integer(w^y/C_w^y)). --------(3)
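Formula (3) can be tried directly for small y. The sketch below uses our own function name, relies on math.comb (Python 3.8 or later), and uses the convention 0⁰=1, which is also what Python's ** operator returns for integers.

```python
from math import comb, factorial

def factorial_via_binomial(y):
    """Compute y! as the integer part of w^y / C_w^y, w = y + 2y^2 2^y y^y."""
    w = y + 2 * y ** 2 * 2 ** y * y ** y
    return w ** y // comb(w, y)

print([factorial_via_binomial(y) for y in range(6)])
print(all(factorial_via_binomial(y) == factorial(y) for y in range(8)))
```

Already for y=7 the number w exceeds 10¹⁰, so this is a Diophantine device rather than a practical way of computing factorials.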

Exercise 4.17. Transform (3) into a Diophantine representation E(P=0). Determine the number of quantifiers E, the order and the sum of coefficient modules of the polynomial P.

Exercise 4.18. Build a Diophantine representation of the predicate "x is prime number". Hint (J.Robinson, 1952): x is prime, iff x and (x-1)! do not have common divisors. You can use also Wilson's theorem: x is prime number <-> x>1 & (x-1)!+1 is divisible by x. Which way is better?

Putnam's idea (Exercise 4.3) allows us to obtain from this representation a polynomial Q(x₁, ..., x_n) such that the set of positive values of Q is exactly the set of all prime numbers. Hence, contrary to the number-theoretic intuition prevailing in 1969, some kind of a "formula for prime numbers" does exist!

4.7. Elimination of Restricted Universal Quantifiers

Now we have arrived at our target - producing a method that will allow converting any formula

(Az<U)Ex₁...Ex_n P(b₁, ..., b_k, z, x₁, ..., x_n)=0, --------(1)

where U is a linear function of b₁, ..., b_k with natural coefficients, into an equivalent formula

Ey₁...Ey_q Q(b₁, ..., b_k, y₁, ..., y_q)=0.

We will follow mainly the 1961 paper by Davis, Putnam and Julia Robinson with some later improvements proposed by Matiyasevich and Julia Robinson.

For any fixed values of b₁, ..., b_k the formula (1) is an existential assertion (despite the universal quantifier Az<U) - it asserts the existence of U*n numbers: the values of x₁, ..., x_n for each z = 0, 1, ..., U-1. Let us denote these U*n numbers by x_i^(z):

for z=0: x₁⁽⁰⁾, x₂⁽⁰⁾,..., x_n⁽⁰⁾,

for z=1: x₁⁽¹⁾, x₂⁽¹⁾,..., x_n⁽¹⁾,

...

for z=U-1: x₁^(U-1), x₂^(U-1),..., x_n^(U-1).

We could eliminate the universal quantifier Az<U, if we could find some coding that allowed to represent this table by a sequence of m natural numbers y₁, ..., y_m (where m does not depend on U). Then we could try to replace Az<U by Ey₁...Ey_m (plus solving, of course, all the other remaining technical problems).

For example, let us try to code each of the n columns of our table by a single number using the Chinese Remainder theorem. If we had numbers u₀, u₁, ..., u_U-1 such that two of them never had common divisors, then we could obtain n numbers w₁, ..., w_n such that each x_i^(z) would be w_i mod u_z, i.e.

x_i^(z)<u_z & w_i = x_i^(z) mod u_z --------(2)

for all z<U and i = 1, ..., n. Of course, the numbers u_z must be large enough to serve this purpose.

Still, even if we will succeed in finding u₀, u₁, ..., u_U-1, then how to force the remainders x₁^(z), ..., x_n^(z) to satisfy the equation of (1) for all z = 0, 1, ..., U-1? Let us simply try to substitute the numbers w₁, ..., w_n for x₁, ..., x_n into the equation of (1). For z let us substitute some number Z to be determined later. What could we say about the value of P(b₁, ..., b_k, Z, w₁, ..., w_n)? If we added to (2) the condition

Z = z mod u_z for all z = 0, 1, ..., U-1, ---------(3)

then we could conclude that

P(b₁, ..., b_k, Z, w₁, ..., w_n) = P(b₁, ..., b_k, z, x₁^(z), ..., x_n^(z)) mod u_z.

Since all the right hand side values of P are 0, we obtain that

P(b₁, ..., b_k, Z, w₁, ..., w_n) = 0 mod u_z

for all z<U, i.e. the left hand side number is divisible by all the numbers u_z. Since two of these numbers never have common divisors, the left hand side number is divisible also by the product of them, i.e.

P(b₁, ..., b_k, Z, w₁, ..., w_n) = 0 mod u₀u₁...u_U-1. ----------(4)

Now let us view (4) not as a consequence of some assumptions, but as a condition that is put on the numbers w₁, ..., w_n. If the numbers x_i^(z) are defined as w_i mod u_z, then from (2), (3) and (4) we obtain that for all z<U:

P(b₁, ..., b_k, z, x₁^(z), ..., x_n^(z)) = 0 mod u_z.

We would like to force an "absolute" 0 on the right hand side instead of 0 mod u_z. This would be achieved, if the left hand side number would be less than u_z.

Exercise 4.19. Let N be the order of the polynomial P, M - the sum of its coefficient modules, z<U, and let X exceed all x_i^(z). Verify that

|P(b₁, ..., b_k, z, x₁^(z), ..., x_n^(z))| <= T,

where T = M((b₁+1)...(b_k+1)U(X+1))^N.

Hence, we must produce a (possibly simple) generator of divisors u_z (z = 0, 1, ..., U-1) such that:

a) u_z > T for all z<U.

b) The module of (4), i.e. the product u₀u₁...u_U-1 is a possibly simple (i.e. "Diophantine") function. Otherwise we will have problems with finding a Diophantine representation of u₀u₁...u_U-1.

c) Two of the numbers u_z never have common divisors.

The following idea of producing u_z is due to Matiyasevich and Julia Robinson. Let V be a large number (to start let U<=V), then we can generate u_z in such a way that u₀u₁...u_U-1 = C_V^U (i.e. b) will be satisfied). Indeed,

C_V^U = V(V-1)...(V-U+1)/U! = ((V+1)/1-1)((V+1)/2-1)...((V+1)/U-1).

Let us take

u_z = (V+1)/(z+1)-1.

If we put the condition "V+1 is divisible by U!", then all u_z will be integer numbers. If we put a stronger condition "V+1 is divisible by (U!)²", then two of these numbers never have common divisors (i.e. c) is satisfied).

Exercise 4.20. Verify that this is the case. (Hint: let d be a common prime divisor of u_i and u_j, consider u_i and u_i-u_j.)

If we put also the condition u_U-1>T (note that u_U-1 is the least of all u_z), i.e.

u_U-1 > M((b₁+1)...(b_k+1)U(X+1))^N,

then a) also is satisfied.
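The properties of the divisors u_z can be observed on small examples. This is a sketch with hypothetical helper names; it assumes U>=1 and that V+1 is divisible by (U!)², and it checks that the product of the u_z equals C_V^U, that no two of them have a common divisor, and that u_U-1 is the least of them.

```python
from math import comb, factorial, gcd

def divisors_u(U, V):
    """Return [u_0, ..., u_(U-1)] with u_z = (V+1)/(z+1) - 1."""
    assert (V + 1) % factorial(U) ** 2 == 0, "V+1 must be divisible by (U!)^2"
    return [(V + 1) // (z + 1) - 1 for z in range(U)]

def check_u(U, V):
    us = divisors_u(U, V)
    product = 1
    for u in us:
        product *= u
    coprime = all(gcd(us[i], us[j]) == 1
                  for i in range(U) for j in range(i + 1, U))
    return product == comb(V, U) and coprime and min(us) == us[-1]

print(divisors_u(3, 35), check_u(3, 35))
```

For U=3 and V=35 (so that V+1=36=(3!)²) the divisors are 35, 17, 11, and their product 6545 is C_35^3.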

Now let us sum up all the conditions we have put on the numbers we have introduced, i.e. w₁, ..., w_n, Z, X, V:

G₁: P(b₁, ..., b_k, Z, w₁, ..., w_n) = 0 mod C_V^U,

G₂: (Az<U) Z = z mod u_z,

G_3i: (Az<U) w_i mod u_z < X for each i = 1, ..., n,

G₄: u_U-1 > M((b₁+1)...(b_k+1)U(X+1))^N,

G₅: V+1 is divisible by (U!)²,

where, of course, u_z = (V+1)/(z+1)-1.

About G_3i: since T depends on X, we must ensure also: w_i mod u_z < X for all z<U and i = 1, ..., n (otherwise the estimate of the exercise 4.19 will not hold).

Exercise 4.21. Verify that (1) is equivalent to the following formula:

EZEXEVEw₁...Ew_n G₁ & G₂ & G₄ & G₅ & G₃₁ & ... & G_3n. -------(5)

(Hints. Rightwards: first choose X to satisfy G_3i's, then choose V to satisfy G₅ and G₄, generate the divisors u_z, obtain the number Z by using Chinese Remainder theorem to satisfy G₂, obtain the numbers w₁, ..., w_n by using Chinese Remainder theorem to satisfy (2), and finally, derive G₁. Leftwards: having the numbers w₁, ..., w_n, Z, X, V take for each z<U: x_i^(z) = w_i mod u_z, etc.)

Why should we view (5) as a step forward from (1), when G₂ and G_3i contain the same quantifier Az<U? In (1) this quantifier stands over an arbitrary Diophantine representation, but in G₂ and G_3i it stands over simple specific formulas!

First, we need not eliminate Az<U from G₂, we can eliminate the entire G₂. Indeed, we can take Z equal to V: since V-z is divisible by

u_z = (V+1)/(z+1)-1 = (V-z)/(z+1)

(the quotient (V-z)/u_z is equal to z+1), we have V = z mod u_z for all z<U.

So, we can delete G₂ from our list of conditions, replace G₁ by

G₁': P(b₁, ..., b_k, V, w₁, ..., w_n) = 0 mod C_V^U,

and delete the quantifier EZ from (5).

Now let us set to eliminating Az<U from G_3i. If w_i mod u_z < X, then one of the numbers w_i, w_i-1, ..., w_i-X+1 is divisible by u_z, i.e. their product

w_i (w_i-1)...(w_i-X+1) = w_i!/(w_i-X)! --------(6)

also is divisible by u_z for all z<U. Since two of the numbers u_z never have common divisors, the number w_i!/(w_i-X)! is divisible by their product u₀u₁...u_U-1 = C_V^U. Hence, if G_3i, then

G_3i': w_i!/(w_i-X)! is divisible by C_V^U.

Thus we have got rid of the quantifier Az<U by introducing well-known functions! Still, unfortunately, G_3i' does not imply G_3i, i.e. these conditions are not equivalent! Indeed, if we know only that the product (6) is divisible by another product C_V^U, then we cannot guarantee that for each z<U the factor u_z will divide one of the factors w_i, w_i-1, ..., w_i-X+1.

If the number R divides the product P₁P₂...P_k, then R=R₁R₂...R_k, where each factor R_i divides the corresponding P_i. If R_j is the maximum among the factors R_i, then R_j^k>=R, i.e. R_j>=root_k(R). Hence, if R divides the product P₁P₂...P_k, then R and one of the factors P_j have a common divisor >= root_k(R). This is the most we can guarantee!
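This guarantee can be confirmed by exhaustive search over small numbers. In the sketch below the function name is our own; the greatest common divisor of R and P_j is used as the largest candidate for a common divisor, and the comparison with root_k(R) is done without roots, as gcd^k >= R.

```python
from math import gcd

def has_large_common_divisor(R, factors):
    """True if R and some P_j have a common divisor >= the k-th root of R."""
    k = len(factors)
    return any(gcd(R, P) ** k >= R for P in factors)

# Exhaustive check for k = 3 and all factors below 9.
ok = all(has_large_common_divisor(R, [p, q, r])
         for p in range(1, 9) for q in range(1, 9) for r in range(1, 9)
         for R in range(1, p * q * r + 1) if (p * q * r) % R == 0)
print(ok)
```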

Thus, if we replace G_3i by G_3i', then we can guarantee only that some w_i-j (where 0<=j<X) and u_z have a common divisor >= root_X(u_z). Fortunately, this is enough to solve our problem completely!

Indeed, for a fixed z<U let us proceed from w₁ to w_n in the following way. We know that the product w₁(w₁-1)...(w₁-X+1) always is divisible by u_z. Then, first, for some number x₁^(z)<X the difference w₁- x₁^(z) is divisible by some divisor S₁>=root_X(u_z) of the number u_z. Of course, the product w₂(w₂-1)...(w₂-X+1) also is divisible by S₁. Hence, next, for some number x₂^(z)<X the difference w₂- x₂^(z) is divisible by some divisor S₂>=root_X(S₁)>=root_(X²)(u_z) of the number S₁ (and of u_z). Etc., finally, for some number x_n^(z)<X the difference w_n-x_n^(z) is divisible by some divisor S_n>=root_X(S_n-1)>=root_(Xⁿ)(u_z) of the number S_n-1 (and of u_z).

Hence, for all i = 1, ..., n:

w_i = x_i^(z) mod S_n, -------(7)

where S_n divides u_z (and hence, C_V^U), and S_n>=root_(Xⁿ)(u_z). From G₁' we have:

P(b₁, ..., b_k, V, w₁, ..., w_n) = 0 mod S_n,

hence, by (7) and, since V = z mod u_z,

P(b₁, ..., b_k, z, x₁^(z), ..., x_n^(z)) = 0 mod S_n.

Since z<U and all x_i^(z)<X, the left hand side value of P does not exceed

T = M((b₁+1)...(b_k+1)U(X+1))^N.

Hence, this value of P will be forced to be an "absolute" 0, if root_(Xⁿ)(u_z) will be greater than T. Thus, we must replace G₄ by a stronger condition

G₄': root_(Xⁿ)(u_U-1) > M((b₁+1)...(b_k+1)U(X+1))^N,

and our problem finally is 100% solved!

Exercise 4.22. Verify once more that (1) is equivalent to the formula

EXEVEw₁...Ew_n G₁' & G₄' & G₅ & G₃₁' & ... & G_3n'.

Transform this formula into a Diophantine representation.

Q.E.D.
