additive combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · additive combinatorics...

96
Additive Combinatorics Summer School, Catalina Island August 10th - August 15th 2008 Organizers: Ciprian Demeter, IAS and Indiana University, Bloomington Christoph Thiele, University of California, Los Angeles * supported by NSF grant DMS 0701302 1

Upload: others

Post on 28-Jun-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Additive Combinatorics

Summer School, Catalina Island ∗

August 10th - August 15th 2008

Organizers:

Ciprian Demeter, IAS and Indiana University, Bloomington

Christoph Thiele, University of California, Los Angeles

∗supported by NSF grant DMS 0701302

1

Page 2: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Contents

1 On the Erdos-Volkmann and Katz-Tao Ring Conjectures 5Jonas Azzam, UCLA . . . . . . . . . . . . . . . . . . . . . . . . . . 51.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51.2 The Ring, Distance, and Furstenburg Conjectures . . . . . . . 51.3 Main Results . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2 Quantitative idempotent theorem 10Yen Do, UCLA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102.2 The main argument . . . . . . . . . . . . . . . . . . . . . . . . 112.3 Proof of the induction step . . . . . . . . . . . . . . . . . . . . 122.4 Construction of the required Bourgain system . . . . . . . . . 15

3 A Sum-Product Estimate in Finite Fields, and Applications 21Jacob Fox, Princeton . . . . . . . . . . . . . . . . . . . . . . . . . . 213.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213.2 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . 233.3 Proof outline of Theorem 1 . . . . . . . . . . . . . . . . . . . . 23

4 Growth and generation in SL2(Z/pZ) 26S. Zubin Gautam, UCLA . . . . . . . . . . . . . . . . . . . . . . . . 264.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264.2 Outline of the proof . . . . . . . . . . . . . . . . . . . . . . . . 274.3 Part (b) from part (a) . . . . . . . . . . . . . . . . . . . . . . 284.4 Proof of part (a) . . . . . . . . . . . . . . . . . . . . . . . . . 29

4.4.1 A reduction via additive combinatorics . . . . . . . . . 294.4.2 Traces and growth . . . . . . . . . . . . . . . . . . . . 304.4.3 A reduction to additive combinatorics . . . . . . . . . . 31

4.5 Expander graphs . . . . . . . . . . . . . . . . . . . . . . . . . 324.6 Recent further progress . . . . . . . . . . . . . . . . . . . . . . 33

5 The true complexity of a system of linear equations 35Derrick Hart, UCLA . . . . . . . . . . . . . . . . . . . . . . . . . . 355.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 355.2 Quadratic fourier analysis and initial reductions . . . . . . . . 375.3 Dealing with f1 . . . . . . . . . . . . . . . . . . . . . . . . . . 39

2

Page 3: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

5.4 Finishing the proof of the main theorem . . . . . . . . . . . . 42

6 On an Argument of Shkredov on Two-Dimensional Corners 43Vjekoslav Kovac, UCLA . . . . . . . . . . . . . . . . . . . . . . . . 436.1 Some history and the main result . . . . . . . . . . . . . . . . 436.2 Outline of the proof . . . . . . . . . . . . . . . . . . . . . . . . 446.3 Some notation . . . . . . . . . . . . . . . . . . . . . . . . . . . 456.4 Main ingredients of the proof . . . . . . . . . . . . . . . . . . 46

6.4.1 Generalized von Neumann lemma . . . . . . . . . . . . 466.4.2 Density increment lemma . . . . . . . . . . . . . . . . 466.4.3 Uniformizing a sublattice . . . . . . . . . . . . . . . . . 47

6.5 Proof of the main theorem . . . . . . . . . . . . . . . . . . . . 48

7 An inverse theorem for the Gowers U3(G) norm over the finitefield F

n5 51

Choongbum Lee, UCLA . . . . . . . . . . . . . . . . . . . . . . . . 517.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517.2 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

7.2.1 Notation . . . . . . . . . . . . . . . . . . . . . . . . . . 527.2.2 Gowers uniformity norm Ud(G), ‖·‖Ud(G) . . . . . . . . 527.2.3 Local polynomial bias of order d, ‖·‖ud(B) . . . . . . . 53

7.3 Main Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 557.4 Outline of the proof . . . . . . . . . . . . . . . . . . . . . . . . 567.5 Application . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

8 New bounds for Szemeredi’s Theorem, II: A new bound forr4(N) 60Kenneth Maples, UCLA . . . . . . . . . . . . . . . . . . . . . . . . 608.1 The Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . 608.2 Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 618.3 Strategy and Initial Reductions . . . . . . . . . . . . . . . . . 618.4 Relevant Definitions . . . . . . . . . . . . . . . . . . . . . . . 638.5 Proof Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . 648.6 The Gowers U3-norm and the Quadratic Bohr Sets . . . . . . 65

9 An inverse theorem for the Gowers U3(G) norm 67Eyvindur Ari Palsson, Cornell . . . . . . . . . . . . . . . . . . . . . 679.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

3

Page 4: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

9.2 The inverse theorem . . . . . . . . . . . . . . . . . . . . . . . 719.3 Outline of proof for the inverse theorem . . . . . . . . . . . . . 72

10 On the Erdos-Volkmann and Katz-Tao ring conjectures 74Chun-Yen Shen, Indiana . . . . . . . . . . . . . . . . . . . . . . . . 7410.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7410.2 Preliminary results . . . . . . . . . . . . . . . . . . . . . . . . 7610.3 Outline of Proof of Theorem 1 . . . . . . . . . . . . . . . . . . 7810.4 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

10.4.1 The Falconer distance problem . . . . . . . . . . . . . 7910.4.2 Dimension of sets of Furstenburg type . . . . . . . . . 80

11 Quantitative bounds for Freiman’s Theorem 82Betsy Stovall, UC Berkeley . . . . . . . . . . . . . . . . . . . . . . . 8211.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8211.2 Applications, remaining conjectures . . . . . . . . . . . . . . . 8311.3 The proof of Theorem 2 . . . . . . . . . . . . . . . . . . . . . 84

11.3.1 Reduction to A ⊂ ZN . . . . . . . . . . . . . . . . . . . 8411.3.2 Finding a progression in 2A− 2A. . . . . . . . . . . . . 8511.3.3 From P0 ⊂ 2A− 2A to P ⊃ A . . . . . . . . . . . . . . 86

11.4 Producing a proper progression of small rank. . . . . . . . . . 86

12 Norm Convergence of Multiple Ergodic Averages of Com-muting Transformations 88Zhiren Wang, Princeton University . . . . . . . . . . . . . . . . . . 8812.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8812.2 Finitary versions of the main theorem . . . . . . . . . . . . . . 88

12.2.1 Finite type convergence statement . . . . . . . . . . . . 8912.2.2 Discretization of the space . . . . . . . . . . . . . . . . 89

12.3 Sketch of proof . . . . . . . . . . . . . . . . . . . . . . . . . . 9212.3.1 Koopman-von Neumann type decomposition . . . . . . 9212.3.2 Inductive step . . . . . . . . . . . . . . . . . . . . . . . 94

4

Page 5: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

1 On the Erdos-Volkmann and Katz-Tao Ring

Conjectures

after J. Bourgain [1]A summary written by Jonas Azzam

Abstract

In this paper, Bourgain solves the long standing conjecture firstposed by Erdos and Volkmann on whether or not there exists a Borelsubring of the real line of nonintegral dimension. In this summary, wediscuss the problem, Bourgain’s approach, and outline the preliminar-ies for the proof of his main result.

1.1 Introduction

The main aim of this paper is to prove the following:

Theorem 1. A Borel subring of the real line must have dimension 0 or 1.

This solves a long standing conjecture by Erdos and Volkmann aboutwhether such sets exist with fractional dimension. While this result wasproved simultaneously with G. Edgar and C. Miller [2], Bourgain’s approachhas a wider range of consequences due to the work of Katz and Tao. Here,we will go over their reformulation of the problem, as well as the other closelyrelated conjectures before discussing the preliminaries results to Bourgainsmain proof.

1.2 The Ring, Distance, and Furstenburg Conjectures

Falconer previously showed that if the ring has dimension greater than 1/2,then its dimension must be 1. This is a corollary of Falconer’s theorem thatif A ⊆ R satisfies dimA > 1/2, then

D(A) = |x− y| : x, y ∈ A

has positive Lebesgue measure (let alone dimension 1). With this fact, theproof is short: if A is a ring, then since D2(A×A) = |x−y|2 : x, y ∈ A×A ⊆

5

Page 6: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

A, the square function preserves dimension, and dim(A×A) ≥ 2 dimA > 1,we have

1 ≥ dim(A) ≥ dimD2(A×A) ≥ D(A× A) ≥ min1, 2 dimA = 1.

For more information on dimension geometric techniques employed here,see [4].

Falconer posed a general conjecture that if K is a compact subset ofthe plane with dimK ≥ 1, then dimD(K) = 1. A weaker version of thisconjecture is the following:

Conjecture 2. (Distance Conjecture) There is an absolute constant c > 0such that if dimK ≥ 1, then dimD(K) ≥ 1

2+ c.

So clearly, there is a relationship between the distance and ring con-jectures. Katz and Tao explored this relationship in more detail (see [3]).There, they developed discrete analogues of the distance and ring conjec-tures, in hopes that proving these analogues would lead to a proof of thenondiscrete versions.

Definition 3. We say A ⊆ Rn is a (δ, σ)n set if A is a union of balls ofradius δ and

|A ∩B(x, r)| ≤ Cδn−ǫ(r/δ)σ

for any x ∈ Rn and r ∈ [δ, 1].

This essentially says that A acts like the δ-neighborhood of a σ dimen-sional set. With this discretized version of a fractal set, Katz and Tao devel-oped a discretized version of the distance conjecture, as well as a discretizedversion of the Furstenburg conjecture. This latter problem asks whether thereis a lower bound γ(β) for the dimension of sets A such that for any directions ∈ S1, there is a line Ls in that direction such that dim(A ∩ Ls) ≥ β.

Conjecture 4. γ(1/2) ≥ 1 + c for some constant c > 0.

Finally, Katz and Tao develop the following analog for the ring conjec-ture1:

1This actually conjectures that there is no subring of dimension 1/2, however, byreplacing the 1/2 with σ, we get the more general conjecture.

6

Page 7: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Conjecture 5. (Discretized Ring Conjecture) Let A be a (δ, 1/2)1 set ofmeasure ∼ δ1/2. Then

|A+ A|+ |A.A| ≥ Cδ1/2−c

where c > 0 is some absolute constant.

As said earlier, these conjectures are all in fact related. In particular,Katz and Tao show:

1. The discritized distance conjecture implies the distance conjecture,

2. the discritized Furstenburg conjecture implies the Furstenberg conjec-ture, and

3. all three discritized conjectures are equivalent.

1.3 Main Results

In Bourgain’s paper, he proves the discretized ring conjecture and furthershows that this implies the original distance conjecture, and by the results ofKatz and Tao, this gives positive results for the other two conjectures. Thetwo main results are the following:

Theorem 6. If A is a (δ, σ)1 set, σ ∈ (0, 1), such that |A| > δσ+ǫ, then

|A+ A|+ |A.A| > δσ−c

for some absolute constant c = c(σ) > 0.

Theorem 7. The discretized ring conjecture (i.e. the previous theorem) im-plies the ring conjecture.

He first proves the latter theorem for dimension 2, that is, the discretizedring conjecture implies there does not exist a Borel subring of dimension 2(which we will demonstrate in lecture) and later shows how one can adaptthis to include all dimensions between 0 and 1.

Next, he introduces and develops some preliminary lemmas for the mainproof of the discretized ring conjecture.

First, he develops a partition theorem that says for sets A whose sumsetsare not too large, then the sets have some regular spacing.

7

Page 8: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Lemma 8. Let A ⊆ R such that

N(A + A, δ) < KN(A, δ)

and thatσ ∼ K(δN(A, δ))K−3

is small. Then there is q ∈ N such that σq> δ and

A ⊆⋃

a∈Z

B(ξ0 +a

q,σ

q).

Next, he’d like to prove a weaker version of this theorem but with aweaker restriction on the scale σ of the size of the intervals with respect oftheir spacing of each other. It uses the previous lemma the following basiclemma:

Lemma 9. AssumeN(2A, δ)

N(A, δ)< K.

Let η > δ and A =⋃

j∈S Aj where Aj = A∩Ij 6= ∅ and Ij is a partitionof the real line into intervals of length η. Let j∗ satisfy

N(Aj∗ , δ) = maxjN(Aj , δ)

and letS1 = j ∈ S : N(Aj , δ) > TK−1N(Aj + Aj∗ , δ).

Then ∑

j∈Sc1

N(Aj , δ) < 4TN(A, δ)

and if j ∈ S1,N(2Aj , δ)

N(Aj , δ)< (

K

T)3.

Lemma 10 (Main Lemma). Let A ⊆ R be a bounded set, δ > 0, and assume

N(2A, δ)

N(A, δ)< K and δN(A, δ) < κ.

Then there exist σ, δ′ > 0 with σδ′ > δ and A′ ⊆ A such that

8

Page 9: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

1. σ < (logK)Cκ(log K)−C,

2. A′ is contained in a union of intervals of length σδ′ and spaced apartby at least δ′, and

3. N(A′, δ) > N(A,δ)K log(1/δ)

.

So we can’t guarantee that the entire set A is evenly spaced apart, butwe can find a sufficiently large subset which is spaced by δ′, that is to saythat it’s porous. In the proof of the discretized ring conjecture, one assumes|A| > δ1/2+ǫ and |A + A| + |A.A| < δ1/2−ǫ. According to Katz and Tao,one may assume this implies |A.A − A.A| < δ1/2−ǫ. This lemma is used infinding a large subset C of A which is sufficiently porous such that on averagemultiplicative translates xC have small intersection and hence |x0C + xC|will be large on average, and this will help show that |A.A−A.A| is large fora desired contradiction.

References

[1] J. Bourgain, On the Erdos-Vokmann and Katz-Tao Ring Conjectures,GAFA 13 (2003), 334-365.

[2] G. A. Edgar, C. Miller, Borel subrings of the reals, Proc. Amer. Math.Soc. 131, no. 4 (2003), 1121-1129.

[3] N. Katz, T. Tao, Some connections between Falconer’s distance set con-jecture, and sets of Furstenberg type, New York J. Math. 7 (2001), 149-187(electronic).

[4] P. Mattila, Geometry of Sets and Measures in Euclidean Space. Cam-bridge Press, New York, NY, 1999.

Jonas Azzam

email: [email protected]

9

Page 10: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

2 Quantitative idempotent theorem

after B. Green and T. Sanders [1]A summary written by Yen Do

Abstract

The classical theorem proved by P. Cohen [2] states that any idem-potent measure on locally compact abelian group G lies in the cosetring of G, i.e.

µ =

L∑

j=1

±1γj+Γj

where the Γj are open subgroups of G. This theorem however does notgive any information about L. We will prove that L can be bounded by

eeC‖µ‖4

and the number of distinct open subgroups Γj may be boundedabove by ‖µ‖+ 1

100 .

2.1 Introduction

Let G be a locally compact abelian group with dual group G. Let M(G) bethe algebra of bounded regular Borel measures on G. A measure µ ∈ M(G)

is idempotent iff µ ∗ µ = µ, i.e. µ is a characteristics function on G.

Theorem 1 (Cohen’s idempotent theorem). µ is idempotent iff γ ∈ G :

µ(γ) = 1 lies in the coset ring of G, i.e.

µ =L∑

j=1

±1γj+Γj(1)

where the Γj are open subgroups of G.

This was proved by Paul Cohen [2]. Partial results was previously obtainedfor G = T by Helson [3] and G = Td by Rudin [4]. This theorem howeverdoes not give any information about L; for instance, it is trivial when G isfinite. We will prove the following quantitative version of Cohen’s theorem:

Theorem 2 (Quantitative idempotent theorem). Suppose that µ ∈ M(G)

is idempotent. Then in (1) we can bound L by eeC‖µ‖4

and the number ofdistinct open subgroups Γj by ‖µ‖+ 1

100.

10

Page 11: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

When G is finite, theorem 2 describes characteristics functions of the finiteabelian group G. These results, indeed, hold for Z-valued functions on G:

Theorem 3. Suppose that G is a finite abelian group. For any f : G → Z

with ‖f‖A := ‖f‖L1( bG) ≤M , we may write

f =

L∑

j=1

±1xj+Hj

where all xj ∈ G, Hj ≤ G subgroups and L ≤ eeCM4

. The number of distinctsubgroups Hj may be bounded above by M + 1

100

Notes. In theorem 3, G and ‖f‖A play the roles of G and ‖µ‖ in the idem-potent theorem. The number 1

100in the estimates of theorem 2 and 3 can be

improved to any given positive δ.

Theorem 2 can be deduced from theorem 3 by a standard argument usingfinite group approximation. In this exposition, we will prove theorem 3.

2.2 The main argument

The proof of theorem 3 is essentially by inducting on the algebra norm ‖f‖A.Ideally we would like to decompose f into f1 +f2 where f1 and f2 are integer-valued functions, with either smaller algebra norm or nice behavior (to bespecified below). It turns out to be easier to work with almost integer-valuedfunctions, i.e. functions that take values in Z + [−ǫ, ǫ] for some ǫ small.If f is an ǫ-almost Z-valued function, we define fZ to be its integer-valuedapproximation and write d(f,Z) ≤ ǫ.

Lemma 4 (Induction step). Suppose f : G→ R has fZ 6≡ 0, ‖f‖A ≤ M for

some M ≥ 1, and d(f,Z) ≤ δ, for some very small δ (says δ = ee−C1M4

, C1

large given). For any ǫ = ee−C0M4

, we may write f = f1 + f2, such that:

(i) The algebra norm is preserved: ‖f‖A = ‖f1‖A + ‖f2‖A;

(ii) the components are still almost Z-valued: d(f1,Z) ≤ d(f,Z) + ǫ, and(consequently) d(f2,Z) ≤ 2d(f,Z) + ǫ;

(iii) f2 has a significantly smaller algebra norm: ‖f2‖A ≤ ‖f‖A − 1/2;

11

Page 12: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

(iv) either f1 has smaller algebra norm (‖f2‖A ≤ ‖f‖A−1/2), or (f1)Z is a

sum of at most eeCM4

functions of the form ±1xi+H (in which case wesay it is finished).

Proof of Theorem 3 using Lemma 4. . Let f : G→ Z be a nonzero function,

with ‖f‖A ≤ M . Let ǫ = ee−C0M4

be a small parameter. If at every stepwe can apply lemma 4 on all unfinished components of f , the number ofsteps can not exceed 2M , because otherwise ‖f‖A ≥ 1

2(2M + 1) > M . Since

d(f,Z) = 0, estimate (ii) in lemma 4 shows that all the obtained componentsare δ-almost Z-valued functions, with

δ ≤ 2(2(. . . 2d(f,Z) + ǫ) . . . ) + ǫ) + ǫ < 22Mǫ

Thus, if C0 was chosen to be very large at the beginning, all intermediate un-finished components of f are admissible candidates for lemma 4. Thus, afterat most 2M steps, all functions will be finished, i.e. we have a decompositionf =

∑Lk=1 fk where:

(a) L ≤ 22M ;

(b) each (fk)Z can be written as a sum of at most eeCM4

functions of theform ±1xj,k+Hk

, where Hk ≤ G is a subgroup;

(c) d(fk,Z) ≤ 22Mǫ for all k.

From here it is not hard to see that f =∑L

j=1(fk)Z. Theorem 3 will be

proved if we can show that L ≤ M + 1100

. To see this, just notice that thetotal algebra norm is preserved and if (fk)Z 6≡ 0 for some k, we’ll have:

‖fk‖A ≥ ‖fk‖∞ ≥ ‖(fk)Z‖∞ − d(fk,Z) ≥ 1− 22Mǫ ≥ M

M + 1100

2.3 Proof of the induction step

¿From now on, we will always assume that G is finite abelian. The decom-position in lemma 4 will be of the form:

f = ψSf + (f − ψSf)

12

Page 13: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

here ψS is the Fourier multiplier operator with multiplier β1 and β1 has theform

1X1

|X1| ∗1X1

|X1| for some X1 ⊂ G will be a nice subset of G:

ψSf(γ) = f(γ)β1(γ)

It is not hard to see that 0 ≤ β1(γ) ≤ 1 for all γ, so the required condition oflemma 4 on algebra norm preservation is automatically satisfied. To meet theother requirements, we will construct X1 as part of a nice regular Bourgainsystem:

Definition 5 (Bourgain system). Let G be a finite abelian group and d ≥ 1be an integer. A Bourgain system S of dimension d is a collection (Xρ)ρ∈[0,4]

of subsets of G satisfying:

bs1 (Nesting) If ρ′ ≤ ρ we have Xρ′ ⊆ Xρ;

bs2 (Zero) 0 ∈ X0;

bs3 (Symmetry) If x ∈ Xρ then −x ∈ Xρ;

bs4 (Addition) For all ρ, ρ′ such that ρ+ ρ′ ≤ 4 we have Xρ +Xρ′ ⊆ Xρ+ρ′;

bs5 (Doubling) If ρ ≤ 1 we have |X2ρ| ≤ 2d|Xρ|.

For a Bourgain system S, we define |S| := |X1| to be its size and µ(S) :=|S|/|G| to be its density in G. Notice that there is a canonical probability

measure βρ on any X2ρ, defined by βρ :=1Xρ

|Xρ| ∗1Xρ

|Xρ| . Examples of a nontrivial

Bourgain system include Bohr systems, and the collection of closed balls atthe 0 in a translation-invariant pseudometrics space with doubling properties.

If λ > 0, we define the dilated of S by λS := (Xλρ)ρ∈[0,4]. The followingsimple lemma is useful in the sequel:

Lemma 6 (Dilated Bourgain systems). For any Bourgain system S and forλ ∈ (0, 1], we have dim(λS) = dim(S) and |λS| ≥ (λ/2)d|S|.Definition 7 (Regular Bourgain system). A Bourgain system S = (Xρ)ρ∈[0,4]

with dimention d is said to be regular if

∣∣∣1− |X1||X1+κ|

∣∣∣ ≤ 10dκ

whenever |κ| ≤ 110d

.

13

Page 14: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Remark. Really, this is the definition of a Bourgain system being regularat ρ = 1. The number 10 in the definition can be replaced by any positivenumber, it is chosen only for simplicity of the argument.

It is not hard to show that:

Lemma 8 (Regular dilated Bourgain systems). For any Bourgain system S,∃λ ∈ [1/2, 1] such that λS is regular.

Regular Bourgain systems has the following important property:

Lemma 9 (Almost invariance). If f is any complex-valued function on Gand S is a regular Bourgain system of dimension d on G, then for everyκ ∈ (0, 1) and y ∈ Xκ we have:

|ψSf(x+ y)− ψSf(x)| ≤ 20dκ‖f‖∞Now, to prove the induction step (i.e. lemma 4), we will use the followingproposition:

Proposition 10. Under the same assumptions as in the main lemma (Lemma4), we can find a Bourgain system S such that:

(A) Bounded dimension: d = dim(S) ≤ eCM4;

(B) Positive density µ(S) ≥ ee−CM4

‖fZ‖1;

(C) ψS does not kill f : ‖ψSf‖∞ ≥ 2e−CM4(to avoid a trivial decomposition

of f);

(D) ψSf remains almost Z-valued: d(ψSf,Z) ≤ d(f,Z) + ǫ

Proof that such a system gives a desired decomposition. First, recall that ǫ =e−C0M4

was given, and we can choose C0 to be very large compared to the con-stant C in property (C). Now, as previously indicated, we define f1 := ψSfand f2 := f − ψSf . Property (D) immediately gives us the desired controlon d(f1,Z) and d(f2,Z).

Now, since C0 is very large compared to C we will have ‖ψSf‖∞ > d(ψSf,Z)therefore (ψSf)Z 6≡ 0. This easily implies ‖f1|A ≡ ‖ψSf‖A ≥ 1− ǫ > 1/2 andgives the desired bound for ‖f2‖A.

14

Page 15: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Finally, assume ‖f1‖A > ‖f‖A − 1/2 and we will show that f1 is finished byshowing (f1)Z ≡ fZ and is constant on H :=< Xρ > for ρ = ǫ

20dM. If this is

done, we clearly can write:

(f1)Z = fZ =L∑

j=1

±1xj+H

where we may take

L ≤ M‖fZ‖‖1H‖1

and the desired bound L ≤ eeC′′M4

can be obtain by observing that:

‖1H‖1 ≥|Xρ||G| ≥ (ρ/2)dµ(S)

To prove the above claims, first notice that ‖f2‖A = ‖f‖A − ‖f1‖A < 1/2,this easily implies (f2)Z ≡ 0 and so fZ = (ψSf)Z. Now using the almostinvariance of regular Bourgain system and the bound (A) on dim(S), for anyx− x′ ∈ Xρ we easily have:

|(ψSf)Z(x)− (ψSf)Z(x′)| ≤ 2d(f,Z) + 3ǫ < 1

2.4 Construction of the required Bourgain system

In this section we sketch the main ideas used prove proposition 10. Theconstruction is done incrementally. It is not hard to construct a system sat-isfying given bounded dimension and given positive density; it is howeverharder to meet (C). After a Bourgain system with (A,B,C) have been con-structed, an averaging argument will be used to refine it to a system withadditional property (D):

Theorem 11. Suppose that f : G → M satisfies ‖f‖A ≤ M for someM ≥ 1, and also assume that d(f,Z) < 1/4. For any positive ǫ ≤ 1/4 andany Bourgain system S with dimension d ≥ 2, there is a regular Bourgainsystem S’ with

(i) dim(S ′) ≤ 4d+ 64M2

ǫ2;

15

Page 16: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

(ii) |S ′| ≥ e−CdM4

ǫ4log(dM/ǫ)|S|;

(iii) ‖ψS′f‖∞ ≥ ‖ψSf‖ − ǫ;

(iv) d(ψS′f,Z) ≤ d(f,Z) + ǫ.

Sketch of proof. We’ll construct a nested sequence of Bourgain systems S(0) =S, S(1), S(2), . . . until a system satisfying all (i-iv) is found. In this sequence,for every S(j) not having property (iv) (this is true for everyone, except for

the last one - if the sequence terminates), there is a subset Γ(j) ⊂ G suchthat

γ∈Γ(j)

|f(γ)| > ǫ2

16M(2)

These Γ(j) will be disjointly constructed, and this immediately implies thatthe sequence must stop at some j = J ≤ 16M2

ǫ2. Now the bound on j can be

used to show that every member S(j) of the sequence (including the last one,which we will take as S ′) will satisfy (i,ii,iii). The construction of Γ(j) andS(j+1) out of an S(j) without property (iv) is done through the following steps:

Step 1 : If property (iv) failed for S then for any x0 ∈ G and any ρ suffi-ciently small (says, ρ ≤ ǫ

80d′M) that makes ρS ′ regular, we can use the almost

invariance property (lemma 9) to show:

Ex∈G(f − ψS′)(x)2β ′ρ(x− x0) > ǫ2/4

Step 2 : By lemma 6 on dilated Bourgain systems, we can take ρ ∼ ǫd′M

in the

previous step. Plancherel’s theorem then implies the existence of γ(j+1)0 ∈ G

such that

|∑

γ∈ bG

f(γ)(1− β(j)1 (γ))β(j)

ρ (γ(j+1)0 − γ)| > ǫ2

8M

Now, (2) is satisfied if we define Γ(j) by truncating from G the tails in the

above sum, i.e. avoiding places where 1−β(j)1 (γ) and β

(j)ρ (γ

(j+1)0 − γ) are too

small:

16

Page 17: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Γ(j) := 1− β(j)1 (γ) >

ǫ2

32M2 ∩ |β(j)

ρ (γ(j+1)0 − γ)| > ǫ2

64M2

≡(γ

(j+1)0 + Specǫ2/64M2(β(j)

ρ ))\ Spec1−ǫ2/32M2(β

(j)1 )

Step 3 : The disjointness of Γ(j)’s will be ensured if we choose a regularBourgain system S(j+1) where

γ(j+1)0 + Specǫ2/64M2(β(j)

ρ ) ⊂ Spec1−ǫ2/32M2(β(j+1)1 )

and for every k ≤ j,

Spec1−ǫ2/32M2(β(k)1 ) ⊂ Spec1−ǫ2/32M2(β

(j+1)1 )

This could be done by joining a suitable dilated of S(j) and a small Bohrsystem of the character γ

(j+1)0 (needed to control the behavior of γ

(j+1)0 ).

The resulting system might need to be dilated by some λ ∈ [1/2, 1] to ensureregularity. More explicitly, we can choose:

S(j+1) := λ(κρS(j) ∧Bohrκ′(λ(j+1)

0 ))

where κ ∼ ǫ4/djM4, κ′ ∼ ǫ2/M2.

Step 4 : Standard bounds on dimension and size of Bohr systems then easilygive the required bounds (i,ii) on the size and dimension for S(j+1). Also,regularity and the nesting property of S(j) gives

|ψSf − ψS′f | = |f ∗ (β1 ∗ β ′1 − β1)|

which is small if β ′1 is supported on a very small neighbourhood of 0. The

latter is however guaranteed by the construction of the sequence S(j) (noticethat if the sequence actually stops at S(0) = S then we wouldn’t have thissmall support property, but clearly if this was the case nothing need to beproved).

We now sketch the main steps in constructing a Bourgain system with prop-erties (A,B,C) from given function f satisfying the condition of lemma 4.This is essentially based on the following result:

17

Page 18: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Proposition 12 (Weak Freiman). If A ⊂ G is a nonempty finite subset of afinite abelian group G, with |A+A| ≤ K|A| then there is a regular Bourgainsystem S = (Xρ)ρ∈[0,4] such that

dim(S) ≤ CKC , |S| ≥ e−CKC |A|, and ‖ψS1A‖∞ ≥ cK−C

When A is a large subset of G, a Bogolyubov-Chang’s argument [6, 7] canbe used to proved the above proposition; furthermore in that case it is pos-sible to show that X4 ⊂ 2A− 2A. The general case can then be reduced tothis special case by applying an analogue of Freiman’s theorem for arbitraryabelian groups [5].

To continue, we need the notion of arithmetic connectedness. First, we saythat a set a1, . . . , ak is dissociated if the only solution to ǫ1a1+· · ·+ǫkak = 0with ǫk ∈ 0, 1,−1 is the trivial solution.

Definition 13 (Arithmetic connectedness). A subset A ⊂ G with 0 6∈ A iscalled m-arithmetically connected for some m ≥ 1 if for any subset A′ of Awith size |A′| = m we will have either:

(i) A′ is not dissociated, or

(ii) A′ is dissociated, and there is some x ∈ A \ A′ with x ∈ 〈A〉.

Essentially, if A is anm-arithmetically connected then for any distinct x0, x1, . . . , xm

in A there is a nontrivial linear relation ǫ0x0 + · · ·+ ǫmxm = 0. Using an av-eraging argument it is not hard to show that any m-arithmetically connectedset A has at least e−CM |A|3 additive quadruples a1 + a2 = a3 + a4. Thisresults can be combined with a previous result by Gowers [8, Proposition 12]and the weak Freiman theorem (proposition 12) to deduce:

Proposition 14. If A is m-arithmetically connected nonempty subset of G\0 for some m ≥ 1, then there is a regular Bourgain system S satisfying:

dim(S) ≤ eCm

|S| ≥ ee−Cm |A|‖ψS1A‖∞ ≥ e−Cm

18

Page 19: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Proof of proposition 10. Take m comparable to M4, say m = [50M4] > 4.First consider the simpler case when f ≥ 0. Define:

A := Supp(fZ)

It is sufficient to prove that A is m-arithmetically connected. If A = G this istrivial, otherwise by translating f if necessary we can assume 0 6∈ A. Suppose.towards a contradiction, that A is not m-arithmetically connected. Then wecan find dissociated a1, . . . , am ∈ A such that 〈a1, . . . , am〉∩A = a1, . . . , am.Consider the function p(x) defined through a normalized Riesz product:

p(γ) :=1

|G|

m∏

i=1

(1 +1

2(γ(ai) + γ(ai)))

It is then not hard to see that ‖fZp‖A ≤ 2M2, and the function p is real,nonnegative and supported on 〈a1, . . . , am〉, and,

p(ai) =∑

−→ǫ :ǫ1a1+···+ǫmam=ai

2−P

j |ǫj| ≥ 1

2

Now as 〈a1, . . . , am〉∩A = a1, . . . , am we have fZp(x) =∑m

i=1 fZ(ai)p(ai)1ai(x).

This along with the dissociated property of a1, . . . , am gives

‖fZp‖22 ≥m

4|G|

‖fZp‖44 ≤3

|G|‖fZp‖42

An application of Holder inequality then easily gives ‖fZp‖A ≥√

m12

, contra-dicts previous upper bound ‖fZp‖A ≤ 2M2.

For general f , we consider g = f 2 and get the system S ′ for g. An argumentsimilar to the proof of theorem 11 can be used to refine this system to obtainthe desired system S for f .

References

[1] B. J. Green and T. Sanders, A quantitative version of the idempotenttheorem in harmonic analysis. to appear in Annals of Math.

19

Page 20: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

[2] P.J. Cohen, On a conjecture of Littlewood and Idempotent measures.Amer. J. Math. 82, no. 2, (1960), 191–212.

[3] H. Helson, Note on harmonic functions. Proc. Amer. Math. Soc. 4(1953), 686-691.

[4] W. Rudin, Idempotent measures on abelian groups. Pacific J. Math 9(1959) 195–209.

[5] B. J. Green and I. Z. Ruzsa, Freiman’s theorem in an arbitrary abeliangroup. Jour. London Math. Soc. 75 (2007), no. 1, 163–175.

[6] M.-C. Chang, A polynomial bound in Freiman’s theorem. Duke Math. J.113 (2002), no. 3, 399–419.

[7] Bogolyubov, Sur quelques proprietes des presque-periodes. Ann. ChaireMath. Phys. Kiev 4 (1939), 185–194.

[8] W. T. Gowers, A new proof of Szemeredi’s theorem for arithemtic pro-gressions of length four. Geom. Funct. Anal. 8 (1998), 529–551.

Yen Do, UCLA

email: [email protected]

20

Page 21: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

3 A Sum-Product Estimate in Finite Fields,

and Applications

after J. Bourgain, N. Katz, and T. Tao [1]A summary written by Jacob Fox

Abstract

We prove that if A ⊂ F =: Z/qZ with q prime and |A| ≤ |F |1−δ ,then |A + A| + |A · A| ≥ c(δ)|A|1+ǫ. This sum-product estimate hasapplications to several problems for finite fields, including a Szemeredi-Trotter type theorem, estimates for the Erdos distance problem, andto the three-dimensional Kakeya problem.

3.1 Introduction

Let A be a non-empty subset of a finite field F . Let

A+ A = a + b : a, b,∈ A

andA · A = a · b : a, b ∈ A.

It is clear that |A + A|, |A · A| ≥ |A|, and these bounds are sharp if A isa subfield of F . If F = Z/qZ with q prime, so F has no proper subfields,and |A| ≪ |F |, then one might expect to gain on this inequality. If A isan arithmetic progression, then |A + A| = 2|A| − 1 or A + A = F. If A isa geometric progression, then |A · A| = 2|A| − 1 or A · A = F∗, the set ofinvertible elements of F . It is natural to suspect that a subset A can not actboth like an arithmetic progression and a geometric progression. This ideais captured in the main theorem.

Theorem 1. Let F = Z/qZ with q prime, and let A be a subset of F suchthat |A| < |F |1−δ for some δ > 0. Then

max (|A+ A|, |A · A|) ≥ c(δ)|A|1+ǫ

for some ǫ = ǫ(δ) > 0.

Bourgain, Katz, and Tao prove this theorem with the extra assumptionthat |A| > |F |δ. This assumption was removed by Bourgain and Konyagin

21

Page 22: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

[2]. We will follow most closely the presentation in Green’s lecture notes [6].The best bound on ǫ is given by Katz and Shen [7], improving on an earlierestimate of Garaev [5].

An integer version of Theorem 1 was proved by Erdos and Szemeredi [4]much earlier. They showed max (|A+ A|, |A · A|) ≥ |A|1+ǫ for any set A ofintegers. The bound on ǫ has been improved several times, the most recent bySolymosi [8]. An old conjecture of Erdos, which is still open and consideredquite difficult, says that for each ǫ > 0, max(|A + A|, |A · A|) ≥ c(ǫ)|A|2−ǫ.An analogous conjecture for finite fields, which is at least as difficult, saysmax(|A+ A|, |A ·A|) ≥ c(ǫ) min(|A|2−ǫ, |F |1−ǫ).

We next discuss applications of Theorem 1 to three combinatorial geom-etry problems over finite fields. The following theorem is the first of theseresults.

Theorem 2. If one has N lines and N points in the finite plane (Z/qZ)2

with N ≪ q2, then there are at most O(N3/2−ǫ) incidences.

Using the fact that no two lines are incident to the same two points,which means the point-line incidence graph is has no cycle of length four, itis easy to show the number of incidences between N points and N lines is atmost O(N3/2). Note that the above theorem does better. Theorem 2 is ananalogue of a result of Szemeredi and Trotter [9] that N points and N linesin the plane R2 have O(N4/3) incidences, which is tight.

The second application, which uses Theorem 2, is to the Erdos distinctdistance problem. The problem asks: how many distinct distances are amongany N points in the finite plane (Z/qZ)2? A standard argument using The-orem 2 demonstrates that any N points with N ≪ q2 determine N1/2+ǫ

distinct distances. The original problem of Erdos was in the plane R2.The last application is to the finite field Kakeya problem. A Besicovitch

set in d-dimensions is a subset of (Z/qZ)d that contains a line in each di-rection. The finite field Kakeya problem asks how small can a Besicovitchset be? The finite field Kakeya conjecture states that every Besicovitch sethas at least cqd elements for some constant c = c(d) > 0. Wolff showedwhen d = 3 that every Besicovitch set has cardinality at least cq5/2. UsingTheorem 1, this bound can be improved to cq5/2+ǫ. The bound for the finitefield Kakeya problem was improved by Dvir [3] using completely differentmethods (algebraic). Shortly after Dvir first posted his paper, Alon and Taoobserved how Dvir’s proof can be easily modified to resolve the finite fieldKakeya conjecture.

22

Page 23: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

3.2 Preliminaries

We list here two lemmas, the Plunnecke-Ruzsa inequality and the Balog-Szemeredi-Gowers lemma, that we use for the proof of Theorem 1. Thesetwo lemmas are important tools for many problems in additive combinatorics.

For a positive integer k, let kA denote the k-fold sumset A+A+ · · ·+Awhere the number of A’s is k. We will also use the difference set

A− B := a− b : a ∈ A, b ∈ B

and the quotient set

A/B = a/b : a ∈ A, b ∈ B, b 6= 0.

The first lemma we use is the Plunnecke-Ruzsa inequality.

Lemma 3. Let A and B be finite non-empty subsets of a finite field F with|A+B| ≤ K|A|. Then for all positive integers k and ℓ, |kA−ℓA| ≤ Kk+ℓ|A|.

The next lemma we recall is the versatile Balog-Szemeredi-Gowers lemma.

Lemma 4. Let A,B be finite subsets of an additive group with |A| = |B|,and let G be a subset of A × B with |G| ≥ |A||B|/K and |a + b : (a, b) ∈G| ≤ K|A|. Then there exist subsets A′, B′ of A and B respectively with|A′| ≥ cK−C |A| and |B′| ≥ cK−C |B| such that |A′ − B′| ≤ CKC |A|.

3.3 Proof outline of Theorem 1

The next lemma is an important step toward the proof of the sum-productestimate. It says roughly that if neither A + A nor A · A grows, then thereis a large subset A′ ⊂ A such that A′ ·A′ −A′ ·A′ does not grow. The proofof this lemma uses the Balog-Szemeredi-Gowers lemma twice, once for themultiplicative group F ∗ and once with F as an additive group.

Lemma 5. Let A be a non-empty subset of F such that |A+A|, |A·A| ≤ K|A|.Then there is a subset A′ of A with |A′| ≥ cK−C |A| such that

|A′ · A′ −A′ · A′| ≤ CKC |A|.

The next lemma demonstrates that if A ·A−A ·A does not grow much,then neither does any polynomial expression of A.

23

Page 24: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Lemma 6. Let A be a non-empty subset of F such that |A ·A−A ·A| ≤ K|A|for some K ≥ 1. Then for any polynomial P of several variables and integercoefficients, we have

|P (A,A, . . . , A)| ≤ CKC |A|

where the constant C depends on P .

The previous lemma together with the multiplicative version of the Plunnecke-Ruzsa inequality demonstrates the following extension. Indeed, if P1 and P2

are polynomials, and Q = P1/P2 is a rational function, then Lemma 6 impliesthat, if if A · A− A · A does not grow, then P1 · P2, also a polynomial, doesnot grow and hence Q = P1/P2 does not grow as well.

Lemma 7. Let A be a non-empty subset of F such that |A ·A−A ·A| ≤ K|A|for some K ≥ 1. Then for any rational function Q of several variables andinteger coefficients, we have

|Q(A,A, . . . , A)| ≤ CKC |A|,

where the constant C depends on Q.

We now sketch the idea of the proof of Theorem 1. We may supposefor contradiction that max(|A + A|, |A · A|) ≤ K|A| with K = |A|ǫ, i.e,neither A+ A nor A · A grows. By Lemma 5, there is a subset A′ ⊂ A with|A′| ≥ cK−C |A| such that |A′ · A′ − A′ · A′| ≤ CKC |A|, i.e., A′ is a largesubset of A such that A′ · A′ − A′ · A′ does not grow. By Lemma 7, anyrational expression Q(A′, A′, . . . , A′) does not grow, i.e., there is a constantC1 depending only on Q such that Q(A′, A′, . . . , A′) ≤ C1K

C1 |A′|. But thefollowing proposition gives a rational function J such that J(A′) or A′ − A′

always grows, a contradiction.

Lemma 8. Let J(A) = a5

(a1a2−a3a4

a1−a3+ a6 : a1, . . . , a6 ∈ A, a1 6= a3

). Then

1. If |A| ≥ √q, then |Q(A)| ≥ q/2.

2. If |A| < √q, then |Q(A)| ≥ |A|32|A−A| .

24

Page 25: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

References

[1] Bourgain J., Katz N., and Tao T., A sum-product estimate in finitefields, and applications. Geom. Funct. Anal. 14 (2004), no. 1, 27–57.

[2] Bourgain J., and Konyagin S., Estimates for the number of sums andproducts and for exponential sums over subgroups in fields of prime or-der. C.R. Acad. Sci. Paris, Ser. I 337 (2003) 75–80.

[3] Dvir Z., On the size of Kakeya sets in finite fields. arXiv:0803.2336v3[math.CO]

[4] Erdos P., and Szemeredi E., On sums and products of integers. in:Studies in Pure Mathematics (Birkhauser, Basel, 1983) 213–218.

[5] Garaev M. Z., An explicit sum-product estimate in Fp.arXiv:math/0702780v1 [math.NT]

[6] Green B., Sum-product estimates, lecture notes.

[7] Katz N., and Shen C.-Y., A Slight Improvement to Garaev’s Sum Prod-uct Estimate. arXiv:math/0703614v1 [math.NT]

[8] Solymosi J., An upper bound on the multiplicative energy.arXiv:0806.1040v3 [math.CO]

[9] Szemeredi E., and Trotter W. T., Extremal problems in discrete geom-etry. Combinatorica 3 (1983), 381–392.

Jacob Fox, Princeton

email: [email protected]

25

Page 26: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

4 Growth and generation in SL2(Z/pZ)

after H. A. Helfgott [He1]A summary written by S. Zubin Gautam

Abstract

We summarize the proof in [He1] that, for any generating set A inSL2(Z/pZ), the diameter of the associated Cayley graph is O

((log p)O(1)

);

we also discuss auxiliary results related to expander graphs.

4.1 Introduction

Given a finite group G and a generating set A ⊂ G, the Cayley graph ofG with respect to A is the graph G(G,A) with vertex set G and edge set(g, ag) | g ∈ G, a ∈ A∪A−1. Thus the diameter of G(G,A) is the maximumlength of a word with letters in A∪A−1 needed to produce an arbitary elementof G.

An easy counting argument shows that diam(G(G,A)) ≥ log |G|/ log |A|,and in fact the diameter can be much larger in general for G abelian. How-ever, in the setting of non-abelian simple groups, Babai conjectured that thediameter should not in fact be much larger than the trivial lower bound:

Conjecture 1 (Babai). For any finite, simple, non-abelian group G and anygenerating set A ⊂ G,

diam(G(G,A)

). (log |G|)c

for some absolute constant c.2

For further background and related discussions, see e.g. [BHKLS]. Atpresent, Babai’s conjecture in full generality remains open; our goal is toverify it for the special case G = SL2(Z/pZ) for p a prime.3

2Here and in the sequel, we write X . Y or X = O(Y ) to mean X ≤ cY for someconstant c; subscripts on the symbols “.” or “O” denote dependence of the constant c,and in the absence of such subscripts c is understood to be absolute.

3Of course, SL2(Z/pZ) is not simple, but verifying the conjecture for this choice of Gis trivially equivalent to treating the case of G = PSL2(Z/pZ), which is simple for p > 3.

26

Page 27: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

4.2 Outline of the proof

To prove Babai’s conjecture for SL2(Z/pZ), we will show that a generic subsetA of SL2(Z/pZ) must grow rapidly when acting on itself repeatedly. Moreprecisely, the central result of our discussion is the following:

Key Proposition. Let p be a prime and A a subset of SL2(Z/pZ) notcontained in any proper subgroup.

(a) Suppose |A| < p3−δ for some fixed δ > 0. Then

|A ·A · A| &δ |A|1+ε,

where ε depends only on δ.

(b) Suppose |A| > pδ for some fixed δ > 0. Then

diam(G(SL2(Z/pZ), A)

).δ 1.

In the setting of this proposition, fix δ = 1, for instance. Then for anyp and any generating set A ⊂ SL2(Z/pZ), part (a) gives that one need onlymultiply A by itself at most O((log p)c) times to obtain a set A with |A| > p,where c is a universal constant. But then part (b) of the proposition yields

diam(G(SL2(Z/pZ), A)

). (log p)c

(G(SL2(Z/pZ), A)

). (log p)c,

which resolves the conjecture for SL2(Z/pZ).Our main task is thus to prove the Key Proposition. Part (b) of the

proposition actually relies on part (a); part (a) allows us to assume that Ais sufficiently large, from which point part (b) can be proven by exploitingstructural properties of SL2(Z/pZ) in combination with Fourier-analytic sum-product estimates for “large” sets in the field Fp = Z/pZ.4

Part (a) itself is also proved via a reduction to additive combinatorics infinite fields; this time, the transfer is to Fp2 and is achieved via the trace onSL2. The starting point is the observation that if A ⊂ SL2(Z/pZ) does notgrow rapidly under repeated action on itself, then A is “highly commutative”

4A quicker and slightly stronger proof of part (b) from part (a) has been given byNikolov and Pyber ([NP]); their approach works directly with the representation theory ofSL2(Z/pZ) in lieu of reducing matters to Fourier analysis on Z/pZ. ¿From the perspectiveof pushing results beyond SL2, this is perhaps a more apt approach; see [He2].

27

Page 28: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

in a suitable sense and thus contains a large simultaneously diagonalizablesubset. This fact is used to show that, broadly speaking, one may considerthe size of traces of subsets rather than subsets of SL2(Z/pZ) themselves.From this point, we are reduced to solving a problem in Fp2; the salient toolsused to finish the proof are the Balog–Szemeredi–Gowers regularity theorem,standard sum-product estimates for “small” sets in finite fields, and Ruzsadistances.

4.3 Part (b) from part (a)

In the sequel, for A ⊂ SL2(Z/pZ), Ak denotes the set of products of at mostk elements of A ∪A−1:

Ak = a1 . . . aℓ | aj ∈ A ∪A−1, ℓ ≤ k.

So we want to show that if |A| > pδ, there exists some k depending only on δsuch that Ak = SL2(Z/pZ). We can decompose SL2 as SL2(Z/pZ) = LULU ,where L is the subgroup of lower-triangular unipotent elements and U isthe subgroup of upper-triangular unipotents; thus, it suffices to show thatL ∪ U ⊂ Ak for some k depending only on δ. Additionally, part (a) ofthe Key Proposition affords us the luxury of strengthening our assumptionto |A| > c1p

c2 for any suitable choice of universal constants cj . Via anapplication of the pigeonhole principle, one can show that under such anassumption AA−1 contains many upper-triangular matrices and many lower-triangular matrices. We have thus reduced part (b) of the Key Propositionto a slightly weaker analogous statement in the Borel subgroups5 of SL2:

Lemma 2. Let p be a prime, H a Borel subgroup of SL2(Z/pZ), and A ⊂ Hwith |A| > 2p5/3 + 1. Then A8 contains all trace-2 elements of H ( i.e., allunipotent elements of H).

Proof sketch. Without loss of generality, we take H to be the subgroup ofupper-triangular matrices in SL2(Z/pZ). For C ⊂ H and r ∈ (Z/pZ)∗, define

Pr(C) :=

x ∈ Z/pZ

∣∣∣∣(r x0 r−1

)∈ C

; we want to show |P1(A8)| = p.

5In the present setting, the reader may take this to mean the subgroups of lower- andupper-triangular matrices; these are not all of the Borel subgroups, but they are all wewill need.

28

Page 29: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

By the pigeonhole principle, there exists r ∈ (Z/pZ)∗ for which |Pr(A)| >2p2/3; moreover, some computation and counting allows us to conclude that|P1(A4)| ≥ |Pr(A) − sPr(A)| for all s lying in some large set S ⊂ (Z/pZ)∗

with |S| > 12p2/3.

The following sum-product-type estimate then gives |P1(A4)| ≥ 23p :

Lemma 3. Let p be a prime, A ⊂ Fp, and S ⊂ F∗p. Then there exists s ∈ S

such that

|A+ sA| ≥(

1

p+

p

|S||A|2)−1

.

Thus, by the pigeonhole principle, |P1(A8)| = |P1(A4) + P1(A4)| = p, asdesired.

Lemma 3 is proved via basic Fourier analysis, driven by the observationthat, by Holder,

‖χA ∗ χsA‖L1(Z/pZ) ≤ |A+ sA| · ‖χA ∗ χsA‖L2(Z/pZ).

4.4 Proof of part (a)

4.4.1 A reduction via additive combinatorics

As a preliminary observation, we note that there is nothing special about thethreefold product in part (a) of the Key Proposition:

Lemma 4. For an integer n > 2 and A a finite subset of a group G, supposethere are c, ε > 0 such that |An| > c|A|1+ε. Then there exist constantsc′, ε′ > 0 depending only on c, ε, and n such that |A · A ·A| > c′|A|1+ε′.

The key idea of the proof (which we omit) is to use the triangle inequalityfor the Ruzsa distance. For a group G, the Ruzsa distance between two finitesubsets A,B ⊂ G is

d(A,B) = log

(|AB−1|√|A||B|

).

The Ruzsa distance, though not actually a metric, satisfies the triangle in-equality, and thus we have |AC−1||B| ≤ |AB−1||BC−1| for any finite subsetsA,B,C ⊂ G; this estimate is used repeatedly to prove Lemma 4.

29

Page 30: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

4.4.2 Traces and growth

In this section, we seek to control |Ak| by the size of traces of sets inSL2(Z/pZ). As mentioned above, we begin by observing that, heuristically,a subset A ⊂ SL2(Z/pZ) either grows rapidly under its action on itself orcontains a “large” simultaneously diagonalizable subset:

Proposition 5. Let K be a field and ∅ 6= A ⊂ SL2(K) a finite subset notcontained in any proper subgroup of SL2(K). Assume |Tr(A)| ≥ 2 and |A| ≥4. Then A4 contains at least

(|Tr(A)|−2)( 14|A|−1)

|A6| simultaneously diagonalizablematrices.

To prove this, one first shows that A2 contains at least 14|A| − 1 elements

with trace different from ±2. A counting argument and a little work then

give at least(|Tr(A)|−2)( 1

4|A|−1)

|A6| elements of A−12 A2 that commute with some

fixed g ∈ A2, Tr(g) 6= ±2. (The trace term here shows up as a lower boundfor the number of conjugacy classes intersecting the non-unipotent elementsof A−1

2 A2, which in turn arises in the aforementioned counting argument.)Finally, one uses the fact that in any linear algebraic group, the centralizerof an element with distinct eigenvalues (such as our g) is abelian.

Next, one establishes some general results on SL2 to the following effect:Let K be a field with algebraic closure K, |K| > 3, and let A ⊂ SL2(K) be afinite subset not contained in any proper subgroup. Then, loosely speaking,there is a universal constant k such that Ak acts “nontrivially” on generic

pairs of vectors in K2

(or on pairs of points in the projective space KP 1),in a suitable sense. These results are combined with Proposition 5 and stillmore counting arguments to provide both lower and upper bounds on thesize of A (or boundedly many products thereof) in terms of traces:

Proposition 6. For K and A as in Proposition 5 with |K| > 3, there existsk ∈ N independent of K,A such that

|Ak| ≥1

2

(1

4

(|Tr(A)| − 2)(14|A| − 1)

|A6|− 5

)((|Tr(A)| − 2)(1

4|A| − 1)

|A6|

)2

.

Proposition 7. For any field K and A a finite subset of SL2(K) not con-tained in any proper subgroup of SL2(K), there exists k ∈ N independent ofK,A such that

|Tr(Ak)| & |A|1/3.

30

Page 31: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

4.4.3 A reduction to additive combinatorics

At last, we set out to prove part (a) of the Key Proposition. At any stage ofour reasonings, we may of course assume p and |A| to be larger than any fixedconstants we wish. Moreover, given any reasonable fixed constants c, α > 0and k ∈ N, we may assume |Ak| < c|A|α; if not, we are done by Lemma4. Throughout the following, kj and cj will denote suitably chosen universalconstants.

Combining these observations with Propositions 5 and 7, we see thatAk0 contains a large set B of simultaneously diagonalizable matrices; the setV ⊂ Fp2 of their eigenvalues must be at least as large. Adjusting constants ifnecessary as discussed in the previous paragraph, it turns out we may assume

|V | & |A|α

for any α < 1/3 andCδ < |V | < p1−δ/3

for any given Cδ which depends only on δ. We know how to relate the size ofAk to the size of Tr(Aj) ⊂ Fp for suitable k, j, so we might be tempted to uselower bounds for the size of the set x + x−1 | x ∈ V . This is not actuallysufficient, but we might hope to do better by considering |f(x, y) | x, y ∈ V |for some more general function f ; this approach will work, up to replacing Vby a suitable Vk. Based on the actual estimate for the size of |V |, an appealto Proposition 6 and Lemma 4 shows that we need an estimate of the form

|Tr(Ak)| & |V |1+ε (1)

for suitably large k.Now from the general results on SL2(K) mentioned in the previous sec-

tion, it so happens that there is some g =

(a bc d

)∈ Ak1 with a, b, c, d all

nonzero. By computing Tr(h1gh2g−1) for h1, h2 ∈ B, we can show

Tr(Ak′) ⊃ ad(xy + x−1y−1)− bc(x−1y + xy−1) | x, y ∈ Vk

for any k ∈ N (here k′ = k · k0 + k1 + k · k0 + k1 . k, actually). Returning tothe desired estimate (1), we see that the following sum-product-type resultwill finish the proof:

31

Page 32: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Proposition 8. For Fq the finite field of q elements, let a1, a2 ∈ F∗q and

δ > 0. Then there exist C, ε > 0 depending only on δ such that for anyA ⊂ F∗

q with C < |A| < p1−δ,

|f(x, y) := a1(xy + x−1y−1) + a2(x−1y + xy−1) | x, y ∈ A20| > |A|1+ε.

This result should be compared with the sum-product theorem of Bourgain–Katz–Tao ([BKT]), which essentially replaces the function f(x, y) with thesimpler function x+ y; cf. also [Ko].

Proof sketch. A little algebra shows that it’s enough to prove

|a1(r + r−1) + a2(s+ s−1) | r, s ∈ bA2k| > |A|1+ε

for any b ∈ Aj, where j and k are suitably chosen. (Here A2k denotes the

squares of elements of Ak, rather than AkAk.) Further, we can eliminate thecoefficients a1 and a2 via an application of the Ruzsa triangle inequality. Nowit so happens that we can cover A4 by a well-controlled number of cosets ofA2

2, which eventually reduces us to showing

|(x+ x−1) · (y + y−1) | x, y ∈ A2| > |A|1+ε.

To finish the proof, we set w(x) := x + x−1 and note that w(x)w(y) =w(xy)+w(xy−1). If we suppose toward a contradiction that |w(A2)w(A2)| ≤|A|1+ε, then a variant of the Balog–Szemeredi–Gowers theorem gives thebound |w(A′)+w(A′)| . |A|1+O(ε) for some large subset A′ ⊂ A2. This, com-bined with our original assumption on w(A2)w(A2), contradicts establishedsum-product estimates (cf. e.g. [TV], §2.8).6

4.5 Expander graphs

A good reference for the material in this section is the book [Lu].

Definition 9. Let X(V,E) be a connected k-regular graph with |V | = n. Xis an (n, k, c)-expander graph if for all A ⊂ V with |A| ≤ n/2 we have

|∂A| ≥ c|A|,

where ∂A := x ∈ V | d(x,A) = 1.6Here one combines the aforementioned Bourgain–Katz–Tao theorem with a result of

Heath-Brown–Konyagin (Lemma 5 of [H-BK]); the proof in [TV] unifies these and bypassesthe number-theoretic techniques of [H-BK].

32

Page 33: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

A family of k-regular graphs Xi(Vi, Ei) with |Vi| = ni →∞ is an expanderfamily if each Xi is an (ni, k, c)-expander for some fixed c > 0. Equivalently,given a k-regular graph X(V,E), consider the normalized random walk op-erator

TX f(x) =1

k

(x,y)∈E

f(y)

on L2(V ). The largest eigenvalue of TX is 1 for any graph, and Xi isan expander family if and only if the second-largest eigenvalues of TXi

areuniformly bounded away from 1 (the family has “(uniform) spectral gap”).

Cayley graphs have proven to be a a good source of expander graphs, andthose of SL2(Z/pZ) with respect to certain generating sets are known to beexpander families in p. It is not hard to see that if G(G,A)G,A yields afamily of k-regular expanders, then diamG(G,A) .k log |G|, which gives animprovement over Babai’s conjecture within the family. Consider the family

Fk = G(SL2(Z/pZ), A)|p prime, 〈A〉 = SL2(Z/pZ), |A ∪ A−1| = k.

Of course, we have not shown Fk to be an expander family. However, theproof we have given does imply that G(SL2(Z/pZ), A) has a spectral gap ofat least Ck(log p)−O(1).

4.6 Recent further progress

To conclude, we mention some recent results related to those presented thusfar. Concerning expanders, building off of the results we have discussed,Bourgain and Gamburd ([BG1]) were able to characterize the sets in SL2(Z)whose projections mod p provide expander Cayley graphs of SL2(Z/pZ), aswell as to prove that Cayley graphs of SL2(Z/pZ) with respect to randomgenerating sets of fixed size form expander families in p, “asymptoticallyalmost surely.” Furthermore, in [BG2], they obtained analogous spectralgap results for appropriate subgroups of SU2(C); the methods of both [BG1]and [BG2] are themselves highly additive-combinatorial.

Finally, Helfgott ([He2]) has proven the analogue of the Key Propositionfor SL3(Z/pZ), thus resolving Babai’s conjecture for the family SL3(Z/pZ)p.The proof follows the same approach to that which we have described, and itrelies on the result for SL2; however, it involves a bit more use of the groupstructure than we needed.

33

Page 34: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

References

[BHKLS] L. Babai, G. Hetyei, W. M. Kantor, A. Lubotzky, and A. Seress.On the diameter of finite groups. Proc. 31st IEEE FOCS (1990), 857–865.

[BG1] J. Bourgain and A. Gamburd. Uniform expansion bounds for Cayleygraphs of SL2(Fp). Ann. of Math. 167 (2008), 625–642.

[BG2] ——. On the spectral gap for finitely-generated subgroups of SU(2).Invent. Math. 171 (2008), 83–121.

[BKT] J. Bourgain, N. Katz, and T. Tao. A sum-product estimate in finitefields, and applications. Geom. Funct. Anal. 14 (2004), 27–57.

[H-BK] D. R. Heath-Brown and S. V. Konyagin. New bounds for Gauss sumsderived from kth powers, and for Heilbronn’s exponential sums. Quart.J. Math. 51 (2000), 221–235.

[He1] H. A. Helfgott. Growth and generation in SL2(Z/pZ). Ann. of Math.167 (2008), 601–623.

[He2] ——. Growth in SL3(Z/pZ). Preprint, arXiv:math.GR/0807.2027v1,2008.

[Ko] S. V. Konyagin. A sum-product estimate in fields of prime order.Preprint, arXiv:math.NT/0304217v1, 2003.

[Lu] A. Lubotzky. Discrete Groups, Expanding Graphs and Invariant Mea-sures, Progress in Math. 195, Birkhauser, Basel, 1994.

[NP] N. Nikolov and L. Pyber. Product decompositions ofquasirandom groups and a Jordan-type theorem. Preprint,arXiv:math.GR/0703343v3, 2007.

[TV] T. Tao and V. Vu, Additive Combinatorics, Cambridge Studies in Adv.Math. 105, Cambridge UP, Cambridge, 2006.

S. Zubin Gautam, UCLA

email: [email protected]

34

Page 35: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

5 The true complexity of a system of linear

equations

after W.T. Gowers and J. Wolf [3]A summary written by Derrick Hart

Abstract

Let L1, . . . Lm be a system of m linear forms with d variables froman abelian group G. We say that such a system has true complexityk if control over the Gowers Uk+1 norm of subset A implies thatLi(x) ∈ A for i = 1, . . . m for approximately the statistically correctnumber of x’s. We give a necessary condition for certain systems tohave true complexity 2 in the case G = F

np .

5.1 Introduction

In many problems in additive combinatorics it is often necessary to under-stand when an appropriately uniform subset of an abelian group G will con-tain not only a system of linear forms but the expected number of them.Perhaps the simplest motivating example is the case of three term arith-metic progressions in Zp. It is a short exercise to show that a subset A of

Zp which uniform in the sense that supξ 6=0|A(ξ)| is small then A will containthe statistically correct number of three term arithmetic progressions.

In order to deal with four term arithmetic progressions and in generalsystems of linear forms one needs a much more complicated analysis. LetL = (L1, . . . , Lm) be a system of linear forms with d variables. In order tocount the images of these forms in a set A one considers the expression

Ex1,...,xd∈G

m∏

i=1

A(Li(x1, . . . , xd)).

Let f be the balance function given by f(x) = A(x)− |A|/|G|. Substitutingin the above expression,

Ex∈Gd

m∏

i=1

A(Li(x)) =

( |A||G|

)m

+ Middle Terms + Ex∈Gd

m∏

i=1

f(Li(x)).

If we can show that the second and third term are small in absolute valuethen we will get the desired result. The Middle Terms will automatically be

35

Page 36: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

small if one has control over Ex∈Gd

∏mi=1 f(Li(x)) and so this term will be

the main focus of our attention.The key tools in controlling this term are the Gowers uniformity norms.

Definition 1. [2] Let G be a finite Abelian group. For any k ≥ 1 andf : G→ C, define the Gowers Uk-norm by the formula

‖f‖2k

Uk := Ex,h1,...,hk∈G

w∈0,1k

C |w|f(x+ w · h),

where C |w|f = f is∑

iwi is even and f otherwise.

In this article we are chiefly interested in the U2 and U3 norms. Wesay that a function is c-uniform if ‖f‖U2 ≤ c and c-quadratically uniform if‖f‖U3 ≤ c.

It turns out that in most cases controlling ‖f‖Uk for k large enough willallows one to control Ex∈Gd

∏mi=1 f(Li(x)) and guarantee the expected num-

ber of images. However, since in most applications in which one needs to saymore the associated analysis becomes increasingly complicated (and is notcompletely understood) as k grows and therefore finding the minimal k is ofsome importance.

Definition 2 (True Complexity). The true complexity of L is the smallestk with the following property . For every ǫ > 0 there exists δ > 0 such that ifG is any finite abelian group and f : G→ C is any function with ‖f‖∞ ≤ 1and ‖f‖Uk+1 ≤ δ, then

∣∣∣∣∣Ex1,...,xd∈G

m∏

i=1

f(Li(x1, . . . , xd))

∣∣∣∣∣ ≤ ǫ.

In Green and Tao [1] the another notion of complexity of a system oflinear forms is given.

Definition 3 (Cauchy-Schwarz Complexity). Let L = (L1, . . . , Lm) be asystem of m linear forms in d variables. For 1 ≤ i ≤ m and s ≥ 0, we saythat L is k-complex at i if one can partition the m − 1 forms Lj : j 6= iinto k + 1 classes such that Li does not lie in the linear span of any of theseclasses. The Cauchy-Schwarz complexity of L is defined to be the least k forwhich the system is k-complex at i for all 1 ≤ i ≤ m, or ∞ if no such kexists.

36

Page 37: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

In the same paper it is shown that with this concept of complexity in towone can prove the following theorem.

Lemma 4. Let f1, . . . fm be functions from G → [−1, 1]. Let L be haveCauchy-Schwarz complexity k. Then

∣∣∣∣∣Ex∈Gd

m∏

i=1

fi(Li(x))

∣∣∣∣∣ ≤ mini‖fi‖Uk+1.

Therefore in order to get the expected number of images of systems oflinear forms with Cauchy-Schwarz complexity k one only needs to control theUk+1 norm. The first question one asks then is this upper bourd sharp? Theanswer at least for Cauchy-Schwarz complexity 2 is yes. There exist uniformsets in Zp which contain too many four term arithmetic progressions. Thisresult relies on the fact that x2, (x+ s)2, (x+ 3s)2 and (x+ 2s)2 are linearlydependent. Is then the Cauchy-Schwarz complexity actually equal to the truecomplexity of a system linear forms? At least in the case of Cauchy-Schwarzcomplexity 2 Gowers and Wolf ([3]) show this to be false. However, theymake the following conjecture.

Conjecture 5. The true complexity of a system L is equal to the smallest ksuch that the functions Lk+1

i are linearly independent.

Our goal is to give Gowers’ and Wolf’s proof of this conjecture in the casethat G = F

np and the system of linear forms is of Cauchy-Schwarz complexity

2 .

5.2 Quadratic fourier analysis and initial reductions

¿From this point on we will refer to the property of the L2i being linearly

independent that of as square-independence. In the context of Fnp when we

say that a system of linear forms is square-independent what we really meanis that the quadratic forms LT

i Li are linearly independent, i.e. that thematrices made of the coefficients of LT

i Li are linearly independent over Fp.

Theorem 6 (Main Theorem). For every ǫ > 0 there exists a constant c > 0with the following property. Let f : F

np → [−1, 1] be a c-uniform function.

Let L be a square-independent system of linear forms, with Cauchy-Schwarz

37

Page 38: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

complexity at most 2. Then

Ex∈(Fnp )d

m∏

i=1

f(Li(x)) ≤ ǫ.

Our main tool will be that of quadratic Fourier Analysis. Consider asurjective linear map Γ1 : Fn

p → Fd1p . Also define a quadratic map Γ2 : Fn

p →Fd2

p with Γ2(x) = (q1(x), . . . , qd2(x)) where the qi are quadratic forms on Fnp .

By the rank(Γ2) we shall mean the rank of the the bilinear form∑

i λiβi whereλi are not all zero and βi is the bilinear form associated with qi. For eacha ∈ Fd1

p and b ∈ Fd2p consider the corresponding sets Ma = x : Γ1(x) = a

and Nb = x : Γ2(x) = b. Define B1 to be the algebra generated by theset Ma and B2 to be the algebra generated by Nb. Then B1 is referred toas a linear factor of complexity d1 while the pair (B1,B2) is referred as aquadratic factor of complexity (d1, d2)!. By the rank(B1,B2) we shall meanthe rank(Γ2).

The following theorem allows us to decompose f into the sum of threefunctions. A function f1 which is ”quadratically structured” in the sensethat it is constant on the atoms of B2 of the quadratic factor (B1,B2) whichis of bounded complexity. A function f2 which is small in L2 which allows usto guarantee that (B1,B2) has high rank. And finally, a function f3 which isquadratically uniform.

Theorem 7 (Structure Theorem). Let p be a fixed prime. Let δ > 0, r :N → N be a function which may depend on δ. Suppose that n > n0(r, δ) issufficiently large. Then given any function f : Fn

p → [−1, 1], there exists ad0 = d0(r, δ) and a quadratic factor (B1,B2) with

rank(B1,B2) ≥ r(d1 + d2) and complexity(B1,B2) ≤ (d1, d2),

with d1, d2 ≤ d0 together with a decomposition

f = f1 + f2 + f3,

wheref1 := E(f |B2), ‖f2‖2 ≤ δ and ‖f3‖U3 ≤ δ.

The Structure Theorem allows to essentialy replace f with f1 in the MainTheorem. To see this let δ > 0. Let r : d 7→ 2md + C. From the StructureTheorem (Theorem 7) there exists a d0 and a quadratic factor (B1,B2) with

rank(B1,B2) ≥ 2m(d1 + d2) + C and complexity(B1,B2) ≤ (d1, d2),

38

Page 39: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

with d1, d2 ≤ d0. Replacing the first f ,

Ex∈(Fnp )d

m∏

i=1

f(Li(x)) = Ex∈(Fnp )d(f1 + f2 + f3)(L1(x))(

m∏

i=2

f(Li(x)).

Applying Cauchy-Schwarz to the second term,∣∣∣∣∣Ex∈(Fn

p )df2(L1(x))

m∏

i=2

f(Li(x))

∣∣∣∣∣ ≤ Ex∈(Fnp )d |f2(L1(x))| ≤ ‖f2‖2 ≤ δ.

followed in turn by Lemma 4 to the third term,∣∣∣∣∣Ex∈(Fn

p )df3(L1(x))

m∏

i=2

f(Li(x))

∣∣∣∣∣ ≤ ‖f3‖U3 ≤ δ.

This gives the bound,

Ex∈(Fnp )d

m∏

i=1

f(Li(x)) ≤ Ex∈(Fnp )df1(L1(x))

m∏

i=2

f(Li(x)) + 2δ.

Continuing this process replacing one f at a time,

Ex∈(Fnp )d

m∏

i=1

f(Li(x)) ≤ Ex∈(Fnp )d

m∏

i=1

f1(Li(x)) + 2mδ.

5.3 Dealing with f1

In order to deal with Ex∈(Fnp )d

∏mi=1 f1(Li(x)) we need some preliminary cal-

culations.

Lemma 8. Let L be a square-independent system of linear forms and letΓ2 = (q1, . . . qd2) be a quadratic map from Fn

p → Fd2p with rank(Γ2) ≥ r. Let

φ1, . . . , φm be linear maps from (Fnp)d to F

d2p and b1, . . . , bm be elements of

Fd2p . Then for a randomly chosen element x,

∣∣Pr [Γ2(Li(x)) = φi(x) + bi, i = 1, . . . , m]− p−md2∣∣ ≤ p−r/2.

Proof. Let

Pr [Γ2(Li(x)) = φi(x) + bi, i = 1, . . . , m]− p−md2 =

39

Page 40: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

ExEλ∈Λ∗

m∏

i=1

d2∏

i=1

χ(λij(qj(Li(x))− φij(x)− bij)

Let Li(x) =∑d

u=1 ciuxu. If βj is the bilinear form associated with qj then

qj(Li(x)) =∑

u,v

ciucivβj(xu, xv).

Consider a j such that λij is non-zero for at least one i. Then from the square-independence of L there exists a u, v such that

∑i λijciuciv 6= 0. Setting

βtw =∑

i,j λijcitciw we note that since rank(Γ2) ≥ r and square-independenceimply rank(βuv) ≥ r. It is now possible after some restatement to considerthe

Exu,xvχ(βxu,xv + ψu(xu) + ψv(xv)− b),where ψu and ψv are linear functionals. Now for u = v we may apply thefollowing lemma.

Lemma 9. Let M be matrix of rank r and b ∈ Fnp . Then

∣∣Ex∈Fnpχ(xTMx + bTx)

∣∣ ≤ p−r/2.

When u 6= v then for a fixed xv then βxu,xv +ψu(xu) +ψv(xv)− b is linearin u. However, then the expection is zero unless βxu,xv + ψu(xu) is constant.This occurs on a subspace of dimension n − r and so the expectation isbounded by p−r.

Using this one can then take into account Γ1 as well. For a full proof seeGowers and Wolf ([3]).

Lemma 10. Let the L be a square-independent and with the dimension of thelinear span of L be d′. Let a = (a1, . . . , am) ∈ (Fd1

q )m and b = (b1, . . . , bm) ∈(Fd2

q )m. Then for a randomly chosen x we have that

Pr(a, b) = Pr [Γ1(Li(x)) = ai and Γ2(Li(x)) = bi for i = 1, . . . , m] =

0 for a /∈ Zp−d1d′−d2m +R(a, b) for a ∈ Z,

where |R(a, b)| ≤ pd1−d1d′−r/2 and Z is a subspace of (Fd1q )m with dimension

d′d1.

40

Page 41: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

We can now deal with f1.

Theorem 11. Let Γ1 : Fnp → Fd1

p be a linear map and Γ2 : Fnp → Fd1

p be aquadratic map with corresponding quadratic factor (B1,B2) with rank(B1,B2) ≥r. Let f : Fn

p → [−1, 1] and let f1 := E(f |B1). Let L be a square-independentsystem of linear forms. Then

Ex∈(Fnp )d

m∏

i=1

f1(Li(x)) ≤ ‖f‖U24mpd1/4 + 2m+1pm(d1+d2)−r/2.

In order to prove this we first appeal to a result which is really a com-bination of a few basic results on projections onto linear factors and the U2

norm.

Lemma 12. Let g = E(f1|B1). Then ‖g‖42 ≤ pn‖f‖4U2.

Setting h = f1 − g then we have

Ex∈(Fnp )d

m∏

i=1

f1(Li(x)) = Ex∈(Fnp )d

m∏

i=1

(h+ g)(Li(x)),

which we may then expand into 2m terms containing products of g’s and h’s.For the 2m − 1 terms which contain at least one g we may use the fact that‖g‖∞ ≤ 1 and ‖h‖∞ ≤ 2 until there is one remaining g followed by Lemma12 to ‖g‖1 ≤ ‖g‖2 ≤ ‖f‖U2pd1/4. The worst case occurs when there is onlyone g giving the bound 2mcpd1/4. Therefore,

Ex∈(Fnp )d

m∏

i=1

f1(Li(x)) ≤ c4mpd1/4 + Ex∈(Fnp )d

m∏

i=1

h(Li(x)).

Let H be defined by the formula H(Γ1x,Γ2x) = h(x). From Lemma 10

Ex

i

h(Li(x)) = Ex

i

H(Γ1(Li(x)),Γ2(Li(x))) =∑

a∈Z,b

Pr(a,b)∏

i

H(ai, bi)

= p−d1d′−d2m∑

a∈Z,b

i

H(ai, bi) +∑

a∈Z,b

R(a,b)∏

i

H(ai, bi)

= Ea∈Z

i

EbiH(ai, bi) + 2mpd1d′+d2m+d1−d1d′−r/2.

41

Page 42: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Similarly, one can derive from Lemma 10 that |EbiH(ai, bi)| ≤ 2pd1+d2−r/2,

which shows that

|Ea∈Z

i

EbiH(ai, bi)| ≤ (2pd1+d2−r/2)m.

Combining these terms gives us the desired result.

5.4 Finishing the proof of the main theorem

¿From Lemma 11,

Ex∈(Fnp )d

m∏

i=1

f(Li(x)) ≤ c4mpd1/4 + 2m+1pm(d1+d2)−r/2 + 2mδ.

≤ c4mpd1/4 + 2m+1p−C/2 + 2mδ.

Retroactively setting C such that 2m+1p−C/2 ≤ ǫ/3,

≤ c4mpd0/4 +ǫ

3+ 2mδ.

Setting δ = ǫ/6m gives

c4mpd0/4 +2ǫ

3.

This now lets us set the uniformity c = 4−mp−d0/4ǫ/3, to give the desiredresult.

References

[1] Green, B. and Tao, T., Linear equations in the primes. Preprint;

[2] Gowers, W. T., A new proof of Szemeredi’s theorem. GAFA, 11 (2001),465–588;

[3] Gowers, W. T. and Wolf, J., The true complexity of a system of linearequations. Preprint;

Derrick Hart, Rutgers University

email: [email protected]

42

Page 43: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

6 On an Argument of Shkredov on Two-Di-

mensional Corners

after Michael T. Lacey and William McClain [1]A summary written by Vjekoslav Kovac

Abstract

We consider the cardinality of the largest subset of Fn2×F

n2 contain-

ing no corner, which is a triple of the form (x, y), (x + d, y), (x, y + d)with d 6= 0. We prove that this quantity is bounded by a constantmultiple of 22n log2 log2 n

log2 n .

6.1 Some history and the main result

In the spirit of the famous Roth-Szemeredi theorem on arithmetic progres-sions, it is natural to investigate density of sets that do not contain somefixed two-dimensional sub-structure. The simplest such sub-structure is acorner, i.e. a triple of points of the form

(x, y), (x + d, y), (x, y + d),

for some integers x, y and a positive integer d.In order to state some quantitative results we define

r∠(N) := max|A| : A ⊆ 1, 2, . . . , N2, A contains no corners.

The first result about asymptotics of r∠(N) was found by Ajtai and Szemeredi[2] who proved r∠(N) = o(N2). Their proof actually gives an explicit bound

r∠(N) .N2

(log∗N)c,

where c > 0 is an absolute constant and log∗ is the iterated logarithm function,i.e. the number if times one must take the logarithm in order to producea number less than or equal to 1. Several other authors obtained similarexplicit but very weak bounds. Shkredov [5, 6] was the first to produce a“reasonable” bound:

r∠(N) .N2

(log logN)c.

43

Page 44: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Green observed in [3, 4] that Shkredov’s argument can be simplified inthe context of finite fields. More generally, for a finite abelian group G wedefine a corner to be a triple (x, y), (x + d, y), (x, y + d), where x, y, d ∈ Gand d 6= 0. Again we denote

r∠(G) := max|A| : A ⊆ G×G, A contains no corners.

Although the original problem is essentially the particular case G = Z/NZ,the greatest simplification of Shkredov’s arguments is present in the caseG = F

n2 . If we denote N = |Fn

2 | = 2n, then the result can be stated as

r∠(Fn2 ) .

N2

(log2 log2N)1/25=

22n

(log2 n)1/25.

Later Shkredov proved in [7] an analogous estimate for an arbitrary finiteabelian group.

The result of the paper [1] is a further improvement of the above boundon r∠(Fn

2 ). Lacey and McClain show that

r∠(Fn2 ) . N2 log2 log2 log2 N

log2 log2 N= 22n log2 log2 n

log2 n.

Here we elaborate on their proof.

6.2 Outline of the proof

Basic structure of the proof is the same as in the original Shkredov’s (oradapted Green’s) proof. It is iterative and its main ingredients are:

• Denifition of appropriate “box norms”

• Generalized von Neumann estimate

• Density increment on a sublattice

• Uniformizing a sublattice

The key new ingredient in [1] is considering three different “box norms” asso-ciated to three coordinate systems in Fn

2 × Fn2 . Accordingly, product lattices

are replaced by intersections of two product lattices in different coordinatesystems. Details are to follow.

44

Page 45: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

6.3 Some notation

Let H denote a subspace of Fn2 . Its dimension will decrease at every iterative

step of the proof. X,D, Y ⊂ H are subsets. We view X as a subset ofthe first coordinate associated to basis element e1, Y as a subset of thesecond coordinate, associated to basis element e2, and D as a subset of the“diagonal” coordinate associated to e1 + e2. We will be working with subsetsof

S := X × Y ∩Xdiag× D.

The density of X in H is

δX := P(X | H) = |X||H| ,

and analogously for δY and δD. In the iterative procedure these densities willdecrease.

The following quantity measures “uniformity” of the distribution of X inH :

‖X‖uni := supξ 6=0

| bX(ξ)||H| .

If ‖X‖uni ≤ η then we say that X is η–uniform. Here X represents the theFourier transform of X:

g(ξ) :=∑

x∈H

g(x)(−1)x·ξ.

After the deletion of a small subset, a uniform set is again uniform.The density of A is

δ := P(A | S) .

This quantity will increase in the iterative procedure of the proof. We definethe balanced function of A to be the function supported on S as

f(x, y) := A(x, y)− δS .

Throughout this proof we assume:

‖X‖uni , ‖Y ‖uni , ‖D‖uni ≤ υ , υ := (δδXδY δD)C

where C is a large constant which we need not specify exactly, as its precisevalue only influences implied constants in our main Theorem. We will usethe notation υ′ for a fixed function of υ, that tends to zero as υ does.

45

Page 46: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

For a function f : S −→ C, define the “box norm”

‖f‖ := δ−4D Ex,x′∈X

y,y′∈Y

f(x, y)f(x′, y)f(x, y′)f(x′, y′)

where we use the standard basis (e1, e2). When f is the balanced function ofA, the norm being ‘large’ is an obstacle to A having the expected number ofcorners. We use two additional “box norms”. In the (e1, e1 + e2) coordinatesystem we define

‖f‖4,X := δ−4Y Ex,x′∈X

d,d′∈D

f(x, d)f(x′, d)f(x, d′)f(x′, d′) .

Also with respect to the (e2, e1 + e2) coordinate system we define

‖f‖4,Y := δ−4X Ey,y′∈Y

d,d′∈D

f(y, d)f(y′, d)f(y, d′)f(y′, d′) .

6.4 Main ingredients of the proof

6.4.1 Generalized von Neumann lemma

The following lemma provides sufficient conditions for A to have a corner.

Lemma 1. Suppose that A ⊂ S with P(A | S) = δ and we have the inequal-ities

δXδY δDδ2N > C ,

max‖f | , ‖f |,X , ‖f |,Y ≤ κδ5/4 .

Then A has a corner.

Here C represents a large absolute constant and 0 < c, κ, κ′ < 1 are smallfixed constants.

6.4.2 Density increment lemma

If the conditions of the previous lemma are not satisfied, then we can find asublattice on which A has increased density. This is a result of the followinglemma.

46

Page 47: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Lemma 2. For 0 < κ there is a constant 0 < κ′ < 1 for which the following

holds. Suppose that A ⊂ S = X × Y ∩Xdiag× D with P(A | S) = δ, that f is

the balanced function of A on S, and that

max‖f | , ‖f |,X , ‖f |,Y > κδ5/4 .

Then there exists X ′ ⊂ X, Y ′ ⊂ Y , D′ ⊂ D such that three conditions hold.

either X ′ = X, or Y ′ = Y , or D′ = D;

P(A | S ′) ≥ δ + κ′δ2 , S ′ = X ′ × Y ′ ∩X ′ diag× D′ ;

P(X ′ | X) , P(Y ′ | Y ) , P(D′ | D) ≥ κ′δ2.

We need only refine two of the three sets X, Y and D above. Withuniformity in coordinate that is not refined, we then have that the set S ′ =

X ′ × Y ′ ∩X ′ diag× D′ has about the expected number of points in it.

6.4.3 Uniformizing a sublattice

The last auxiliary result tells us that we can find a uniform sublattice onwhich A has increased density. Uniformity is important since it is requiredin applying the Generalized von Neumann Lemma.

Lemma 3. Suppose that X, Y,D are as above and

• X ′ ⊂ X, Y ′ ⊂ Y , D′ ⊂ D, with P(X ′ | X) ≥ cδ2 and similarly for Yand D;

• Either X ′ = X, Y ′ = Y or D′ = D;

• S ′ = X ′ × Y ′ ∩X ′ diag× D′;

• P(A | S ′) = δ + cδ2;

• dim(H) > C[δ4(υ′′)2]−1, where 0 < υ′′ < 1 is fixed.

47

Page 48: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Then there exists X ′′ ⊂ X ′, Y ′′ ⊂ Y , D′′ ⊂ D′ and H ′, H ′′, translates of thesame subspace H0 ≤ H, so that

‖X ′′‖uni , ‖Y ′′‖uni , ‖D′′‖uni ≤ υ′′,

P(A | S ′′) ≥ δ + c2δ2 , S ′′ = X ′′ × Y ′′ ∩X ′′ diag

× D′′ ,

dim(H0) ≥ dim(H)− C[δ4(υ′′)2]−1 ,

P(X ′′ | H ′) ≥ κδ2P(X ′ | H) .

Analogously for Y ′′ and D′′. In particular P(D′′ | H ′ +H ′′) ≥ κδ2P(D′ | H)

We emphasize that H ′ and H ′′ are translates of the same subspace ofH0 < H , After a joint translation of A, X, Y and D, we can assume that H ′

and H ′′ are in fact the same subspace H .

6.5 Proof of the main theorem

The proof is recursive and we describe its conditional loop.Initialize X ← Fn

2 , Y ← Fn2 , D ← Fn

2 , S ← Fn2 × Fn

2 , H ← Fn2 . Likewise

δX , δY , δD ← 1. Fix a set A0 with density δ0 in Fn2 × Fn

2 . Initialize A ← A0

and δ ← P(A | S).We iteratively apply the following steps:

• If max‖f‖ , ‖f‖,X , ‖f‖,Y > κδ5/4, apply the density incrementlemma.

• If X ′, Y ′ or D′ is not υ = (δδX′δY ′δD)C uniform, apply the uniformitylemma. Suppose these sets are as in the Lemma: subsets X ′′ ⊂ X ′,Y ′′ ⊂ Y ′, D′′ ⊂ D′ and affine subspaces H ′, H ′′ ⊂ H containingX ′′, Y ′′, D′′. After joint translation of X ′′, Y ′′, D′′, A and H ′, H ′′, wecan assume that H ′ = H ′′ and are subspaces of H .

• Update variables:

X ← X ′′ , Y ← Y ′′ , D ← D′′, H ← H ′ ,

δX ← P(X ′′ | H ′) , δY ← P(Y ′′ | H ′) , δD ← P(D′′ | H ′) ,

S ← X × Y ∩Xdiag× D , δ ← P(A | S).

48

Page 49: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

• The density of the incremented A on the set S has increased by at leastκδ2

0. The incremented densities δX , δY , and δD have decreased by atmost (κδ0)

C .

Once this loop stops we conclude that A has a corner, provided that theinitial dimension is large enough. This loop must stop in . δ−1

0 iterates, sinceitherwise the density of A on the sublattice would exceed one. Thus we needto be able to apply our lemmas . δ−1

0 times. In order to do that, both Xand H must be sufficiently large at each stage of the loop.

This requirement places lower bounds on N = 2n. The most stringent ofthese comes from the loss of dimensions. Note that before the loop termi-nates, we can have δX as small as

δX ≥ (κδ0)(κδ0)−1

.

In order to apply the uniformity lemma at that stage, we need

N > 2(Cδ0)−Cδ−10 .

From this we get the bound stated in the main theorem.

References

[1] Lacey M. T., McClain W., On an Argument of Shkredov on Two-Dimensional Corners, Online Journal of Analytic Combinatorics, No.2, 2007,arXiv:math/0510491v3 [math.CO]

[2] Ajtai M., Szemeredi E., Sets of lattice points that form no squares, Stud.Sci. Math. Hungar., 9, 1974

[3] Green B., Finite field models in additive combinatorics, Surveys in com-binatorics 2005, London Math. Soc. Lecture Note Ser., Vol. 327,arXiv:math/0409420v1 [math.NT]

[4] Green B., An Argument of Shkredov in the Finite Field Setting,http://www.dpmms.cam.ac.uk/~bjg23/

[5] Shkredov I. D., On one problem of Gowers, Izv. ross. Akad. Nauk Ser.Mat., 70, 2006, No. 2, 179–221,arXiv:math/0405406v1 [math.NT]

49

Page 50: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

[6] Shkredov I. D., On a Generalization of Szemeredi’s Theorem,arXiv:math/0503639v1 [math.NT]

[7] Shkredov I. D., On a two-dimensional analog of Szemeredi’s Theoremin Abelian groups,arXiv:0705.0451v1 [math.NT]

Vjekoslav Kovac, UCLA

email: [email protected]

50

Page 51: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

7 An inverse theorem for the Gowers U3(G)

norm over the finite field Fn5

after B. Green and T. Tao [1]A summary written by Choongbum Lee

Abstract

We give an inverse theorem for functions with large Gowers U3(G)norm over G = F

n5 which can be stated as following. A bounded

function f : G → C has large U3(G) norm if and only if it has alarge inner product with a function e2πiφ, where φ : F

n5 → R/Z is a

quadratic phase function.

7.1 Introduction

Let’s start by quoting a paragraph from Green and Tao [1]

There has been much recent progress in the study of arithmeticprogressions in various sets, such as dense subsets of the integersor of the primes. One key tool in these developments has beenthe sequence of Gowers uniformity norms Ud(G), d = 1, 2, 3, . . .on a finite additive group G; in particular, to detect arithmeticprogressions of length k in G it is importatnt to know under whatcircumstances the Uk−1(G) norm can be large.

The goal of this summary is to provide preliminaries and show the outlineof the proof of an inverse theorem for functions with large U3(Fn

5) norm andthereby answering the question above about the necessary condition for largeUk−1(G) norm in a special case. Using this theorem we can deduce r4(Fn

5)≪N(log logN)c for some constant c where r4(G) is the largest cardinality of aset A ⊂ G which does not contain an arithmetic progression of length 4.

We also note that in the same paper the authors also proves an inversetheorem for general abelian groupG but this is not the scope of this summary.The statement of definitions and theorems will be quoted from [1].

7.2 Preliminaries

In this section we will define the Gowers uniformity norm Ud(G) and a semi-norm ud(B) which will be defined over a subset B ⊂ G.

51

Page 52: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

7.2.1 Notation

Here we collection the notations we will use in this summary.G will denote a finite additive group where an additive group is an abelian

group with operation +. If f : G → H is a function from one additivegroup to another, and h ∈ G, we define the shift operator T h applied tof by T hf(x) := f(x + h), and the difference operator h · ∇ := T h − 1applied to f by the formula (h · ∇)f(x) = f(x+ h)− f(x). We extend thesedefinitions to functions of several variables by using subscript as following.T h

x (x, y) = f(x+ h, y) and (h · ∇x)f(x, y) = f(x+ h, y)− f(x, y)The expectation Ex∈Bf(x) := 1

|B|∑

x∈B f(x) will be used to denote theaverage of f over B.

We will use e : R/Z → C is the exponential map e(x) := e2πix andD := z ∈ C : |z| ≤ 1 for the unit disk.

7.2.2 Gowers uniformity norm Ud(G), ‖·‖Ud(G)

The following norm was first introduced by Gowers [4] to prove Szemeredi’stheorem.

Definition 1. (Gowers uniformity norm). Let d ≥ 0, and let f : G→ C bea function. We define the Gowers uniformity norm ‖f‖Ud(G) ≥ 0 of f to bethe quantity

‖f‖Ud(G) :=(Ex∈G,h∈Gd

w∈0,1d

C|w|Tw·hf(x))1/2d

where w = (w1, . . . , wd), h ∈ (h1, . . . , hd), w · h := w1h1 + . . . + wdhd, |w| :=w1 + . . .+ wd and C is the conjugation operator Cf(x) := f(x)

To feel comfortable with the Ud(G) norm, here we explicitly write downthe U2(G) norm and U3(G) norm.

‖f‖U2(G) :=(Ex,a,b∈Gf(x)f(x+ a)f(x + b)f(x+ a + b)

)1/4

‖f‖U3(G) :=(Ex,a,b,c∈Gf(x)f(x+ a)f(x + b)f(x+ c)

f(x+ a + b)f(x+ b+ c)f(x+ c+ a)f(x + a+ b+ c))1/8

We can also define Ud(G) norms recursively as following.

‖f‖U0(G) := E(f); ‖f‖U1(G) := |E(f)|

‖f‖Ud(G) := (Eh∈G

∥∥T hff∥∥2d−1

Ud−1(G))1/2d

52

Page 53: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Here are some basic properties of ‖·‖Ud(G).

1. (Norm) ‖·‖Ud(G) is a norm for d > 1

2. (Monotonicity) ‖f‖Ud(G) ≤ ‖f‖Ud+1(G)

3. (Conjugation Symmetry)∥∥f∥∥

Ud(G)= ‖f‖Ud(G)

4. (Phase invariance) ‖fe(φ)‖Ud(G) = ‖f‖Ud(G) whenever φ is a global

polynomial phase function of order at most d− 1 7 .

5. (Shift Invariance)∥∥T hf

∥∥Ud(G)

= ‖f‖Ud(G) for all h ∈ G

7.2.3 Local polynomial bias of order d, ‖·‖ud(B)

Here we define a seminorm which is called the local polynomial bias of orderd. This seminorm will be used for the main theorem. i.e. we will later explorethe relation between Gowers norm and local polynomial bias.

Definition 2. (Locally polynomial phase functions) If B is any non-emptysubset of a finite additive group G and d ≥ 1, we say that a function φ : B →R/Z is a polynomial phase function of order at most d− 1 locally on B if wehave

(h1 · ∇x) . . . (hd · ∇x)φ(x) = 0

whenever the cube (x + w1h1 + . . . + wdhd)w1,...wd∈0,1 is contained in B. Iff : B → C is a function, we define the local polynomial bias of order d on Bto be the quantity

‖f‖ud(B) := sup|Ex∈B(f(x)e(−φ(x)))|

where φ ranges over all local polynomial phase functions of order at mostd− 1 on B.

From now on we will refer to polynomial phase functions of degree atmost 1 as linear phase functions, and of degree at most 2 as quadratic phasefunctions. Note that to measure the local polynomial bias, we are taking theinner product of f with functions of the form e(φ) where φ is a polynomial

7Global polynomial phase function will be defined in the next section.(See Definition2) But we include this property here to see the similarity between the properties of theGowers Uniformity norm and that of the local polynomial bias defined in next section

53

Page 54: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

phase function. Therefore in some sense we are measuring the correlation off with polynomial oscillations.

We can deduce the folllowing properties from the definition which aresimilar to the Gowers uniformity norm.

1. (Seminorm) ‖·‖ud(B) is a seminorm for any B ⊂ G

2. (Monotonicity) ‖f‖ud(B) ≤ ‖f‖ud+1(B)

3. (Conjugation Symmetry)∥∥f∥∥

ud(B)= ‖f‖ud(B)

4. (Phase invariance) ‖fe(φ)‖ud(B) = ‖f‖ud(B) whenever φ is a locallypolynomial phase function of order at most d− 1 on B.

5. (Shift Invariance for B = G)∥∥T hf

∥∥ud(G)

= ‖f‖ud(G) for all h ∈ G

By using these properties together with the properties of the GowersUd(G) norm we can prove the following proposition.

Proposition 3. Let G be an additive group and f : G → C be a function.Then for all d > 1,

‖f‖Ud(G) ≥ ‖f‖ud(G)

Proof. Given any global polynomial phase function of degree at most d − 1we can use the monotonicity and phase invariance properties to deduce

‖f‖Ud(G) = ‖fe(−φ)‖Ud(G) ≥ ‖fe(−φ)‖U1(G) = |Ex∈G(f(x)e(−φ(x)))|

and taking the supremum over all φ we have

‖f‖Ud(G) ≥ ‖f‖ud(G)

This proposition asserts that large polynomial bias of order d forces largeUd(G) norm. Now we are interested in the reverse direction. So the questionbecomes whether large Ud(G) norm forces large polynomial bias of order dor not. For d = 2 case we indeed have an inverse theorem which isn’t thatdifficult to deduce.

54

Page 55: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Proposition 4. (Inverse theorem for U2(G) norm). Let f : G → D be abounded function. Then

‖f‖u2(G) ≤ ‖f‖U2(G) ≤ ‖f‖1/2

u2(G)

Unfortunately for d > 2 in general additive group G, the converse isn’ttrue. This was discovered by Furstenberg and Weiss [3] and the example canbe found in [1], Example 2.4. This is why we only worry about Fn

5 where aninverse theorem for U3(Fn

5 ) norm does exist.

7.3 Main Theorem

The main theorem will positively answer our question for G = Fn5 and d = 3.

That is, large U3(Fn5 ) norm does indeed force large polynomial bias of order

3. Actually we can replace F5 by Fp for any odd prime p where an almostidentical proof will be applied. But we will work in F5 for concreteness andsimplicity.

Theorem 5. (Inverse theorem for U3(Fn5 )). Let f : Fn

5 → D be a boundedfunction and let 0 < η ≤ 1

(i) If ‖f‖U3(Fn5 ) ≥ η, then there exists a subspace W ≤ Fn

5 of codimension

at most (2/η)C such that

Ey∈Fn5‖f‖u3(y+W ) ≥ (η/2)C

where we can take C = 216. In particular, there exists y ∈ G such that‖f‖u3(y+W ) ≥ (η/2)C.

(ii) Conversely, given any subspace W ≤ Fn5 and any function f : Fn

5 → C

we have ‖f‖U3(Fn5 ) ≥ ‖f‖u3(Fn

5 ) ≥ 5−n|W | ‖f‖u3(y+W ) for any y ∈ Fn5

Combining the two parts of Theorem 5 we see that

‖f‖u3(Fn5 ) ≤ ‖f‖U3(Fn

5 ) ≤C

logc(1 + 1/ ‖f‖u3(Fn5 ))

for some absolute constants c, C > 0. Thus we obtained an inverse theoremas promised. But note that this bound is much more weaker than the boundof Proposition 4 for U2(G). Actually the first part of Theorem 5 assertsthat the control is much better when we look at a quadratic bias over a wellchosen affine subspace.

55

Page 56: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

7.4 Outline of the proof

Throughout this section G will be fixed as Fn5

We will describe the easier proof of the second part first. Proposition3 already gives us one side of the inequality so we only need to look at thesecond inequality ‖f‖u3(Fn

5 ) ≥ 5−n|W | ‖f‖u3(y+W ) for any subspace W and anyy ∈ Fn

5 . Each side of the norm is determined by the supremum of the innerproduct of f with functions of the form e(p(x)) where p(x) is a quadraticphase function. But by using the following proposition we can see that everypossible choice of e(p(x)) for the RHS can be also considered as a choice ofe(p(x)) for the LHS and therefore prove the second part of the theorem.

Proposition 6. (Quadratic Extension Theorem). Let G be an additivegroup, let H ≤ G be a subgroup, and suppose that y ∈ G. Then any quadraticphase function φ : y + H → R/Z can be extended (non-uniquely in general)to a globally quadratic phase function on G.

Now let’s prove the first part of theorem 5 which asserts that if we have afunction f with large U3(Fn

5 ) norm, then it must have quadratic polynomialbias. Recall that quadratic polynomial bias is defined as the supremum of|Ex∈F

n5f(x)e(−p(x))| where p(x) varies over polynomials of degree at most

2. Introducing φ(x) as the phase of f(x) (That is f(x) = e(φ(x))) will helpus understand the heuristic of the argument. With this phase φ(x), thepolynomial bias can be rewritten as,

‖f‖u3(G) = sup|e(φ(x)− p(x))|

Thus our goal of proving that f has large quadratic polynomial bias can bechanged into proving that the phase φ(x) is ’almost’ a quadratic polynomial.8

STEP 1 : Linearization of phase derivative.A nice way of showing that a function φ(x) behaves like a quadratic

polynomial is by proving that (h · ∇)φ(x) = φ(x + h) − φ(x) is approxi-mately linear. The proof will go in this direction. Since we are talking aboutpolynomial bias, we will first examine φ(x + h) − φ(x) and prove it’s linearbias which certainly exists if we allow to restrict ourself to a subspace V of

8Here the formal meaning of ’almost a quadratic polynomial’ guides us back to theoriginal formulation. Thus formally this will mean that f has large quadratic polynomialbias

56

Page 57: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Fn5 .(Fortunately we have a bound on codimension). The formal statement

we derive is

Proposition 7. (Large U3(Fn5 ) gives linear phase derivative). Let G = F

n5 ,

and let f : G → D be a bounded function such that ‖f‖U3(G) ≥ η for someη > 0. Then there exists a linear subspace V of G with the codimensionbound

n− dim(V ) ≤ 2C1η−C′1

and a translate x0 +V of this subspace, together with a linear transformationM : V → G and an element ξ0 ∈ G such that

Eh∈V |Ex∈GTx0+hf(x)f(x)e(−(2Mh + ξ0) · x)| ≥ 2−C2ηC′

2

It is permissible to take Ci, C′i = 216 for i = 1, 2.

Note that the conclusion

Eh∈V |Ex∈GTx0+hf(x)f(x)e(−(2Mh + ξ0) · x)| ≥ 2−C2ηC′

2

can be rewritten as following by using the phase φ(x) instead of f(x).

Eh∈V |Ex∈Ge(φ(x+ x0 + h)− φ(x)− (2Mh + ξ0) · x)| ≥ 2−C2ηC′2

which exhibits the linearity of φ(x+ h)− φ(x)9.

STEP2: The symmetry argumentEven though we have established the linearity of φ(x+ h)− φ(x) by ap-

proximating it as a linear operator M on a large subspace V of Fn5 , this

doesn’t immediately give us the ’almost’ quadratic property of φ(x). For ourargument to work, we need the linear operator M to be self-adjoint. Againthis is possible if we restrict ourself to a subspace W of V with a bound oncodimension.

STEP3: Eliminating the quadratic phase componentBy using the results of STEP1 and STEP2 we can indeed show that f

has large quadratic bias but this is not an immediate consequence. If wehad an approximation such as φ(x + h) − φ(x) − 2Mh · x ≈ 0 then it isnot that difficult to deduce φ(x) ≈ Mx · x + q(x) where q(x) is a linear

9in the sense of linear bias

57

Page 58: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

polynomial. Unfortunately the formal meaning of φ(x + h) − φ(x) beinglinear in STEP1 is somewhat different from above. But still we can followthe idea (with almost identical steps) of the above argument and deduce thefollowing, which implies that translations of f |W have large correlation withthe quadratic phase function Mx · x.

Ey∈G

∥∥∥f(x + y)e(Mx · x)∥∥∥

u3(W )≥ 2−C2ηC′

2

and by the properties from section 7.2.3 we can conclude that

Ey∈G ‖f‖u3(y+W ) ≥ (η/2)C

which is the conclusion of Theorem 5.

7.5 Application

As an application of Theorem 5 we can give an upper bound on r4(Fn5 ) where

r4(G) is the largest cardinality of a set A ⊂ G which doesn’t contain anyarithmetic progression of length 4.

Theorem 8. (Szemeredi theorem for r4(Fn5 )). write N = 5n. Then we have

the boundr4(F

n5 )≪ N(log logN)−2−21

This theorem can be easily deduced from the next proposition (which canbe proved using Theorem 5).

Proposition 9. Let δ > 0, suppose that

n > 6(2/δ)220

and let A ⊂ Fn5 be a set with size at least δN . Suppose that A contains

no four-term arithmetic progressions. Then we can find an affine subspacex0 + V of Fn

5 with dimension dim(V ) ≥ n/3 such that we have the densityincrement

Ex∈x0+V 1A(x) ≥ Ex∈Fn51A(x) + (δ/2)220

58

Page 59: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

References

[1] Green, B. and Tao, T., An inverse theorem for the Gowers U3(G) norm,To appear, Proc. Edin. Math. Soc.

[2] Tao, T. and Vu, V., Additive Combinatorics Cambridge studies inadvanced mathematics 105, Cambridge University Press, 2006.

[3] H. Furstenberg and B. Weiss, A mean ergodic theorem forN−1

∑Nn=1 f(T nx)g(T n2

x), , Convergence in ergodic theory and proba-bility (Columbus OH 1993), 193-227, Ohio State Univ. Math. Res. Inst.Publ., 5. de Truyter, Berlin, 1996.

[4] W.T.Gowers, A new proof of Szemeredi’s theorem for progressions oflength four, GAFA 8 (1998), no. 3, 529-551

[5] W.T.Gowers, A new proof of Szemeredi’s theorem, GAFA 11 (2001),465-588

Choongbum Lee, UCLA

email: [email protected]

59

Page 60: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

8 New bounds for Szemeredi’s Theorem, II:

A new bound for r4(N)

after B. Green and T. Tao [1]A summary written by Kenneth Maples

Abstract

For N a large positive integer, let rk(N) denote the maximumcardinality of a subset A ⊂ [N ] = 1, ...,N that does not contain anarithmetic progression of length k. Szemeredi showed that r3(N) ≪Ne−c

√log log N , and later that rk(N) = o(N) for k ≥ 4. We improve

the known effective bounds for r4(N) to r4(N)≪ Ne−c√

log log N .

8.1 The Problem

Let A be a subset of an abelian group Z, which will typically be the integersof Z/pZ. An arithmetic progression of length k is a subset of A of the formx, x + r, x + 2r, ..., x + (k − 1)r for some initial value x and step r. Thesequence 3, 8, 13, 18 is an arithmetic progression of length 4 and step 5 inthe positive integers.

As mentioned in the abstract, let rk(N) denote the maximum cardinalityof a subset A ⊆ [N ] = 1, ..., N without an arithmetic progression of lengthk. For example, r3(5) = 4, which is achieved by the subset A = 1, 2, 4, 5.

Roth proved that r3(N) ≪ N(log logN)−1 in 1953 [3], which was sub-sequently improved by Szemeredi to r3(N) ≪ Ne−c

√log log N (unpublished).

Szemeredi later showed rk(N) = o(N) in [4] [5], which was improved byGowers in 1998 to the effective bound rk(N)≪ N(log logN)−ck .

In a series of papers, Green and Tao refined the analysis of Gowers toimprove the effective bounds on r4(N) to the level of the estimates for r3(N).This note summarizes the second paper in the series, which establishes thefollowing improvement on Gowers’ bound in the r4(N) case; the inequalityis equivalent to Szemeredi’s bound on r3(N).

Theorem 1. For N sufficiently large,

r4(N)≪ Ne−c√

log log N .

Stronger bounds for r3(N) and r4(N) have also been found but we willnot discuss them here.

60

Page 61: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

8.2 Notation

We will use standard notations for asymptotic estimates. In particular, theVinogradov notation f ≪ g means that there is a positive constant C suchthat f ≤ Cg for all (sufficiently large) arguments of f and g. If the im-plied constant C depends on another value, say δ, then we will denote thedependence with a subscript: f ≪δ g. We will also employ Landau notation,where O, o,Ω,Θ have their usual significance. If we write c or C then theyare placeholders for suitable small or large constants, respectively, which maydenote different values even if used in the same expression.

If Z is any finite set and E ⊆ Z is a subset, we write the density of E inZ as P(E) = PZ(E) = Pζ∈Z(E) = |E|

|Z| . Similarly, for any function f : E → C

we can define its expectation by EZf = Eζ∈Zf(ζ) = 1|Z|∑

ζ∈Z f(ζ). Lp spacestake their usual meaning against this expectation.

This summary uses the language of factors which are σ-algebras on finitesets. In particular, a factor B ⊆ 2X is a collection of subsets of a finite setX that is closed under unions, intersections, and complements. The minimalnon-empty elements of the factor are called atoms. In other words, a factorpartitions P of X into atoms, where the factor itself consists of all possibleunions of the atoms. We let B∨C denote the smallest factor containing bothB and C. The conditional expectation of f relative to the factor B is denotedby E(f |B) and may be defined as the usual orthogonal projection.

8.3 Strategy and Initial Reductions

Theorem 1 is proven using the density increment strategy invented by Roth.The key step is to establish the following density increment proposition.

Proposition 2. Let δ > 0 and assume that N ≥ eCδ−C. Let A ⊆ [N ]

with |A| ≥ δN be a subset with density at least δ. Suppose that A does notcontain an arithmetic progressions of length 4. Then there is an arithmeticprogression P ⊆ [N ] of length |P | ≫ N cδC

on which A has a larger density;explicitly, we have the density increment

|A ∩ P ||P | ≥ (1 + c)δ.

Let us suppose that this proposition is proven and then derive a proofof the main theorem. Suppose that we have constructed a subset A ⊆ [N ]

61

Page 62: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

with cardinality |A| = δN that does not contain any (non-trivial) arithmeticprogressions of length 4. We can apply the above proposition iteratively inthe following way. We initialize the values A0 = A, N0 = N , and δ0 = δ,and we will construct each Ak, Nk, and δk in turn. For each k, we apply theproposition to Nk, Ak, and δk and note that one of the following alternativesholds:

1. Nk ≤ eCδ−Cand our iteration must terminate, as the conditions of the

proposition are not satisfied.

2. Nk < eCδ−Cand the conditions of the proposition are satisfied. In

particular, we can find an arithmetic progression P on which Ak has alarger density as described above.

We let the next set Ak+1 be the image of Ak ∩ P under the affine trans-formation that takes P to Nk+1 = 1, ..., |P |. The new set has densityδk+1 ≥ (1+c)δk in the smaller interval, which we expand to δk+1 ≥ (1+c)k+1δ.

We also have the lower bound Nk+1 ≫ NcδC

kk from the proposition.

Clearly the algorithm cannot continue past δk ≥ 1. Because δk growsexponentially, the algorithm will terminate before C log(1/δ) steps. Once thishas happened, we can combine our two estimates for Nk at the terminationtime to see that

N (cδC )C log(1/δ) ≪ eCδ−C

.

Taking two logarithms and rearranging shows that

δ ≪ e−c√

log log N .

As this was true for any construction of A, it must hold for any subset whichachieves the cardinality r4(N); in this case δ = r4(N)/N and we are done.

We now embed the interval [N ] with the cyclic group Z/pZ for somelarge prime p so we can use Fourier analytic techniques. The cautious readerwill note that this may introduce spurious arithmetic progressions that “wraparound” the cyclic group; these are prevented by taking p is sufficiently large,say p ∈ [4N, 8N ].

We will analyze the quadrilinear form Λ, which is defined for functionsfj : Z/pZ→ C by

Λ(f0, f1, f2, f3) = Ex,h∈Z/pZf0(x)f1(x + h)f2(x + 2h)f3(x + 3h).

62

Page 63: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Notice that if 1A is the characteristic function of some subset A ⊆ Z/pZ,then Λ(1A, 1A, 1A, 1A) ≤ 1/p if A does not contain an arithmetic progressionof length 4; in fact, the product is positive only when h = 0 and x ∈ A.

The purpose of this quadrilinear form is the following reduction of theprevious proposition.

Theorem 3. Let p be a large prime, N ∈ [p/8, p/4] an integer, and f :Z/pZ → [0, 1] be a 1-bounded nonnegative function which vanishes outside[N ]. Let δ = E[N ](f). Suppose p ≫ exp(Cδ−C) for some suitably largeconstant C, and

|Λ(f, f, f, f)− Λ(δ1[N ], δ1[N ], δ1[N ], δ1[N ])| ≫ δ4.

Then we can find an arithmetic progression P in [N ] obeying the length bound|P | ≫ pcδC

and the density increment

EP (f) ≥ (1 + c)δ.

To see how this theorem implies Proposition 2, take f = 1A and explicitlycount the number of length 4 arithmetic progressions in [N ]. It thereforeremains to prove this theorem.

8.4 Relevant Definitions

Before we can discuss the proof of Theorem 3, we must make some relevantdefinitions. The analysis depends on finding subsets of [N ] that roughlyobey linear or quadratic equations. To this end, we will look at the level setsof “phase functions”, which will be generalizations of linear and quadraticpolynomials from Z/pZ→ R/Z. We say that a function φ : Z/pZ→ R/Z isa globally linear phase function if it satisfies the equation,

φ(x+ h1 + h2)− φ(x+ h1)− φ(x+ h2) + φ(x) = 0.

Note that φ(x) = ξx/p+α is a globally linear phase function, where ξ ∈ Z/pZand α ∈ R/Z.

We say that a function φ : Z/pZ → R/Z is a locally quadratic phasefunction on B ⊆ Z/pZ if it satisfies the equation,

φ(x+ h1 + h2 + h3)− φ(x+ h1 + h2)− φ(x + h2 + h3)− φ(x+ h1 + h3)

+ φ(x+ h1) + φ(x + h2) + φ(x + h3)− φ(x) = 0.

63

Page 64: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

where each sum x+ · · · ∈ B. Note that this definition is local, i.e. restrictedto the subset B of the cyclic group.

Phase functions generate factors as their level sets in the following preciseway. A linear factor of complexity at most d and resolution K is a factorof the form B = Bφ1 ∨ · · · ∨ Bφd′

where d′ ≤ d, each φk is a globally linearphase function, and Bφ is the factor whose atoms are the sets x : ‖φ(x) −j/K‖R/Z < 1/2K for j = 0, ..., K−1. Here ‖·‖R/Z denotes the distance fromthe argument to the nearest integer. The complexity refers to the number ofphase functions in its definition, and its resolution refers to the width of thelevel sets. A pure quadratic factor is defined in the same way but with locallyquadratic phase factors relative to B, which is the atom of a linear factorWe can therefore define a (general) quadratic factor to be a pair (B1,B2) offactors where B1 is a linear factor and B2 is a factor whose restriction to eachatom of B1 is a pure quadratic factor. These will turn out to be the factorsrelevant in our analysis.

8.5 Proof Outline

We will now prove Theorem 3. It turns out that if f satisfies the givenhypotheses then it has a high relative expectation on a quadratic factor. Thefollowing proposition finds this factor; its proof is discussed in Section 8.6.Here Btriv denotes the factor generated by [N ], namely Btriv = ∅, [N ],Z/pZ\[N ],Z/pZ.Proposition 4. Let the assumptions be as in Theorem 3. Then there existsa quadratic factor (B1,B2) in Z/pZ of complexity and resolution bounded byO(δ−C) and an atom B2 of B2 ∨ Btriv of density PZ/pZ(B2)≫ exp(−O(δ−C))and contained in [N ] such that

EB2(f) ≥ (1 + c)δ

for some absolute constant c > 0.

Next, we need to partition the quadratic atom into arithmetic progres-sions, as in the following proposition.

Proposition 5. Let (B1,B2) in Z/pZ of complexity at most (d1, d2) and someresolution K, and let B2 be an atom of B2. Then one can partition B2 ∩ [N ]

as the union of ≪ dO(d2)2 N1−c/(d1+1)(d2+1)3 disjoint arithmetic progressions in

Z/pZ.

64

Page 65: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

An appropriate form of the pigeonhole principle gives us an arithmeticprogression of length at least exp(−O(δ−C))N cδC

on which f has averagevalue at least (1 + 1

2c0)δ. Careful control of the constants shows that the

length of the arithmetic progression is indeed large enough.

8.6 The Gowers U 3-norm and the Quadratic Bohr Sets

The proof of Proposition 4 relies on the Gowers U3-norm, which is definedfor functions φ : Z/pZ→ C by

‖f‖8U3(Z/pZ) = Ex,hk∈Z/pZf(x)f(x + h1)f(x+ h2)f(x+ h3)f(x+ h1 + h2)×× f(x+ h2 + h3)f(x+ h1 + h3)f(x+ h1 + h2 + h3).

The reason for using the Gowers U3-norm is the following form of the GowersInverse U3 Theorem.

Theorem 6 (Inverse U3(Z/pZ) theorem for quadratic factors). Let f :Z/pZ → C be a 1-bounded function such that ‖f‖U3(Z/pZ) ≥ η for someη ∈ (0, 1). Suppose also that K is an integer such that K ≥ Cη−C forsome sufficiently large constant C > 0. Then there exists a quadratic factor(B1,B2) in Z/pZ of complexity at most (O(η−C), 1) and resolution K suchthat

‖E(f |B2)‖L1(Z/pZ) ≫ ηC .

The original theorem is proven in [2]. This version can be derived by anaveraging argument from the usual form for locally quadratic phase functions.

Iterating the U3 theorem gives the following bound.

Theorem 7 (Quadratic Koopman-von Neumann). Let f : Z/pZ → [−1, 1]be a 1-bounded function, and let η > 0. Suppose also that K is an integer suchthat K ≥ Cη−C for some sufficiently large constant C > 0. Then there existsa quadratic factor (B1,B2) in Z/pZ of complexity at most (O(η−C), O(η−C))and resolution K such that

‖f − E(f |B2 ∨ Btriv)‖U3(Z/pZ) ≤ η.

Explicitly, this theorem is proven using an energy increment method.We construct the quadratic factor (B1,B2) iteratively. Initialize B1 = B2 =∅,Z/pZ. At each step we apply Theorem 6 to f − E(f |B2 ∨ Btriv). Ele-mentary estimates show that the “energy” ‖E(f |B2 ∨ Btriv)‖2L2 increases by

65

Page 66: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

at least ηC at each step. Because the energy is bounded above by 1, there areat most η−C steps. This algorithm therefore constructs the desired quadraticfactor with the given complexity.

We are now in a position to prove Proposition 4. If g = E(f |B2 ∨ Btriv),then applying the quadratic Koopman-von Neumann theorem above to Λshows that we can replace f with g in the following sense.

Corollary 8 (Anomalous AP4 count on a quadratic factor). Let the assump-tions be as in Theorem 3. Then there exists a quadratic factor (B1,B2) inZ/pZ of complexity at most (O(δ−C), O(δ−C)) and resolution O(δ−C) suchthat the function g := E(f |B2 ∨ Btriv) obeys

|Λ(g, g, g, g)− Λ(δ1[N ], δ1[N ], δ1[N ], δ1[N ])| ≫ δ4.

With this corollary proven, let Ω be the set of numbers in Z/pZ whereg ≥ (1+c)δ. Norm control of Λ, along with the previous reduction, show thatPZ/pZ(Ω) ≫ δ4. Our bounds on the complexity and resolution of B2 ∨ Btriv

further bound the number of atoms, which by the pigeonhole principle givesus our desired high-density atom.

References

[1] Green, B. and Tao, T., New bounds for Szemeredi’s theorem, II: a newbound for r4(N). arXiv:math/0610604v1 [math.NT], pp. 1–26.

[2] Green, B. and Tao, T., An inverse theorem for the Gowers U3(G) norm.Proc. Edinb. Math. Soc. (2) 51 (2008), no. 1, pp. 73–153.

[3] Roth, K., On certain sets of integers. J. London Math. Soc. 28 (1953),pp. 245–252.

[4] Szemeredi, E., On sets of integers containing no four elements in arith-metic progression. Acta Math. Acad. Sci. Hungary. 20 (1949), pp. 89–104.

[5] Szemeredi, E., On sets of integers containing no k elements in arith-metic progression. Acta Arith. 27 (1975), 299–345.

Kenneth Maples, UCLA

email: [email protected]

66

Page 67: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

9 An inverse theorem for the Gowers U3(G)

norm

after Ben Green and Terence Tao [1]A summary written by Eyvindur Ari Palsson

Abstract

The sequence of Gowers uniformity norms Ud(G), d = 1, 2, 3, . . .on a finite additive group G, are a great tool in the study of arith-metic progressions in various sets. To detect arithmetic progressionsof length k in G it is important to know when the Uk−1(G) norm canbe large. We state an inverse theorem for the U3(G) norm on a finite,additive group G of odd order and give an outline of the proof.

9.1 Introduction

Let G be a finite additive group, that is a finite group with a commutativegroup operation +. Throughout the talk, we will use N := |G| for thecardinality of G.

Let f : G → H be a function from one additive group to another andh ∈ H . We define the shift operator T h applied to f by the formula T hf(x) :=f(x+ h). We define the difference operator h · ∇ := T h − 1 applied to f bythe formula (h ·∇)f(x) := f(x+h)− f(x). We then extend these definitionsto functions of several variables by subscripting the variable to which theoperator is applied. For example T h

x f(x, y) = f(x+ h, y).If f : G → C is a complex valued function, and B ⊆ G is a non-empty

subset of G, we will write Ex∈Bf(x) := 1|B|∑

x∈B f(x), which is the average

of f over B. We will further write Ex∈Gf(x) as Ef(x) when the domain Gof f is clear from the context.

If f0, . . . , fk−1 : G→ C, we define the k-linear form Λk(f0, . . . , fk−1) ∈ C

byΛk(f0, . . . , fk−1) := Ex,r∈Gf0(x)T rf1(x) . . . T (k−1)rfk−1(x).

Note that if A ⊆ G and f0 = . . . = fk−1 = 1A then Λk(1A, . . . , 1A) is thenumber of progressions of length k, including those with common difference0, divided by the normalizing factor of N2.

If (N, (k − 1)!) = 1 and A contains no proper progressions of length kthen we see that Λk(1A, . . . , 1A) = |A|/N2, which will be quite small when

67

Page 68: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

N is large. We are thus interested in determining whether Λk(1A, . . . , 1A) issmall or large.

Definition 1. (Gowers uniformity norm). Let d ≥ 1, and let f : G→ C bea function. We define the Gowers uniformity norm ‖f‖Ud(G) ≥ 0 of f to bethe quantity

‖f‖Ud(G) :=

Ex∈G,h∈Gd

ω∈0,1d

C|ω|T ω·hf(x)

1/2d

,

where ω = (ω1, . . . , ωd), h = (h1, . . . , hd), ω · h := ω1h1 + . . . + ωdhd, |ω| :=ω1 + . . .+ ωd, and C is the conjugation operator Cf(x) := f(x).

It turns out that ‖·‖U1(G) is not a norm. However for d > 1, one can showthat ‖·‖Ud(G) is indeed a norm.

An equivalent definition of the ‖f‖Ud(G) norms is given by the recursiveformulae

‖f‖U1(G) = |E(f)|

‖f‖Ud(G) :=(

Eh∈G‖T hf f‖2d−1

Ud−1(G)

)1/2d

for all d ≥ 2.This latter definition and induction give us a monotonicity property

‖f‖Ud(G) ≤ ‖f‖Ud+1(G) for d = 0, 1, 2, . . .

Since the k-linear form Λk can be used to count arithmetic progressionsin a set then the following theorem shows how the Gowers uniformity normscan be used to bound the number of those arithmetic progressions.

Proposition 2. (Generalized von Neumann Theorem). Let G be a finiteabelian group with (N, (k − 1)!) = 1. Let f0, . . . , fk−1 : G → D := z ∈ C :|z| ≤ 1 be functions. Then we have

|Λk(f0, . . . , fk−1)| ≤ min1≤j≤k

‖fj‖Uk−1(G)

Let e : R/Z→ C be the exponential map e(x) := e2πix. If f has the formf(x) := e(φ(x)), for some phase function φ : G→ R/Z, then

‖f‖2d

Ud(G) = Ex,h1,...,hd∈Ge((h1 · ∇x) . . . (hd · · ·∇x)φ(x)).

68

Page 69: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

This shows that the Ud norm in some sense measures the oscillation inthe dth “derivative” of the phase, since h · ∇ is the discrete analogue of adirectional derivative operator.

Definition 3. (Locally polynomial phase functions). If B is any non-emptysubset of a finite additive group G and d ≥ 1, we say that a function φ : B →R/Z is a polynomial phase function of order at most d− 1 locally on B if wehave

(h1 · ∇x) . . . (hd · ∇x)φ(x) = 0

whenever the cube (x + ω1h1 + . . . + ωdhd)ω1,...,ωd∈0,1 is contained in B. Iff : B → C is a function, we define the local polynomial bias of order d on B‖f‖ud(B) to be the quantity

‖f‖ud(B) := sup |Ex∈B(f(x)e(−φ(x)))|

where φ ranges over all local polynomial phase functions of order at mostd− 1 on B.

‖f‖ud(B) is a seminorm, ‖f‖ud(B) ≤ ‖f‖ud+1(B), ‖f‖ud(B) = ‖f‖ud(B) and‖T hf‖ud(G) = ‖f‖ud(G). When φ is a locally polynomial phase of degree atmost d− 1 on B then ‖fe(φ)‖ud(B) = ‖f‖ud(B). Similarly when φ is a globalpolynomial phase function of degree at most d − 1 then it turns out that‖fe(φ)‖Ud(G) = ‖f‖Ud(G).

Using this last invariance we get

‖f‖Ud(G) = ‖fe(−φ)‖Ud(G) ≥ ‖fe(−φ)‖U1(G) = |Ex∈G(f(x)e(−φ(x)))|

whenever φ is a global polynomial phase of degree at most d − 1 andtaking suprema over all such φ we end up with

‖f‖Ud(G) ≥ ‖f‖ud(G)

for all d ≥ 1.Let G be the Pontryagin dual of G, in other words the space of homo-

morphisms ξ : x 7−→ ξ · x from G to R/Z. G is an additive group which isisomorphic to G. We define the Fourier coefficient f(ξ) of f at the frequencyξ ∈ G by the formula

f(ξ) = Ex∈Gf(x)e(−ξ · x).

69

Page 70: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

We have some classical Fourier results for those new Fourier coefficients,for example the Fourier inversion formula and the Plancherel identity.

Using the fact that polynomials of degree at most 0 are constant we caneasily see that

‖f‖U1(G) = ‖f‖u1(G).

Next using some straightforward Fourier techniques one can proof

Proposition 4. (Inverse theorem for U2(G) norm). Let f : G → D be abounded function. Then

‖f‖u2(G) ≤ ‖f‖U2(G) ≤ ‖f‖1/2

u2(G)

It is tempting to conjecture that the two norms Ud(G) and ud(G) are alsorelated for higher d.

The paper establishes an inverse theorem in the case where d = 3 andG = Fn

5 is a finite field. However due to applications then we are interestedin more general groups, for example Z/NZ. Unfortunately then in generalit does not hold that any bounded function with a small ud(G) norm musthave a small Ud(G) norm. Let’s look at an example from Furstenberg andWeiss that illustrates this point.

Example 5. (Furstenberg and Weiss). Let N be a large prime number, andlet M be the largest integer less than or equal to

√N . Let G := Z/NZ, and

let f : G→ C be the bounded function defined by setting

f(yM + z) := e(yz/M)ψ(y/M)ψ(z/M)

whenever −M/10 ≤ y, z ≤ M/10, and f = 0 otherwise. Here ψ : R → R≥0

is a non-negative smooth cutoff function which equals one on the interval[−1/20, 1/20] and vanishes outside of [−1/10, 1/10]. Then a direct calcula-tion shows that ‖f‖U3(G) ≥ c0 for some absolute constant c0 > 0 wheras aWeyl sum computation reveals that E(fe(−φ)) = O(N−c) for any quadraticphase function φ and some explicit constant c > 0.

The key element in the example was that the function yM + z 7−→ yz/Mis locally quadratic on the set B := yM + z : −M/10 ≤ y, z ≤ M/10 butdoes not extend to a globally quadratic phase function on all of G. We needto account for those local quadratic phase functions in order to produce agenuine inverse theorem for the U3(G) norm. We also must understand thegeneralization of sets like B. We’ll accomplish that by looking at Bohr sets.

70

Page 71: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

9.2 The inverse theorem

Definition 6. (Bohr sets). Let G be a finite additive group, and let S ⊆ G,|S| = d be a subset of the dual group. We define a sub-additive quantity ‖·‖Son G by setting

‖x‖S := supξ∈S‖ξ · x‖R/Z,

where ‖·‖R/Z denotes the distance to the nearest integer, and define the Bohrset B(S, ρ) ⊆ G for any ρ > 0 to be the set

B(S, ρ) := x ∈ G : ‖x‖S < ρ for all ξ ∈ S

The dependence of the Bohr set B(S, ρ) on ρ can be rather discontinuousbut fortunately we can get past that in applications by restricting our atten-tion to regular Bohr sets. We can show that there is an abundance of thoseregular Bohr sets.

Definition 7. (Regular Bohr sets). Let S ⊆ G, |S| = d, be a set of charac-ters, and suppose that ρ ∈ (0, 1). A Bohr set B(S, ρ) is said to be regular ifone has

(1− 100d|κ|)|B(S, ρ)| ≤ |B(S, (1 + κ)ρ| ≤ (1 + 100d|κ|)|B(S, ρ)|

whenever |κ| ≤ 1/100d.

Theorem 8. (Inverse theorem for U3(G)). Let G be a finite, additive groupof odd order, let f : G→ D be a bounded function and let 0 < η ≤ 1.

(i) If ‖f‖U3(G) ≥ η, then there exists a regular Bohr set B := B(S, ρ) in Gwith |S| ≤ (2/η)C and ρ ≥ (η/2)C such that

Ey∈G‖f‖u3(y+B) ≥ (η/2)C,

where it is permissible to take C = 224. In particular, there exists y ∈ Gsuch that ‖f‖u3(y+B) ≥ (η/2)C.

(ii) Conversely, if B = B(S, ρ) is a regular Bohr set, f : G→ D is a boundedfunction and ‖f‖u3(y+B) ≥ η, then we have

‖f‖U3(G) ≥ (η3ρ2/C ′d3)d

for some absolute constant C ′.

71

Page 72: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

9.3 Outline of proof for the inverse theorem

The proof of part (ii) is relatively straightforward so we’ll focus on how toprove part (i) of the inverse theorem. The first step is to establish a “phasederivative” h 7−→ ξh for the function f and to establish some additivity prop-erties on this phase derivative. We use the Balog-Szemerdi-Gowers theoremand Plunnecke inequalities, both from additive combinatorics. The followingproposition is the main result in this step.

Proposition 9. Let G be an arbitrary finite additive group, and let f : G→D be a bounded function such that ‖f‖U3(G) ≥ η for some η > 0. Then there

exists a set H ′ ⊆ G, a function ξ : H ′ → G whose graph Γ′ := (h, ξh) : h ∈H ′ ⊆ G× G obeys the estimates

|Γ′| ≥ 2−C1ηC′1N

and|kΓ′ − lΓ′| ≤ (2C2η−C′

2)k+l|Γ′| for all k, l ≥ 1.

Furthermore for each (h, ξh) ∈ Γ′ we have

|Ex∈GThf(x)f(x)e(−ξh · x)| ≥ η4/2.

The second step is the linearization of the phase derivative. That is, thefunction h 7−→ ξh, which roughly speaking captures the derivative of thephase of f , matches up with a locally linear function. Using the previousproposition we get the following key result in this step.

Proposition 10. (Large U3(G)-norm implies locally linear phase derivative).Let G be an arbitrary finite additive group, and let f : G→ D be a boundedfunction such that ‖f‖U3(G) ≥ η for some η > 0. Then there exists a set

S ⊆ G withd1 := |S| ≤ 2C3η−C′

3 ,

a regular Bohr set B1 := B(S, ρ) ⊆ B(S, 14) = B0 with ρ ∈ [ 1

16, 1

8], elements

x0 ∈ G and ξ0 ∈ G, and a function M : B0 → G obeying the local linearityproperty

M(h ± h′) = Mh±Mh′ whenever ‖h‖S, ‖h′‖S ≤1

8,

and such that

Eh∈B1 |Ex∈GTx0+hf(x)f(x)e(−(ξ0 + 2Mh) · x)| ≥ 2−C4ηC′

4.

72

Page 73: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

The third step is a symmetry argument.

Lemma 11. (Symmetry of derivative). Let the notation be as in the previousproposition. For any x, y ∈ B0, let x, y denote the anti-symmetric form

x, y := M(x) · y −M(y) · x.

Then there exists a set S3 of frequencies with S3 ⊇ 12S and d3 := |S3| ≤

2C5η−C′5, and a Bohr set B3 = B(S3, 2

−C6ηC′6) ⊆ B1, such that

‖x, z‖R/Z ≤ 2C7η−C′7‖x‖S3

The fourth and final step is then to use the symmetry of the derivativeto get rid of the quadratic phase component in the last proposition.

References

[1] Green, B. and Tao, T., An inverse theorem for the Gowers U3(G) norm,arXiv:math/0503014v3 [math.NT];

Eyvindur Ari Palsson, Cornell

email: [email protected]

73

Page 74: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

10 On the Erdos-Volkmann and Katz-Tao ring

conjectures

after J. Bourgain [2]A summary written by Chun-Yen Shen

Abstract

We summarize the results of [2], where it is shown that a Borelsubring of R cannot have Hausdorff dimension strictly intermediatebetween 0 and 1.

10.1 Introduction

The problem is raised in [1] on the existence of subrings R of the reals R,R a Borel set, with Hausdorff dimension strictly between 0 and 1. It wasalready shown by Falconer that if the Hausdorff dimension H−dimR > 1/2,then H − dimR = 1. But his argument, based on dimension considerationsof the distance set |a− b|; a, b ∈ R×R ⊂

√R, does not seem to cover the

range H−dimR ≤ 1/2. In fact, the particular issue whether there is a Borelring satisfying H − dimR = 1/2 and more specifically the discretized ringconjecture due to Katz and Tao [3] mentioned below, became recently a focusof attention because of its relevance to another problem of combinatorialmeasure theory, known as the Kakeya conjecture which says if A is a Borelsubset of R3 containing a unit line segment in every direction, then H −dimA = 3.

Now we state the version of the ring problem as considered in [3]. Firsta definition, a bounded subset A of R is called a (δ, σ)1-set provided A is aunion of δ-intervals and satisfies

|A ∩ I| < (γ

δ)1−σδ1−ǫ (1)

whenever I ⊂ R is an arbitrary interval of sizes δ ≤ γ ≤ 1 and ǫ is a smallparameter. Roughly speaking, a (δ, σ)1-set behaves like the δ-neighbourhoodof a σ-dimensional set.

It is conjectured in [3] that if A is a (δ, 1/2)1 set satisfying |A| > δ1/2+ǫ,then one must have |A+A|+ |AA| > δ1/2−c, with c > 0 an absolute constant.The following is the main result in [2]

74

Page 75: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Theorem 1. If A is a (δ, σ)1-set, 0 < σ < 1, such that |A| > δσ+ǫ. Then

|A+ A|+ |AA| > δσ−c

for an absolute value c = c(σ) > 0.

The case σ = 1/2 is given special attention because of its many con-nections and equivalent formulations described in [3]. In particular, for thediscrete version of this problem, when measure is replaced by cardinality,there is a result of Elekes that when A has finite cardinality ♯A, at least oneof A+A and AA has cardinality ≥ ♯A5/4. The proof of this result exploits theSzemeredi-Trotter theorem. This is heuristic evidence for the ring conjectureif one accepts the (somewhat questionable) analogy between discrete modelsand δ-discretized models.

Here we outline the proof of theorem 1. We proceed by contradiction,assuming

|A+ A|+ |AA| < δσ−ǫ.

The initial stages of the argument use only the additive information, thus|A + A| < δ−ǫ|A|. It is processed through a multi-scale construction, basedon Ruzsa’s sumset estimates and, most importantly, quantitative versions ofFreimann’s famous theorem on finite sets of reals with small doubling set.The key difficulty comes from the fact that the hypothesis |A+ A| < δ−ǫ|A|is by far too weak for a direct application to Freimann’s theorem and thedoubling constant δ−ǫ needs to be reduced. This is achieved using certainsub-multiplicativity properties of the doubling constants at various scales.The final product is a subset C of A with a tree-structure that exhibits a”multiscale porosity property”. At this point, we start using the multiplica-tive structure and prove the existence of elements x1, x2 ∈ A− A such that

|x1C + x2C| > δσ−κ,

where κ > 0 is an absolute constant. In fact, the elements x1, x2 are obtainedrandomly according to a probability measure on A−A. Thus the conclusionis that

|AA−AA + AA− AA| ≥ |A(A−A) + A(A−A)| > δσ−κ.

The contradiction comes from the fact that under our assumptions

|A| > δσ+ǫ, |A+ A|+ |AA| < δσ−ǫ,

75

Page 76: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

there is a subset A′ of A satisfying |A′| > δσ+Cǫ and |A′A′ − A′A′ + A′A′ −A′A′| < δσ−Cǫ (for some constant C). This non-trivial fact was proven byKatz and Tao [3] for δ = 1/2 and the arguments can be extended to thegeneral cases.

10.2 Preliminary results

We present a number of results to be used to prove theorem 1. The firstone is the so-called Freimann theorem and Lemma 4 is so-called Katz-Taolemma.

Theorem 2. Let A ⊂ R be a finite set such that

|A+ A| < K|A|.Then A is contained in a translation of a proper d-dimensional progressionP

P = b1ξ1 + · · ·+ bdξd; 0 < bi ≤ hi, 1 ≤ i ≤ dsatisfying

d < K

and|P | = Πd

i=1hi < exp(K2(logK)3)|A|.Lemma 3. Let A ⊂ R be a bounded set with δN(A, δ) < κ and assume

N(2A, δ)

N(A, δ)< K

where N(A, δ) denotes the δ-covering number. Then there are σ > 0, σδ′ >δ and a subset A′ of A contained in a union of intervals of size σδ′ andseparation at least δ′ satisfying the following conditions.

σ < (logK)Cκ(logK)−C

.

N(A′, δ) >N(A, δ)

Klog1/δ.

Lemma 4. Let A be a Borel set obtained as union of δ-intervals and 0 <σ < 1, satisfying |A| > δσ+ǫ, |A + A| + |AA| < δσ−ǫ. Then there is A′ ⊂ Awith |A′| > δσ+ǫ′ such that

|A′A′ − A′A′| < δσ−ǫ′ .

where ǫ′ → 0 when ǫ→ 0.

76

Page 77: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Proof. We outline the main steps and refer the reader to [3] for further details.As in [3], we use X / Y to mean X ≤ Cǫδ

−CǫY and X ≈ Y to mean X / Yand Y / X. A subset B of A is called a refinement of A provided B is aunion of δ-intervals and |B| ' |A|.

First using the assumptions, we may obtain refinements C,D of A suchthat for all (c, d) ∈ C ×D

|(a1, a2, a3, a4, a5, a6) ∈ A6; c−d = (a1−a4)−(a2−a5)+(a3−a6)| ≈ δ5σ (2)

The measure || refers here to the corresponding hyperplane in R6. We mayalso assume moreover

|c− d| ≈ 1, c ∈ C, d ∈ D.

Next, fix a1, a2, a3, a4, a5 ∈ A, c ∈ C, d ∈ D. Multiplying in (2) with a1a2a3

a4a5,

we get

|(e1, e2, e3, e4, e5, e6) ∈ (AAAA

AA)6;

a1a2a3

a4a5(c−d) = (e1−e4)−(e2−e5)+(e3−e6)| ≈ δ5σ.

Since by assumption on the set A ( together with the multiplicative versionof Ruzsa sumset estimates), we have

|AAAAAA

| ∼ δσ,

and by Fubini theorem we have

|AAA(C −D)

AA| . δσ.

Since|C|, |D|, |CD| ≈ δσ,

from the multiplicative version of (2), there are further refinements C ′ ⊂ C,D′ ⊂ D such that for all c ∈ C ′, d ∈ D′

|(c1, c2, c3, d1, d2, d3) ∈ C3 ×D3; cd = c1d1(c2d2)−1c3d3| ≈ δ5σ (3)

Now we fix c, c′ ∈ C ′ and d, d′ ∈ D′. Denote X the set obtained in (3). If(c1, c2, c3, d1, d2, d3) ∈ X, we have

cd− c′d′ = x1 − x2 + x3 − x4,

77

Page 78: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

where

x1 =(c1 − d′)d1c3d3

c2d2

x2 =(c′ − d1)d′c3d3

c2d2

x3 =(c3 − d2)d

′c′d3

c2d2

x4 =(c2 − d3)d

′c′d2

c2d2

The map (c1, c2, c3, d1, d2, d3) 7→ (x1, x2, x3, x1, c2, d2) is clearly a diffeomor-phism. Therefoe we have

|(x1, x2, x3, x1, c2, d2) ∈ (AAA(C −D)

AA)4×C×D, cd−c′d′ = x1−x2+x3−x4| & δ5σ.

Finally, by Fubini theorem again, we may conclude that

δ4σ × δσ × δσ & |AAA(C −D)

AA|4|C||D| & |C ′D′ − C ′D′|

and hence by letting A′ = C ′′ we get the desired result.

10.3 Outline of Proof of Theorem 1

Using notation from [3], a bounded subset A of R is called a (δ, σ)1-setprovided A is a union of δ-intervals and

|A ∩ I| < (γ

δ)1−σδ1−ǫ.

for any interval I(x, γ), δ ≤ γ ≤ 1 and ǫ is a very small number. We proceedby contradiction, thus we assume

|A| > δσ+ǫ, |A+ A|+ |AA| < δσ−ǫ.

¿From lemma 4, we may assume moreover

|AA−AA| < δσ−ǫ

78

Page 79: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

and hence|AA− AA+ AA− AA| < δσ−ǫ.

Next at the first stage, we use the bound on the sumset, i.e |A+ A| < δσ−ǫ.¿From lemma 3, we construct a subset C of A, |C| > δǫ1|A|, which has a treestructure. The key property is multi-scale porosity. This means that at eachlevel of the tree, C is a union of subsets contained in disjoint intervals of acertain size which is small w.r.t their mutual separation. One then considersa set of the form

x0C + xC

where x0, x ∈ A−A, x0 fixed and x considered as a random variable governedby a measure ν on A− A. Using the porosity property, we have that

∫|x0C + xC|ν(dx) > δ−ǫ2|C|.

Here ǫ2 ≫ ǫ1 ≫ ǫ. Thus

|AA−AA + AA− AA| ≥ |(A− A)C + (A− A)C| > δ−(ǫ2−ǫ1−ǫ)+σ

and in conclusion proves theorem 1.

10.4 Applications

In [3], Katz and Tao investigate three unsolved conjectures in geometriccombinatorics, namely Falconer’s distance problem, the dimension of setsof Furstenburg’s type, and Erdos ring problem. They reduce these geomet-ric problems to δ-discretized variants and show that these variants are allequivalent.

10.4.1 The Falconer distance problem

For any compact subset K of the plane R2, define the distance set dist(K)of K by

dist(K) = |x− y|; x, y ∈ K.Falconer conjectured that if dimK ≥ 1, then dim(dist(K)) = 1, where dimKdenotes the Hausdorff dimension of K. As progress towards this conjecture,the best result to date is by Wolff who showed that dim(dist(K)) = 1 pro-vided dimK ≥ 4/3.Now suppose that one only assumes that dimK ≥ 1. An argument of Mattilashows that dim(dist(K)) ≥ 1/2. Hence by theorem 1 we have

79

Page 80: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Theorem 5. There exists an absolute constant c > 0 such that

dim(dist(K)) ≥ 1/2 + c,

whenever K is compact and satisfies dimK ≥ 1.

10.4.2 Dimension of sets of Furstenburg type

Let 0 < β ≤ 1. We define a β-set to be a compact set K ⊂ R2 such that forevery direction ω ∈ S1 there exists a line segment ℓω with direction ω whichintersects K in a set with Hausdorff dimension at least β. We let γ(β) bethe infimum of the Hausdorff dimensions of β-sets.At present the best bounds known are

max(β +1

2, 2β) ≤ γ(β) ≤ 3

2β +

1

2.

The most interesting value of β appears to be β = 1/2. In this case the twolower bounds on γ(β) coincide to become γ(1/2) ≥ 1. Again, by theorem 1we have

Theorem 6. The 1/2-sets must have Hausdorff dimension at least 1 + c forsome absolute constants c > 0.

Remark 7.

The fact that R is a totally ordered field is relevant, since the analogueof Erdos’s ring problem is false for non-ordered fields such as the complexnumbers or the finite field Fp2. The analogues of Falconer’s distance problemand the conjectures for Furstenburg sets also fail for these fields.

These problems are also related to the Kakeya problem in three dimen-sions, although the connection is more tenuous. The proof of ring conjecturealso leads to an alternate proof of a result in [4], namely that Besicovitch setsin R3 have Minkowski dimension strictly greater than 5/2, and would notrely as heavily on the assumption that the line segments all point in differentdirections. Very informally, the point is that the arguments in [4] can bepushed a bit further to conclude that a Besicovitch set of dimension exactly5/2 must essentially be a Heisenberg group over a ring of dimension 1/2.

80

Page 81: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

References

[1] P. Erdos and B. Volkmann, Additive Gruppen mit vorgegebener Haus-dorffscher Dimension . J. Reine Angew. Math 221 (1966),pp. 203–208;

[2] J. Bourgain, On the Erdos-Volkmann and Katz-Tao ring conjectures.GAFA, 13 (2003), pp. 334-365.

[3] N. Katz and T. Tao, Some connections between Falconer’s distance setconjectures and sets of Furstenberg type. New York J.Math, 7 (2001),pp. 149-187.

[4] N. Katz, I. Laba and T. Tao, An improved bound for the Minkowskidimension of Besicovitch sets in R3. Annals of Math, 152 (2002), pp.346-383.

Chun-Yen Shen, Indiana University

email: [email protected]

81

Page 82: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

11 Quantitative bounds for Freiman’s Theo-

rem

Abstract

A summary of the Chang/Ruzsa proof of Freiman’s theorem.

11.1 Introduction

Let Z be an additive group (usually Z or the cyclic group ZN). If d ≥ 1 isan integer, a d-dimensional generalized arithmetic progression (g.a.p.) in Zis a set P which can be written

P = a+ x1v1 + · · ·+ xdvd : 0 ≤ xi < li, 1 ≤ i ≤ d (4)

=: P (v1, . . . , vd; l1, . . . , ld; a),

for a, v1, . . . , vd ∈ Z and non-negative integers l1, . . . , ld. The length of P is

ℓ(P ) :=

d∏

j=1

ld.

P is proper if |P | = ℓ(P ), or equivalently if each sum in (4) is distinct.Freiman’s theorem states that if |2A| = |A + A| is small relative to |A|,

then A is comparable to a g.a.p.:

Theorem 1. Let A ⊂ Z be an additive set satisfying |2A| ≤ K|A| for someK > 0. Then A is contained in a g.a.p. of dimension d ≤ d(K) and lengthℓ(P ) ≤ C(K)|A|.

According to Chang [2], Freiman’s original proof (1973) was a difficultread, and it wasn’t until the 90s that Bilu and Ruzsa offered more accessiblearguments. In all of these results, the bounds on the function C were dou-bly exponential in K. In 2002, Chang [2] improved on Ruzsa’s argument,addressing two inefficiencies and thereby proving the following theorems:

Theorem 2. Given an additive set A ⊂ Z with |2A| ≤ K|A|, the conclusionsof Freiman’s theorem hold with d(K) and logC(K) bounded by CK2(logK)2.

It is conjectured that one can find P ⊃ A with d(K) = O(K) and C(K) =eO(K). See next section for other conjectures and an explanation why this isoptimal.

82

Page 83: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Theorem 3. Given an additive set A ⊂ Z with |2A| ≤ K|A|, A is con-tained in a g.a.p. P which is proper, has dimension d ≤ [K − 1], and whosecardinality satisfies

log|P ||A| ≤ CK2(logK)3.

Along the way, Chang made an improvement to a structure theorem ofRuzsa, which has found applications independent of Freiman’s theorem.

Theorem 4. If A ⊂ Z is an additive set and |2A| ≤ K|A|, then 2A − 2Acontains a proper symmetric g.a.p. P of dimension d ≤ O(K+K logK) andcardinality |P | ≥ exp(−O(K(1 + log2K)))|A|.

11.2 Applications, remaining conjectures

Much of the material for this section was taken from [5], particularly chapter5.

Freiman’s theorem, particularly the quantitative bounds established byChang have been useful in much of the recent literature on arithmetic com-binatorics, including several of the papers discussed at this summer school,in particular, the articles of: Bourgain, Katz, and Tao on the sum-productestimate; Tao and Green on Gowers’ U3(G) norm; Bourgain on the ringconjectures; and Green and Sanders on the idempotent theorem.

The following conjectures are still open:

Conjecture 5. (Polynomial Freiman-Ruzsa) Suppose A ⊂ Z is an additiveset satisfying |2A| ≤ K|A|. Then there exists a g.a.p. P with

dim(P ) ≤ CKO(1) |P | ≤ CKO(1)|A|

such that one has |A ∩ P | ≥ cK−O(1)|A|.Note: the conclusion is that A∩P is large, not that A ⊂ P . One can see

that with the conclusion A ⊂ P , the best one could hope for is

dim(P ) ≤ CK |P | ≤ C exp(CK)|A|

by considering (for instance) the set 2[0,K−1] = 1, 2, . . . , 2K−1.Conjecture 6. (Gowers, see [3]) If A ⊂ Z is an additive set satisfying|2A| ≤ K|A| and ε > 0, then there exists a g.a.p. P with dim(P ) ≤ CKε

such that |P | is reasonably small and |A ∩ P | is reasonably large.

83

Page 84: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

In many of the above mentioned applications, a proof of either of theseconjectures would yield substantially improved numerical results.

11.3 The proof of Theorem 2

Roughly the Chang/Ruzsa’s argument breaks down into three major por-tions. First, reduce to the case when A is contained in ZN for some primeN . Next, find a large proper g.a.p. P0 ⊂ 2A − 2A of small degree. Finally,use P0 to produce the g.a.p. P (having small dimension and length) whichcontains A.

11.3.1 Reduction to A ⊂ ZN .

For details, particularly the proof of the theorems of Ruzsa and Plunecke,see [4].

Let A and B be subsets of additive groups Y and Z, respectively. Leth ≥ 1 be an integer. A function φ : A → B is a Freiman homomorphism oforder h if whenever a1, . . . , ah, a1

′, . . . , ah′ ∈ A,

a1 + · · ·+ah = a1′ + · · ·+ah

′ =⇒ φ(a1) + · · ·+φ(ah) = φ(a1′) + · · ·+φ(ah

′).

The function φ is a Freiman isomorphism of order h if φ is a bijection fromA onto B and both φ and φ−1 are Freiman homomorphisms of order h

What’s the point? We cannot identify non-trivial subsets of Z with sub-sets of ZN while preserving arithmetic relations of all orders, but fortunately,for the purposes of Freiman’s theorem, we only need to preserve arithmeticrelations of low order and g.a.p.’s. For this one can show that Freiman iso-morphisms suffice.

Ruzsa proved that one can indeed identify subsets of A with subsets ofZN via Freiman isomorphisms:

Theorem 7. Let A ⊂ Z be finite, non-empty. Let h ≥ 2 and let m ≥4h|hA− hA|. Then there exists A′ ⊂ A with

|A′| ≥ |A|h

which is Freiman isomorphic of order h to Zm.

84

Page 85: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

The remainder of the proof proceeds as follows: Identify A′ ⊂ A witha subset R ⊂ ZN via a Freiman isomorphism φ of order 8. Find a g.a.p.P0 ⊂ 2R − 2R. Then Q0 := φ−1(P0) is a g.a.p. contained in 2A− 2A. UseQ0 to construct a g.a.p. containing A.

To obtain quantitative bounds on P0 in the middle step, we will need forN to be small relative to |A| and prime. Once we have achieved smallness ofN , primality is easy since there is always a prime between N and 2N . Forsmallness, looking back at the previous theorem, it suffices to show that |2A|small implies |8A − 8A| is small. For this, one uses Plunnecke’s theorem,which states that if |2A| is small relative to |A|, then the same holds for alldifference sets kA− lA with k, l non-negative integers.

11.3.2 Finding a progression in 2A− 2A.

Suppose A ⊂ ZN , where N is prime, |2A| ≤ K|A| and δ := |A|/N > cK−16.By a method due to Bogolyubov, given any subset A ⊂ ZN (N prime),

there is a proper g.a.p. P0 ⊂ 2A− 2A such that

dim(P0) ≤ p1(δ−1) log(|P0|N

) ≥ p2(δ−1),

for polynomials p1 and p2. In our case, this implies bounds which are poly-nomial in K. This was the method used by Ruzsa.

Chang modified Bogolyubov’s argument to take advantage of the extrainformation |2A| ≤ K|A|. This gave improved bounds of the same type asthose above. We will briefly sketch the argument behind this improvement.

Two crucial concepts for this portion of the proof are Bohr sets anddissociated sets.

Let S ⊂ ZN and ε > 0. Then the Bohr neighborhood of S with radius εis defined as

B(S; ε) := x ∈ ZN : ‖nxN‖R/ZN

< ε, ∀n ∈ S.

A subset λ1, . . . , λd of an additive group Z is dissociated if wheneverεj ∈ −1, 0, 1, 1 ≤ j ≤ d are not all zero,

j

εjλj 6= 0.

For some choice of c, ε ∼ 1, if one lets

Γ := x ∈ ZN : |1A(x)| ≥ cK−1/2δ

85

Page 86: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

and B := B(Γ; ε), then B ⊂ 2R− 2R and d := |Γ| is small.Then, we refine Γ to a maximal dissociated subset of Λ ⊂ Γ. An ap-

plication of Rudin’s theorem shows that such Γ has small size, and by themaximality of Γ,

B(Λ,ε

|Λ|) ⊂ B(Γ, ε).

At this stage, one may apply a theorem of Bogolyubov which guaranteesthe existence of proper g.a.p.’s of small dimension, large cardinality in Bohrsets. This allows one to produce a g.a.p. P0 ⊂ B ⊂ 2A − 2A which obeysthe bounds in the structure theorem, Theorem 4.

11.3.3 From P0 ⊂ 2A− 2A to P ⊃ A

It is in the passage from the proper g.a.p. P0 ⊂ 2A−2A to the (not necessarilyproper) g.a.p. P ⊃ A that Chang’s argument offers the greatest numericalgains (from doubly to singly exponential) over Ruzsa’s argument, though thetwo are similar in spirit. We describe the simpler argument of Ruzsa herebecause of limited space.

Ruzsa’s argument: Let a1, . . . , as ⊂ A to be a maximal set with theproperty that the sets ai + P0 are pairwise disjoint. One can show thatA ⊂ a1, . . . , as+ P0−P0 ⊂ P1 for some (not necessarily proper) g.a.p. P1.If d := dim(P0), then one can show that the g.a.p. P1 satisfies

dim(P1) ≤ s+ d ℓ(P1) ≤ 2s+d|P0|.

One can also show (using Plunnecke’s inequality) that s ≤ CdK5dd, whichtogether with the bounds for d and |P1| described above, proves Freiman’stheorem, but with non-optimal values of d(K) and |C(K)|.

11.4 Producing a proper progression of small rank.

Starting from a g.a.p. P as described in Theorem 2, Chang can provesthe existence of an g.a.p. P ′ satisfying the conclusions of Theorem 3 via amodified (to maintain the small cardinality bound) argument of Freiman, asdescribed by Bilu in [1].

Roughly, this proceeds as follows. We have a g.a.p.

P = P (v1, . . . , vd; l1, . . . , ld; a).

86

Page 87: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

We can assume a is zero. Associated to P are the homomorphism

φ : Zd → Z φ(ej) = vj

and the parallelogram B :=∏d

j=1[−li + 1, li − 1]. Note that φ(B) ⊃ A.If φ|2B∩Zd is not one-to-one, one can reduce the dimension by 1, eventuallyobtaining a homomorphism which is one-to-one on 2B.

This homomorphism φ is then a 2-Freiman isomorphism from 2B ∩Zd toits image in Z. One uses φ to pull A back to a subset A′ ⊂ Zd, maintainingthe inequality |2A′| ≤ K|A|. The inequality implies that A′ must be a [K−1]-dimensional set. Let Γ be the affine space spanned by A′. The g.a.p. P ′ isthe image of φ|Γ∩B∩Zd. We still have A ⊂ P ′, but now dim(P ) ≤ [K − 1].

References

[1] Bilu, Y., ”Structure of sets with small sumset” in Structure Theory ofSet Addition, Astrisque 258, Soc. Math. France, Montrouge, 1999, 77–108.

[2] Chang, M.-C., A polynomial bound in Freiman’s theorem. Duke Math.J. 113 (2002), no. 3, 399–419.

[3] Green, B. J., Structure Theory of Set Addition, Lecture notes,http://www-math.mit.edu/ green/notes.html.

[4] Nathanson, M. B., Additive Number Theory: Inverse Problems and theGeometry of Sumsets, Springer, New York, 1996.

[5] Tao, T. and Vu, V. H., Additive Combinatorics, Cambridge Univ. Press,Cambridge, 2006.

Betsy Stovall, UC Berkeley

email: [email protected]

87

Page 88: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

12 Norm Convergence of Multiple Ergodic

Averages of Commuting Transformations

after Terence Tao [2]A summary written by Zhiren Wang

Abstract

In [2] it is shown that the averages 1N

∑N−1n=0

∏li=1 fi(T

ni x) converge

in L2(X) for commuting measure-preserving transformations Tili=1

and L∞ functions fili=1 on a probability (X,X , µ). The proof ispurely combinatorial by converting the original problem to a finitaryone, where both the underlying space and the number of terms in theconverging sequence which need to be studied are finite.

12.1 Introduction

Let (X,X , µ) be a probability space and T : X 7→ X be a measure pre-serving transformation, i.e. T is X -measurable and T∗µ = µ. The vonNeumann mean ergodic theorem claims for any f ∈ L2(X,X , µ) the averages1N

∑N−1n=0 f(T nx) are convergent in L2(X) as N →∞.

The main theorem of the present work [2] deals with a generalizationwhere there are more than one transformations and functions.

Theorem 1. For l ≥ 1, if T1, · · · , Tl are commuting measure-preservingtransformations in a probability space (X,X , µ) and f1, · · · , fl ∈ L∞(X,X , µ)then 1

N

∑N−1n=0

∏li=1 fi(T

ni x)∞N=1 converge in L2(X,X , µ).

When l = 1 this gives the conventional mean ergodic theorem. Manyother special cases (with small l or additional assumptions on Ti) have beenpreviously studied

12.2 Finitary versions of the main theorem

The first part of the proof reduces the main theorem in to a finitary state-ment. This is achieved in two steps.

Before we start, it should be remarked that (X,X , µ) can be assumed tobe ergodic under the Z

l action generated by T1, · · · , Tl, as it satisfies Theorem1 as long as all members in its ergodic decomposition do, which can be easilydemonstrated with dominated convergence theorem.

88

Page 89: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Notations: The average operator over a finite set Γ is denoted by Eγ∈Γ,i.e. Eγ∈Γf(γ) = 1

|Γ|∑

γ∈Γ f(γ). Note [N ] := 0, · · · , N − 1 for all N ∈ N.

12.2.1 Finite type convergence statement

Theorem 1 is equivalent to the following fact, which allows us to calculateestimates for only a finite number of terms.

Theorem 2. Given the same setting as in Theorem 1, ∀ǫ > 0, ∀F : N 7→ N,∃M ∈ N such that

‖En∈[N ]

l∏

i=1

fi T ni − En∈[N ′]

l∏

i=1

fi T ni ‖L2(X) < ǫ, ∀N,N ′ ∈ [M,F (M)].

It’s clear that Theorem 1 implies Theorem 2. To see the other direction,observe if Theorem 1 fails then ∃ǫ, ∀M ∈ N, ∃F (M) > M , ‖En∈[M ]

∏li=1 fi

T ni − En∈[F (M)]

∏li=1 fi T n

i ‖L2(X) > ǫ, which contradicts Theorem 2. Thistype of reduction works for general convergence statements.

12.2.2 Discretization of the space

In his ergodic proof ([1]) to Szemeredi theorem, Furstenberg introduced theFurstenberg correspondence principle which assigns a dynamical system toa combinatorial object. The current step is the reverse of that correspon-dence, similar to the implication from Szemeredi theorem to Furstenbergmultirecurrence theorem.

Equip the finite additive group ZlP with uniform probability measure, de-

note by Si the shift on the i-th coordinate: Si(a1, · · · , al) = (a1, · · · , ai−1, ai+1, ai+1, · · · , al).

Theorem 2 is implied by the following

Theorem 3. Fix l ∈ N, ∀ǫ > 0, ∀F : N 7→ N, ∃M∗ ∈ N such that thefollowing property holds:

if P ∈ N, fili=1 are functions on the finite additive group ZlP whose

values are bounded in [−1, 1], then ∃M ≤M∗ such that

‖En∈[N ′]

l∏

i=1

fi Sni −En∈[N ′]

l∏

i=1

fi Sni ‖L2(Zl

P ) < ǫ, ∀N,N ′ ∈ [M,F (M)].

89

Page 90: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Proof of Theorem 3 ⇒ Theorem 2: Fix (X,X , µ), fi’s, Ti’s, ǫ and F , applyTheorem 3 to ǫ

2and get a M∗.

For v ∈ Zl and x ∈ X, let T vx = T v11 T v2

2 · · ·T vll x. As we assumed the

system is ergodic under the joint action of Ti’s, for any g ∈ L∞(X) thereexists a null set Ωg with µ(Ωg) = 0 and Ev∈[P ]lg(T vx) →

∫gdµ, ∀x /∈ Ωg

since [P ]l∞P=1 is a Følner sequence in the amenable group Zl. Let G be thecollection of functions which are rational polynomials of functions of the formfi T n

i , then G is countable, thus we can pick a generic point x0 /∈⋃

g∈G Ωg.

gN,N ′(x) := |En∈[N ′]

∏li=1 fi(T

ni x) − En∈[N ′]

∏li=1 fi(T

ni x)|2 ∈ G, therefore

for P large enough,

Ev∈[P ]lgN,N ′(x0)−∫gN,N ′dµ <

ǫ2

10, ∀N,N ′ ≤ F (M∗), (1)

here we can suppose F is increasing.Now we construct a new system on the space Zl

P . ∀v ∈ ZlP , identify

v with an element of [P ]l ⊂ Zl in the obvious way, still noted by v. Letf ′

i(v) = fi(Tvx0). Without loss of generality assume ‖fi‖L∞(X) ≤ 1, ∀i. So

Theorem 3 applies to f ′i: ∃M ≤M∗,

‖En∈[N ′]

l∏

i=1

f ′i Sn

i −En∈[N ′]

l∏

i=1

f ′i Sn

i ‖L2(ZlP ) <

ǫ

2, ∀N,N ′ ∈ [M,F (M)]. (2)

The square of the left hand side is approximately

Ev∈[P ]l|En∈[N ′]

l∏

i=1

fi(Tni x0)−En∈[N ′]

l∏

i=1

fi(Tni x0)|2 = Ev∈[P ]lgN,N ′(x0). (3)

The error arises from those terms with v near the boundary of the region [P ]l

and thus “wraparound” in ZlP under some shift Sn

i . As n < max(N,N ′) ≤F (M) ≤ F (M∗), the contribution of these problematic terms is of order

O(F (M∗)P

) and is at most ǫ2

10when P ≥ O(F (M∗)

ǫ2).

Fix such a sufficiently large P and compare (1),(2),(3), we get ∀N,N ′ ∈[M,F (M)],

(

∫gN,N ′dµ)

12 ≤ (Ev∈[P ]lgN,N ′(x0) +

ǫ2

10)

12 ≤

√(ǫ

2)2 +

ǫ2

10+ǫ2

10< ǫ,

which is exactly the claim in Theorem 2.

90

Page 91: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Notice

l∏

i=1

fi Sni (v1, · · · , vl) =

l∏

i=1

f ∗i (v1, · · · , vl,−

l∑

i=1

vi − n)

where

f ∗i (v1, · · · , vl, vl+1) := fi(v1, · · · , vi−1,−

1≤j≤l+1,j 6=i

vj , vi+1, · · · , vl)

is a function on Zl+1P . f ∗

i depends on only l coordinates of Zl+1P .

Definition 4. Let (Ω,O, η) be a probability space. On Zl+1P ×Ω, a elemen-

tary function of complexity (d, J) is a function which can be expressedas

J∑

j=1

e⊂1,··· ,l+1,|e|=d

ge,j

where ge,j : Zl+1P × Ω 7→ [−1, 1] depends only on the d coordinates inside e

and on the Ω coordinate.

Definition 5. ∀g : Zl+1P × Ω 7→ R, ∀N ∈ N, define Ng : Zl

P × Ω 7→ R by

Ng(v, ω) := En∈[N ]g((v,−∑li=1 vi − n), ω).

It’s clear that∏l

i=1 f∗i is an elementary function of complexity (l, 1) on

Zl+1P (with Ω = point). Thus Theorem 3 is a special case of

Theorem 6. Fix l ∈ N , M∗ ≥ 1, 0 ≤ d ≤ l and J ≥ 1. Then ∀ǫ > 0,∀F : N 7→ N, ∃M∗ ∈ N such that the following property holds:∀P ∈ N, ∀(Ω,O, η), if g : Z

l+1P × Ω 7→ R is an elementary function of

complexity (d, J) then ∃M∗ ≤M ≤ M∗ such that

‖Ng −N ′g‖L2(ZlP ×Ω) ≤ ǫ, ∀N,N ′ ∈ [M,F (M)].

We remark that in Theorem 3 and Theorem 6, the key feature is that M∗

is independent of: the functions fi (or g), the extrinsic probability space Ωand most importantly, the scale P of the space.

Ω, J and M∗ appear in the statement merely because of technical needsin a later inductive argument. They can be dropped without weakeningTheorem 6.

91

Page 92: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Theorem 7. Fix l ∈ N and 0 ≤ d ≤ l. Then ∀ǫ > 0, ∀F : N 7→ N, ∃M∗ ∈ N

such that the following property holds:∀P ∈ N, if g : Z

l+1P 7→ R is an elementary function of complexity (d, 1)

then ∃M ≤M∗ such that

‖Ng −N ′g‖L2(ZlP ) ≤ ǫ, ∀N,N ′ ∈ [M,F (M)].

Theorem 7 is equivalent to Theorem 6. In fact, the space Ω can beignored because of an elegant theorem ([2], Theorem A.2) which is basicallyLebesgue dominated convergence theorem, but translated into the finitarylanguage appeared in Theorem 2. By definition a function of complexity(d, J) can be written as the sum of J functions of complexity (d, 1), as N

and N ′ are linear operators the L2 norm bound for the (d, J) case is at

most J12 times as large as that for (d, 1). By adjusting ǫ, one can reduce to

the case J = 1. Finally, M∗ can also be supposed to be 1 as we can replaceF by FM∗(M) = F (max(M,M∗)), M∗ by 1 and apply the theorem, then theM∗ we get would also work for the original problem.

12.3 Sketch of proof

We shall prove Theorem 6 & 7 by induction on d. In the original paper [2]the induction starts at d = 1 but there would be no problem to begin withd = 0, in which case g is a constant and the theorem is trivial. For larger d,we are going to assume Theorem 6 for d′ = d− 1 and deduce Theorem 7.

We want some control over the rate of convergence of N(g) which isuniform in P . g can be roughly decomposed into two parts, one is “locallyflat”(called anti-uniform) and the other is oscillatory(called uniform). Theuniform part would be nice for us because of its self-cancellation when av-eraged over long intervals. The anti-uniform part behaves less friendly butsurprisingly it can be restricted in at most a constant finite number (Oǫ(1),independent of P ) of scale levels so that the left-over is oscillatory enough foran error smaller than ǫ. After that we shall approximate anti-uniform func-tions of these scale levels with elementary functions of complexity (d− 1, J)and make use of the inductive hypothesis.

12.3.1 Koopman-von Neumann type decomposition

Let g =∏

e⊂1,··· ,l+1,|e|=d ge be an elementary function of complexity (d, 1)

where ge : Zl+1P :7→ [−1, 1] is a function that depends only on the coordinates

92

Page 93: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

inside e. Since if l + 1 /∈ e then Ngeh = geNh, ∀N, h, in the proof ofTheorem 7 we can suppose ge = 1 when l + 1 /∈ e, i.e. g =

∏e∈I ge where

|e| = d, l + 1 ∈ e, ∀e ∈ I ⊂ 1, · · · , l + 1.Definition 8. Let M ∈ N and let e ∈ I, an basic e-anti-uniform functionof scale M is a function φe : Z

l+1P of the form

φe(v) = Em∈[M ]

i∈e

bi((vj)j∈e,j 6=i,

k∈e

vk +m).

where bi is a function Ze\iP × ZP 7→ [−1, 1].

A basic e-anti-uniform function depends only on coordinates in e.

Lemma 9. For M ≥ 1 and ǫ > 0, if ‖Ng‖L2(ZlP ) > ǫ for some N ≥ 10M

ǫ2,

then ∀e0 ∈ I, there is a basic e0-anti-uniform function φe0 of scale M suchthat |〈ge0, φe0〉|L2(Zl

P ) ≥ ǫ2

2.

This lemma is crucial. It asserts that “uniform” functions, or functionsapproximately orthogonal to anti-uniform ones, become small in norm underaverage operators.

Now let K = ⌈106(2l+1)5

ǫ4⌉ ≥ 106|I|5

ǫ4and 1 = M1 ≤ M2 ≤ · · · ≤ MK be

defined by Mk+1 = F (Mk) where F : N 7→ N will be determined later.

Theorem 10. ∃2 ≤ k ≤ K + 1 and a decomposition ge = ge,U⊥ + ge,U foreach e ∈ I, where ge,U⊥, ge,U : Z

l+1P 7→ [−1, 1] depend only on coordinates

inside e and satisfy:(i) (anti-uniform part) ∀e ∈ I, ∀k ≤ j ≤ K, there is a basic e-anti-uniformfunction φe,j and a polynomial Ψe whose degree and coefficients are bothbounded by Oǫ(1) (i.e., independent of P and g), such that

‖ge,U⊥ −Ψe(φe,k, · · · , φe,K)‖L1(Zl+1P ) ≤ ǫ2

400|I|2 ,

‖ge,U⊥ −Ψe(φe,k, · · · , φe,K)‖L∞(Zl+1P ) ≤ 1.

(4)

(ii) (uniform part) ∀e ∈ I, ∀N ≥ 1000|I|2Mk−1

ǫ2, if a function he′ : Z

l+1P 7→

[−1, 1] which only depends on the cooordinates inside e′ is given for everye′ 6= e, |e′| = d then

‖N(ge,U

e′ 6=e,|e′|=d

he′)‖ ≤ǫ

10|I| . (5)

93

Page 94: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

The idea of the proof of Theorem 5 is simple. Initialize k = K + 1and ge,U = ge, ge,U⊥ = 0. During each step check if (5) holds for all

N ≥ 1000|I|2Mk−1

ǫ2and all e. If yes then stop the algorithm. Otherwise it

means for some e ∈ I there is a basic e-anti-uniform function φe,k−1 whichis highly correlated with ge,U . In this case reallocate the part inside ge, Uwhich is correlated with φe,k−1 to ge,U⊥, decrease k by 1 and repeat the step.This involves some complication as the reallocation is not by calculating aninner product but by take the conditional expectation (i.e. an orthogonalprojection) of ge with respect to a finitely generated σ-algebra which is gen-erated by truncations of φe,j’s with j ≥ k. Lemma 3.6 in [2] constructs thesetruncations and the σ-algebra explicitly, guarantees that (4) holds.

The main problem here is if the algorithm can stop in a constant numberof steps. In fact, each time when we decrease k by 1, ‖ge,U⊥‖L2 increases at

least by ( ǫ2

200|I|2 )2 for one of the e’s according to Lemma 9. However as ‖ge,U⊥‖is an orthogonal projection of ge, ‖ge,U⊥‖L2 ≤ 1. So

∑e∈I ‖ge,U⊥‖L2 ≤ |I| <

K · ( ǫ2

200|I|2 )2. Thus the processus always stops before k reaches 1.

12.3.2 Inductive step

First of all we can get rid of the uniform part of g. Let M∗∗ = ⌈1000|I|2Mk−1

ǫ2⌉.

Then for anyN ≥ M∗∗,N(∏

e∈I ge) can be expanded as the sum ofN(∏

e∈I ge,U⊥)with |I| terms, each of which is of the form N(ge,U

∏e′ 6=e,|e′|=d he′) for some

e ∈ I. By (5), each of these |I| terms is of L2 norm at most ǫ10|I| , thus

‖N(∏

e∈I ge) − N(∏

e∈I ge,U⊥)‖L2(ZlP ) ≤ ǫ

10. By triangle inequality it suf-

fices to find a constant M∗∗ such that ∃M∗∗ < M < M∗∗,‖N(∏

e∈I ge,U⊥)−N ′(

∏e∈I ge,U⊥)‖L2(Zl

P ) ≤ ǫ2, ∀N,N ′ ∈ [M,F (M)].

With a similar argument (4) implies

‖N(∏

e∈I

ge,U⊥)−N(∏

e∈I

Ψe(φe,k, · · · , φe,K))‖L1(ZlP ) ≤

ǫ2

400|I|2 · |I|,

‖N(∏

e∈I

ge,U⊥)−N(∏

e∈I

Ψe(φe,k, · · · , φe,K))‖L∞(ZlP ) ≤ |I|,

(here we used the fact that N contracts norms). Thus

‖N(∏

e∈I

ge,U⊥)−N(∏

e∈I

Ψe(φe,k, · · · , φe,K))‖L2(ZlP ) ≤

ǫ

20.

94

Page 95: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

Again by triangle inequality it suffices to find a constant M∗∗ such that∃M∗∗ < M < M∗∗, ∀N,N ′ ∈ [M,F (M)]

‖N(∏

e∈I

Ψe(φe,k, · · · , φe,K)))−N ′(∏

e∈I

Ψe(φe,k, · · · , φe,K)))‖L2(ZlP ) ≤

ǫ

2. (6)

Denote h = Ψe(φe,k, · · · , φe,K , set M∗∗ = ⌈F−1(M14k )⌉ (we assume as

before F is increasing). Then Mk−1 < M∗∗ < M∗∗ < Mk as long as F growssufficiently faster than F does.

Let L = ⌈M12k ⌉, we now approximate h with elementary functions of

complexity d− 1.

Lemma 11. For v ∈ ZlP , w ∈ [L]l and n ∈ [N ] ∪ [N ′],

h(v + w,−l∑

i=1

(vi + wi)− n) = Eω∈Ωφv,ω(w,−l∑

i=1

wi − n) +Oǫ(M− 1

2k ),

where Ω is a finite probability space and ∀ω ∈ Ω, v ∈ ZlP , φv,ω : Zl 7→ R is an

elementary function of complexity (d− 1, Oǫ(1)).

Sketch of proof. h can be expanded, each term is a product of many factorsφe,j. Again expand

φe,j(v + w,−∑li=1(vi + wi)− n)

= Emj∈[Mj ]

∏i∈e be,i,j((vs + ws − nδs,l+1)s∈e\i,

∑s∈e(vs + ws)− n +mj)

where vl+1 := −∑li=1 vi, where wl+1 := −∑l

i=1wi.

The key feature here is that ‖w‖ ≤ O(L) ≤ O(M12k ) ≤ O(M

12j ), n ≤

F (M) ≤ F (M∗∗) ≃M14k so we can replace

∑s∈e(vs+ws)−n+mj by

∑s∈e vs−

n in the last coordinate and get an error term at most O(|P

s∈e ws−n|Mj

) ≤O(M

− 12

K ) as most terms remained in the summation after the shift. Then fora fixed v, the be,i,j factor depends only on the coordinates in e\i. Thus theproduct of such factors are of complexity d− 1.

The rest of the proof adds up these products and is straightforward. Theprobability space Ω represents averages over products of intervals of the form[Mj ] and the Oǫ(1) in the complexity expression comes from the coefficientsof the Ψe’s.

95

Page 96: Additive Combinatoricsthiele/workshop9/proc/addcomb.pdf · 2008-08-07 · Additive Combinatorics Summer School, Catalina Island ∗ August 10th - August 15th 2008 Organizers: Ciprian

‖Nh−N ′h‖L2(ZlP )

= (Ev∈ZlPEw∈[L]l|(En∈[N ] − En∈[N ′])h(v + w,−∑l

i=1(vi + wi)− n)|2) 12

=(Ev∈Zl

PEw∈[L]l|(En∈[N ] − En∈[N ′]) Eω∈Ωφv,ω(w,−∑l

i=1wi − n)|2) 1

2 +Oǫ(M− 1

4k )

=(Ev∈Z

lP ,ω∈ΩEw∈[L]l|(En∈[N ] −En∈[N ′]) φv,ω(w,−∑l

i=1wi − n)|2) 1

2 +Oǫ(M− 1

4k )

Consider ZlP × Ω as an extrinsic probability space and identify [L]l in

the obvious way with a subset of ZlQ where Q = (l + 1)L, still denoted by

[L]l. Like in Theorem 3, we construct a function φv,ω on the new space Zl+1Q :

let φv,ω(w1, · · · , wl+1) equal φv,ω(w1, · · · , wl+1) if (w1, · · · , wl+1) ∈ [L]l andwl+1 ∈ [−Q,−1], equal 0 otherwise. Then φv, m is an elementary function ofcomplexity (d− 1, Oǫ(1)) as well as φv, m.

‖Nh−N ′h‖L2(ZlP ) ≤ ‖N φv,ω −N ′φv,ω‖L2(Zl

Q×Ω) +Oǫ(M− 1

4k ).

Now we can apply the inductive hypothesis (Theorem 6) to φv,ω on Zl+1Q ×

Ω. There is a constant Oǫ,F,M∗∗(1) such that

‖N φv,ω −N ′φv,ω‖L2(ZlQ×Ω) ≤

ǫ

4, ∀N,N ′ ∈ [M,F (M)]

for some M∗∗ ≤ M ≤ Oǫ,F,M∗∗(1). When F grows rapidly enough, M∗∗ =

⌈F−1(F (Mk−1)14 )⌉ is larger than the constant C and Oǫ(M

− 14

k ) is smaller thenǫ4, which implies (6) is satisfied and completes the proof of the main theorem.

References

[1] Furstenberg, H., Ergodic behavior of diagonal measures and a theoremof Szemeredi on arithmetic progressions. J. d’Analyse Math. 31 (1977),204-256

[2] Tao, T., Norm Convergence of Multiple Ergodic Averages for Commut-ing Transformations. arxiv:0707.1117.

Zhiren Wang, Princeton University

email: [email protected]

96