bi-lipshitz embedding of ultrametric cantor sets into euclidean spaces

31
BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES JEAN V. BELLISSARD, ANTOINE JULIEN Abstract. Let (C, d) be an ultrametric Cantor set. Then it admits an isometric embedding into an infinite dimensional Euclidean space [30]. Associated with it is a weighted rooted tree, the reduced Michon graph T [28]. It will be called f -embeddable if there is a bi-Lipshitz map from (C, d) into a finite dimensional Euclidean space. The main result establishes that (C, d) is f -embeddable if and only if it can be represented by a weighted Michon tree such that (i) the number of children per vertex is uniformly bounded, (ii) if κ denotes the weight, there are constants c> 0 and 0 <δ< 1 such that κ(v)(u) d(u,v) where v is a descendant of u and where d(u, v) denotes the graph distance between the vertices u, v. Several examples are provided: (a) the tiling space of a linear repetitive sequence is f -embeddable, (b) the tiling space of a Sturmian sequence is f -embeddable if and only if the irrational number characterizing it has bounded type, (c) the boundary of a Galton-Watson random trees with more than one child per vertex is almost surely not f -embeddable. 1. Introduction Embedding isometrically compact metric spaces into a Euclidean space is an old problem [17, 6, 33, 34] that has been investigated by several communities, in Mathematics and in Com- puter Science as well. There are elementary examples of finite metric space for which such an embedding is impossible: the simplest example is the space K 1,3 = {0, 1, 2, 3} (see Fig. 1 and [21]) with metric d(0,i) = 1 for i =1, 2, 3 and d(i, j ) = 2 if i 6= j X \{0}. This is the path metric on the graph with three vertices and the three internal edges. Then there is no isometric embedding of (X, d) in any Euclidean space. More generally, Theorem 1 (Schoenberg [32]). If (X, d) is a finite metric space, let x X be a point and let A(x) be the matrix A(x) indexed by X \{x} with elements A(x) y,z =(d(x, y) 2 + d(x, z ) 2 - d(y,z ) 2 )/2. Then (X, d) admits an isometric embedding in R n if and only if (i) A(x) is a positive matrix and (ii) the rank of A(x) is less than or equal to n. Testing whether A(x) is positive can be done through the Cholesky algorithm, which provides also a computation of the rank. This method is amenable to numerical calculation to analyze data sets (see [27, 8] for instance). The simplest type of infinite metric spaces for which the isometrical embedding problem can be extended are Cantor sets. A Cantor set is a compact Hausdorff space that is completely disconnected and has no isolated point. An old Theorem of Brouwer [7] claims that all Cantor sets are homeomorphic to the set {0, 1} N of infinite sequences of 0’s and 1’s, endowed with the Work supported in part by NSF grants DMS 06009565, DMS 0901514 and by the CRC701, Spectral Structures and Topological Methods in Mathematics, Mathematik, Universit¨ at Bielefeld, Germany. The second author is supported in part by the Pacific Institute for the Mathematical Sciences. 1 arXiv:1202.4330v1 [math.GN] 20 Feb 2012

Upload: wolverineinzen

Post on 08-Nov-2014

14 views

Category:

Documents


0 download

DESCRIPTION

Embedding

TRANSCRIPT

Page 1: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO

EUCLIDEAN SPACES

JEAN V. BELLISSARD, ANTOINE JULIEN

Abstract. Let (C, d) be an ultrametric Cantor set. Then it admits an isometric embeddinginto an infinite dimensional Euclidean space [30]. Associated with it is a weighted rooted tree,the reduced Michon graph T [28]. It will be called f -embeddable if there is a bi-Lipshitz mapfrom (C, d) into a finite dimensional Euclidean space. The main result establishes that (C, d)is f -embeddable if and only if it can be represented by a weighted Michon tree such that (i)the number of children per vertex is uniformly bounded, (ii) if κ denotes the weight, there are

constants c > 0 and 0 < δ < 1 such that κ(v)/κ(u) ≤ c δd(u,v) where v is a descendant of uand where d(u, v) denotes the graph distance between the vertices u, v. Several examples areprovided: (a) the tiling space of a linear repetitive sequence is f -embeddable, (b) the tiling spaceof a Sturmian sequence is f -embeddable if and only if the irrational number characterizing ithas bounded type, (c) the boundary of a Galton-Watson random trees with more than one childper vertex is almost surely not f -embeddable.

1. Introduction

Embedding isometrically compact metric spaces into a Euclidean space is an old problem[17, 6, 33, 34] that has been investigated by several communities, in Mathematics and in Com-puter Science as well. There are elementary examples of finite metric space for which such anembedding is impossible: the simplest example is the space K1,3 = {0, 1, 2, 3} (see Fig. 1 and[21]) with metric d(0, i) = 1 for i = 1, 2, 3 and d(i, j) = 2 if i 6= j ∈ X \ {0}. This is the pathmetric on the graph with three vertices and the three internal edges. Then there is no isometricembedding of (X, d) in any Euclidean space. More generally,

Theorem 1 (Schoenberg [32]). If (X, d) is a finite metric space, let x ∈ X be a point and let A(x)be the matrix A(x) indexed by X \ {x} with elements A(x)y,z = (d(x, y)2 +d(x, z)2−d(y, z)2)/2.Then (X, d) admits an isometric embedding in Rn if and only if

(i) A(x) is a positive matrix and(ii) the rank of A(x) is less than or equal to n.

Testing whether A(x) is positive can be done through the Cholesky algorithm, which providesalso a computation of the rank. This method is amenable to numerical calculation to analyzedata sets (see [27, 8] for instance).

The simplest type of infinite metric spaces for which the isometrical embedding problem canbe extended are Cantor sets. A Cantor set is a compact Hausdorff space that is completelydisconnected and has no isolated point. An old Theorem of Brouwer [7] claims that all Cantorsets are homeomorphic to the set {0, 1}N of infinite sequences of 0’s and 1’s, endowed with the

Work supported in part by NSF grants DMS 06009565, DMS 0901514 and by the CRC701, Spectral Structuresand Topological Methods in Mathematics, Mathematik, Universitat Bielefeld, Germany. The second author issupported in part by the Pacific Institute for the Mathematical Sciences.

1

arX

iv:1

202.

4330

v1 [

mat

h.G

N]

20

Feb

2012

Page 2: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

2 JEAN V. BELLISSARD, ANTOINE JULIEN

Figure 1. The space K1,3 with d(0, i) = 1, d(i, j) = 2 is not embeddable

product topology. While there is only one Cantor set, as a topological space, there are manymetrics on it describing the topology. The subclass of ultrametrics turns out to be as natural forCantor sets as the Euclidean metric on Rn. This was investigated in the eighties by Michon [28]who described all possible ultrametrics through the concept of weighted rooted tree. In a recentwork [30], J. Pearson and the first author, using this formalism, proved the following embeddingresult

Theorem 2 (see [30]). Any ultrametric Cantor set can be embedded isometrically into an infinitedimensional Euclidean space with a countable basis.

The algorithm behind this result also applies trivially to finite ultrametric spaces. Actually, in afinite space X, all metric are equivalent, so that replacing the metric by the smallest ultrametricwhich dominates it, allows to use the method of [30] to find a quasi-isometric embedding in afinite dimensional Euclidean space. The method however gives a dimension n which is at leastthe number of points of the space minus one. On the other hand, given any finite set F onthe real line with the same cardinality as X and given any one-to-one map from X to F , themetric on X induced by the Euclidean metric on F is equivalent to the original one, so thatquasi isometric embedding of X in R is always possible. As it turns out this is a subclass of setsfor which the main result (Theorem 3) of the present work applies.

One of the questions suggested by Theorem 2 is whether the dimension of the embedding spacecan be made finite. There is an obvious obstacle to such an embedding: since the Cantor setis ultrametric, it cannot be embedded isometrically into Rn, because the Euclidean metric isnot an ultrametric. However, it is sufficient that the two metrics be equivalent. Again thereis an obvious obstacle, namely if such an embedding exists, then the Hausdorff dimension of(C, d) and of its image φ(C) are equal and therefore dimH(C, d) ≤ n. So the question has to berephrased as follows

Definition 1. An ultrametric Cantor set (C, d) will be called f -embeddable if there is n ∈ N, apositive real number c and a map φ : C → Rn such that

1

cd(x, y) ≤ ‖φ(x)− φ(y)‖ ≤ c d(x, y) , ∀x, y ∈ C .

Such a map will be called a quasi-isometry or a bi-Lipshitz map, or, for short, in this paper, anf -embedding. Then n will be called the dimension of the embedding.

Acknowledgements: J. B. thanks A. Grigor’yan and his group at the CRC701, University ofBielefeld for giving him the opportunity to present this result prior to publication. A. J. thanksJ. Savinien for discussions on this embedding problem.

Page 3: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 3

2. The Main Result

A graph G is a family G = (V,E), where V, called the set of vertices, and E, called the set ofedges, are countable sets. They occur with two maps (r, s) : E→ V, everywhere defined, calledthe range and the source. A finite path in G is a sequence γ = (e1, · · · , en) of edges such thatr(ej) = s(ej+1). The integer n = |γ| is called the length of γ. The maps r, s extend to finitepath by r(γ) = r(en) and s(γ) = s(e1). If x = s(γ) and y = r(γ) then γ connects x to y and thiswill be denoted by γ : x → y. If v = r(ei) = s(ei+1), then γ is said to pass through v. A pathis a loop if r(γ) = s(γ). The relation “x is connected to y”, defined by “there is a path γ suchthat either γ : x → y or γ : y → x”, is an equivalence. The equivalence classes of G are calledthe connected components. If there is only one connected component, G is called connected. Apath γ is a cycle if it is a non-trivial path γ : x→ x. In particular, a loop is a cycle.

A tree T is a connected graph with no cycle. A rooted tree is a tree with a pointed vertex ∅called the root. In such a case, given a vertex v ∈ V there is a unique path connecting the rootto v. Then |v| will denote the length of this path and will be called the generation of v. Thisinduces an order between the vertices as follows: two vertices v, w satisfies w � v whenever theunique path from ∅ to w is passing through v. Then v is called an ancestor of w, while w iscalled a descendant of v. In such a case, v is the father of w (or, equivalently, w is a child of v)whenever the unique path from v to w contains only one edge. A vertex is dangling wheneverit has no descendant. A vertex is branching if it has at least two children. Given a rooted treeT = (V,E, ∅), let V′ ⊂ V be the set of branching vertices, which have infinitely many branchingdescendants. If w ∈ V′, define its predecessor v to be the closest ancestor in V′. This leadsto the reduced tree T′ = (V′,E′, ∅′) where e ∈ E′ is a pair (v, w) ∈ V′ × V′, such that v is thepredecessor of w. In such a case s(v, w) = v while r(v, w) = w. The new root ∅′ is either ∅ if thelatter belongs to V′ or its closest descendant belonging to V′. All the trees considered here willbe reduced, unless specified otherwise. A rooted tree is called Cantorian whenever each vertexhas only a finite number of children and each vertex admits a descendant having more than onechild.

Let T be a rooted tree. Its boundary ∂T is the set of maximal paths starting at the root. If Thas dangling vertices, the latter are representatives of a point in ∂T. So ∂T is the union of thedangling vertices and of the set of infinite maximal paths. Given a vertex v, [v] will denote theset of paths in ∂T passing through v. The family {[v] ; v ∈ V} defines a basis for a topology on∂T which makes this space completely discontinuous. In particular ∂T is a Cantor set if and onlyif T is Cantorian [30]. Conversely any Cantor set can be described through a rooted tree whichis not necessarily unique. If x, y ∈ ∂T, then x ∧ y will denote the youngest common ancestorof x and y. A weight of the rooted tree T is a map κ : V → (0,∞) such that (i) if w � v thenκ(w) ≤ κ(v) and (ii) lim|v|→∞ κ(v) = 0. A weight κ leads to an ultrametric dκ on ∂T by setting[28, 30]

dκ(x, y) = κ(x ∧ y) , x, y ∈ ∂T .Conversely the Michon theorem [28] establishes that any ultrametric Cantor set is isometric tothe boundary of a reduced rooted Cantorian tree endowed with a weight. Such a tree will becalled a Michon tree. The main result can be now expressed more precisely

Theorem 3 (Main Result). An ultrametric Cantor set (C, d) is f -embeddable if and only if itcan be represented by a Michon tree T with weight κ satisfying the following two conditions

(i) the number of children of a vertex is uniformly bounded,

Page 4: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

4 JEAN V. BELLISSARD, ANTOINE JULIEN

(ii) there are constant c > 0 and 0 < δ < 1 such that

κ(w)

κ(v)≤ c δ|w|−|v| , ∀w � v ∈ V .

The proof that these conditions are necessary is new, to the best of the authors knowledge.However the proof that they are sufficient is deeply inspired by a recent paper by J. Savinienand the second author [22], in which a similar result was obtained for what the authors arecalling self similar Cantor sets. Examples of such Cantor sets can be obtained as tiling spaces ofsubstitution tilings [31]. Thanks to the existence of a substitution, the conditions (i) and (ii) areeasy to implement and the proof reduces to show the existence of an f -embedding. This proofis actually transferable to the present situation and will show that the two conditions (i) and(ii) are sufficient. In [22] though, the authors have an additional result, namely the dimensionof the embedding can be chosen to be the smallest integer larger than the Hausdorff dimensionof (C, d). Such a result cannot be expected to be true for a general ultrametric Cantor sets.However, the following result will be shown in Section 5

Proposition 1. If (C, d) is an ultrametric Cantor set for which the conditions (i) & (ii) ofTheorem 3 hold, then

dimH(C, d) ≤ lnM

| ln δ|where M is the maximum number of children per vertex. Moreover the smallest dimension ofan f -embedding is at most equal to ln (Mc+ 1)/| ln δ|.

Several consequences of the Main Result will be proved at the end of this work. Given a doublyinfinite sequence ξ = (un)n∈N, of letters taken from a finite alphabet un ∈ A, the tiling spaceof ξ is the set of all doubly infinite sequences sharing the same dictionary of finite words. It iscustomary to define a metric on the tiling space, called the combinatorial metric, as follows: ifξ = (un)n∈N, η = (vn)n∈N are two sequences their distance is d(ξ, η) = (N + 1)−1, where N isthe largest natural integer N ≥ 0 such that un = vn for |n| ≤ N .

The first consequence of the Main Result concerns the tiling space of a linearly repetitive se-quence. A sequence x = (un)n∈N ∈ AZ is linearly repetitive whenever there is K > 1 such thatfor each n ∈ N and all pair of finite subwords v, w with length |v| = n, |w| = Kn, then v occursas a subword of w.

Theorem 4. The tiling space of a linearly repetitive sequence is f -embeddable.

It is possible to extend the notion of linear repetitivity to tilings in higher dimension [1]. Theproof of the previous result, however, requires a stronger definition to be extended in higherdimension, related to forcing the border in a controlled way (See Section 6.3). This suggest that

Conjecture 1. There is a suitable definition of linearly repetitive tiling in Rd such that thetransversal of any such tiling space is f -embeddable

In a similar way, the next consequence concerns Sturmian sequences. These are doubly infinitesequences such that number of words of length n it contains equals n + 1. A classical theorem(see, for instance, [5]) asserts that, for each such sequence x = (xn)n∈Z, the alphabet containstwo letters, usually denoted by {0, 1} and there is an irrational number α ∈ (0, 1) such thatun = [x − nα] − [x − (n + 1)α] for some x ∈ (0, 1]. The following theorem is a corollary of theMain Result, and is proved in two parts: “if” (Proposition 12), and “only if” (Proposition 13).

Page 5: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 5

Theorem 5. The tiling space of a Sturmian sequence x associated with the irrational numberα ∈ (0, 1) is f -embbedable if and only if α has bounded type, namely if and only if its continuedfraction expansion has bounded partial quotients.

A possible extension of this result is provided by the tiling spaces of quasicrystals obtained bythe cut-and-project method [20]. Namely let Zn seen as a subset of the Euclidean space Rn. Let∆ be the unit cube [0, 1]n in Rn. Let then E‖ be a linear subspace of Rn of dimension d, with

irrational orientation, namely such that E‖ ∩ Zd = {0}. Then Σ = Σ(E‖) =(∆ + E‖

)∩ Zd.

The set L = L(E‖) obtained as the orthogonal projection of Σ(E‖) in E‖ gives a model for theatomic positions of a possible d-dimensional quasicrystal. The set L has several properties, inparticular it is a Delone set of finite local complexity (FLC) [25]. A local patch is a finite subsetof E‖ of the form (L−x)∩B(0; r] for some x ∈ L. Here B(0; r] denotes the closed ball centeredat 0 with radius r. Then FLC means that, for each r > 0 the set of patches of radius r is finite.The tiling space of L is again the set of all Delone sets in E‖, containing the origin sharing thesame family of local patches. The combinatorial metric on the tiling space is defined in a similarway. Namely, if L and L′ are two FLC Delone sets, then d(L,L′) is the inverse 1/R where Ris the largest r > 0 such that the patch of radius r at the origin coincide in both L and L′,namely such that L ∩ B(0; r] = L′ ∩ B(0; r]. By analogy with the previous result the followingconjecture is expected

Conjecture 2. For Lebesgue almost all d-dimensional linear space E‖ ∈ Rd, seen as points inthe Grassmanian manifold of d-dimensional subspace of Rn, the tiling space of L(E‖), endowedwith the combinatorial metric, is not f -embeddable.

The last consequence analyzed here concerns random trees obtained from a Galton-Watsonbranching process [19, 3] (see [23] for a well documented fascinating history of this problem).Let p be a probability defined on the set N = {0, 1, 2, · · · } of natural integers. The GW -tree isconstructed inductively, starting from the root, as follows: if v is a vertex, it has Mv childrenwhere Mv is a random variable with distribution p and is independent from the Mu where u isany prior vertex constructed so far. It is a classical result in probability, partly proved initially bythe Reverend Watson [35], called the Galton-Watson-Haldane-Steffensen critical theorem [23],that if the average number of children 〈Mv〉 is less than or equal to one, the tree obtained in thisway is almost surely finite (extinction). In particular its reduced tree is empty. On the otherhand, as was eventually proved by Steffensen in 1930, if 〈Mv〉 > 1, the probability of extinctionis less than one. However, the random tree produced in this way is likely to have dandlingvertices.

One way to avoid such a property is to force every vertex to have at least two children. This canbe done by demanding that p0 = p1 = 0, namely that p be supported by [2,∞) ⊂ N: then theGalton-Watson tree is automatically reduced and this model can be called a Reduced RandomTree. The first consequence of the Main Result is the following

Proposition 2. Let p be a probability on the set [2,∞) ⊂ N. If it has an infinite support, thenthe number of children per vertex is almost surely unbounded. In particular, for any weight, theboundary of the reduced Galton-Watson tree associated with this probability is almost surely notf -embeddable.

In such a case a random weight can be added in the following way: a family (λv)v∈V of i.i.d.supported in [0, 1], with common distribution ρ is defined. A weight is defined inductively bysetting κ(∅) = 1 and κ(v) = λvκ(u) is v is a child of u. In order that this defines a weight, it isrequired that ρ{0} = 0 and ρ{1} < 1. Then

Page 6: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

6 JEAN V. BELLISSARD, ANTOINE JULIEN

Theorem 6. Let p be a probability on the set [2,∞) ⊂ N and let ρ be a probability on [0, 1] suchthat ρ{0} = 0 and ρ{1} < 1. If m = 〈Mv〉 < ρ{1}−1, let sm be the unique solution of 〈λs〉m = 1.Then the Reduced Random Tree T produced by the Galton-Watson process associated with p andendowed with the random weight associated with ρ gives rise to an ultrametric Cantor set ∂Twith Hausdorff dimension sm almost surely. In addition, its Hausdorff measure exists almostsurely and is a random probability measure.

3. Necessary Conditions for f-Embeddability

The first result concerning the existence of an f -embedding is provided by

Proposition 3. Let T be a weighted Michon tree with weight κ such that ∂T endowed with itsultrametric dκ is f -embeddable. Then the number of children per vertex is uniformly bounded.

Proof: By assumption, there is n ∈ N, a constant c ≥ 1 and a map φ : ∂T → Rn such that

(1)1

cdκ(x, y) ≤ ‖φ(x)− φ(y)‖ ≤ c dκ(x, y) , ∀x, y ∈ ∂T .

Let then v ∈ V be a vertex. By definition κ(v) represents the diameter of [v] in ∂T. Thereforediam{φ([v])} ≤ c κ(v). Moreover, if w,w′ are two distinct children of v, the distance between[w], [w′] is equal to κ(v) as well, thanks to the definition of dκ. Therefore, given xw ∈ [w] foreach child of v, the open balls B(xw, κ(v)) are pairwise disjoint, because dκ is an ultrametric.These balls are contained in [v]. Thanks to eq. (1) it follows that the euclidean open ballsBw = B(φ(xw), κ(v)/c) are pairwise disjoint and contained in a Euclidean closed ball Bv ofradius cκ(v). Let Mv be the number of children of v. Let ωn be the volume of the Euclideanball of radius 1 in Rn. Then the sum of the volumes of the Bw’s is at most equal to the volumeof Bv, leading to

Mv ωn

(κ(v)

c

)n≤ ωn (c κ(v))n , ⇒ Mv ≤ (c2)n ,

2

Remark 1. The Proposition 3 shows that if two Michon trees represents two equivalent ultra-metrics d, d′ on the Cantor set C, then they both have a bounded number of children per vertex.2

Proposition 4. Let T be a Michon tree with a weight κ and a uniformly bounded number ofchildren per vertex. If the corresponding ultrametric Cantor set (∂T, dκ) is f -embeddable, thenthere are 0 < θ < 1, and c ≥ 1 such that for any pair (v, w) of vertices such that v is an ancestorof w

κ(w)

κ(v)≤ c θ|w|−|v| .

Proof: Let M be the maximum number of children per vertex in T. It is finite by assumptionand since T is reduced, M ≥ 2. Let 0 < δ < 1 be chosen. For each vertex v ∈ V, let Descδ(v)be the set of descendants w of v such that κ(w) ≤ δ κ(v). This set is not empty since theweight converges to zero along any path passing though v. Let Sδ(v) be the maximal elementsin Descδ(v), so that w ∈ Sδ(v) if and only if w is a descendant of v such that κ(w) ≤ δκ(v) andits father w satisfies κ(w) > δκ(v).

Page 7: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 7

Then a reduced graph Tδ = (Vδ,Eδ, ∅, κδ) is constructed as follows:(i) its root is the same as the root of T,(ii) its set of vertices is a subset Vδ ⊂ V constructed inductively as follows:

(a) Vδ,1 = Sδ(∅) ⊂ V,(b) Vδ,n+1 =

⋃u∈Vδ,n Sδ(u),

(iii) an edge e ∈ Eδ is a pair (v, w) where v ∈ Vδ and w ∈ Sδ(v),(iv) its weight is given by κδ(v) = κ(v) if v ∈ Vδ.

Let dδ be the metric defined on ∂Tδ by the weight κδ. It follows, from the construction of Tδ,that every infinite path x ∈ ∂T starting at the root defines also a unique infinite path f(x) ∈ ∂Tδstarting at the same root, by selecting, among the vertices defining x, the ones belonging to Vδ.From this it follows that f is actually invertible and continuous. Hence, identifying ∂T and ∂Tδwith the Cantor set C, f is just the identity map.

If x, y are distinct paths in ∂T, then the vertex x ∧ y ∈ V may not belong to Vδ. However, if vis its least ancestor in Vδ, its weight satisfies always

δκ(v) < κ(x ∧ y) ≤ κ(v) .

Moreover, v is the least common ancestor of x and y inside the tree Tδ, so that dδ(x, y) = κ(v)leading to

δ ≤ d(x, y)

dδ(x, y)≤ 1 .

Thus, the two metrics d, dδ on the Cantor set C, are equivalent. In particular, (C, d) is em-beddable if and only if (C, dδ) is. Hence, thanks to Proposition 3, the maximum number Mδ

of children per vertex in Tδ is finite. It follows that the number of vertices in Sδ(v) is boundedby Mδ whenever v ∈ Vδ. Let then r(v) be the maximal generation difference, in T, between vand an element w ∈ Sδ(v). Since the path from v to w cannot contains more that Mδ pointsr(v) ≤Mδ.

Let v, w be two vertices in V such that w � v. Let v0 (resp. w0) be the least ancestor ofv (resp. w) belonging to Vδ. By construction w0 is a descendant of v0. If v0 = w0, thenδκ(v0) < κ(w) ≤ κ(v) ≤ κ(v0). Hence

δ <κ(w)

κ(v)≤ 1 ≤ δ|w|−|v|

δMδ.

If now w0 6= v0, then, by construction δκ(w0) < κ(w) ≤ κ(w0), and κ(w0) ≤ δnκ(v0) where n isthe generation difference between v0 and w0 in Tδ. Hence, the generation difference between v0

and w0, in T, is at most nMδ. Thus, if |v0| denotes the generation of v0 in T, n ≥ (|w0|−|v0|)/Mδ,so that

κ(w0)

κ(v0)≤ δ(|w0|−|v0|)/Mδ .

Since |w| − |v| ≤ |w| − |v0| ≤ |w| − |w0|+ |w0| − |v0| ≤Mδ + |w0| − |v0|, this implies

κ(w)

κ(v)≤ κ(w0)

δκ(v0)≤ 1

δ2δ(|w|−|v|)/Mδ .

Since T is reduced, it follows that Mδ ≥ M ≥ 2. Consequently, the conclusion holds withc ≤ δ−Mδ and if δ1/Mδ = θ. 2

Page 8: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

8 JEAN V. BELLISSARD, ANTOINE JULIEN

4. The Embedding Theorem

While the previous section showed the the conditions (i) and (ii) in Theorem 3 are necessary,this Section will prove that they are sufficient. The proof is just and adjustment of the proof ofSavinien and the second author in [22].

Proposition 5. Let T be a Michon tree with a weight κ with a bounded number of children pervertex and so that there are c ≥ 1 and 0 < θ < 1 for which

(2)κ(w)

κ(v)≤ c θ|w|−|v| ∀w � v , v, w,∈ V .

Then (∂T, dκ) is f -embeddable.

Proof: (i) Let M be the maximum number of children per vertex in T. Let g : V →{1, 2, · · · ,M} be a map such that its restriction to the set of children of v be one-to-one for allvertex v ∈ V. Such a map consists simply in giving a numbering to each child of each vertex.Hence 1 ≤ g(v) ≤ M for all v. Let now L be a natural integer. The choice of L will be madelater. Any point x ∈ ∂T will be represented by the sequence x = (vn)∞n=0 of its vertices in T, withthe convention that vn has generation n. Then let φ : ∂T → RL be defined by φ(x) = (φr(x))Lr=1

with

φr(x) =∞∑j=0

g(vjL+r)κ(vjL+r−1) , 1 ≤ r ≤ L .

Since g(vn) ≤M and κ(vn) ≤ c θn it follows that the series converges absolutely and uniformlywith respect to r and x.

(ii) Let now x, y ∈ ∂T with x 6= y with x = (vn)∞n=0 and y = (wn)∞n=0. Then let j0, r0 be naturalintegers such that |x∧ y| = j0L+ r0− 1 and 1 ≤ r0 ≤ L. It follows that the paths x and y sharethe same vertices until the generation |x ∧ y| and they split at the next generation. Therefore,

φr(x)− φr(y) =

∞∑j=j0+1

(g(vjL+r)κ(vjL+r−1)− g(wjL+r)κ(wjL+r−1))

+ χ(r ≥ r0) (g(vj0L+r)κ(vj0L+r−1)− g(wj0L+r)κ(wj0L+r−1))

It should be remarked that

−g(wjL+r)κ(wjL+r−1) ≤ g(vjL+r)κ(vjL+r−1)− g(wjL+r)κ(wjL+r−1) ≤ g(vjL+r)κ(vjL+r−1) ,

leading to the inequality (see eq. (2)),

|g(vjL+r)κ(vjL+r−1)− g(wjL+r)κ(wjL+r−1)| ≤M κ(x ∧ y) c θ(j−j0)L+r−r0

Therefore

(3) |φr(x)− φr(y)| ≤ Mc

1− θLκ(x ∧ y) .

Since κ(x∧ y) = dκ(x, y), this map is Lipshitz continuous. On the other hand, ‖φ(x)− φ(y)‖ ≥|φr(x)− φr(y)| for all 1 ≤ r ≤ L. In particular, using again eq. (3) and since vj0L+r0 6= wj0L+r0 ,it follows that

Page 9: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 9

‖φ(x)− φ(y)‖ ≥ |φr0(x)− φr0(y)| ≥ κ(x ∧ y)− McθL

1− θLκ(x ∧ y) .

If L is chosen so that θL < (Mc+ 1)−1, it follows that this map is actually bi-Lipshitz. 2

5. Hausdorff Dimension

The minimal value of the embedding dimension L found in the proof of Proposition 5 is not asgood as what was found in [22] for self similar Cantor sets. This is because in this case, morerestrictions are put on the weight, in particular a lower bound of a type similar to the upperbound given in eq. (2). In such a case it was shown that the minimal embedding dimension is thesmallest integer larger than the Hausdorff dimension of the ultrametric Cantor set considered.This addresses the question of computing the Hausdorff dimension of the ultrametric Cantor set(∂T, dκ).

5.1. Hausdorff Dimension: the main result. The definition of the Hausdorff dimension [16]starts with the following construction: given an open cover U of C = ∂T and given s ∈ [0,∞),let Hs(U) be defined by

Hs(U) =∑U∈U

diam(U)s .

In addition, let diam(U) = supU∈U diam(U). Then for 0 < δ < 1 let Hsδ(C) be defined by

(4) Hsδ(C) = infdiam(U)<δ

Hs(U)

It follows that δ′ ≤ δ ⇒ Hsδ(C) ≤ Hsδ′(C). In addition, if σ > 0 then Hs+σδ (C) ≤ δσHsδ(C) ≤Hsδ(C). Consequently,

(i) the limit limδ→0Hsδ(C) = Hs(C) exists in [0,+∞) ∪ {+∞},(ii) there is a unique s0 such that if s > s0 then Hs(C) = 0 whereas for s < s0, Hs(C) = +∞.

This unique value s0 is precisely the Hausdorff dimension s0 = dimH(C, d). In order to give abetter formula, the following definition will be nedded

Definition 2. A finite subtree Γ of T is a tree graph Γ = (VΓ,EΓ, ∅) where(i) VΓ ⊂ V is finite and contains the root ∅,(ii) for every v ∈ VΓ every vertex of the unique path γv : ∅ → v in T belong to VΓ,(iii) EΓ is the subset of edges in E of the form (v, w) where both v, w are in VΓ.

∂Γ will denote the set of endpoints. The finite subtree Γ is called full if for any vertex v ∈ Γ\∂Γ,each child of v is a vertex of Γ.

The main result of this Section is

Theorem 7. Let T be a reduced Michon tree with weight κ. For any δ > 0 let Gδ(T) be the setof all full finite subtrees of T such that maxv∈∂Γ κ(v) < δ. Then

Hsδ(∂T) = infΓ∈Gδ(T)

∑v∈∂Γ

κ(v)s .

Page 10: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

10 JEAN V. BELLISSARD, ANTOINE JULIEN

Figure 2. A full finite subtree and its extremal vertices

Corollary 1. Let T be a reduced Michon tree with weight κ. If the maximum number of childrenper vertex is M and if there are c ≥ 1 and 0 < θ < 1 such that eq. (2) holds, then

dimH(∂T, dκ) ≤ lnM

| ln θ|.

Given a vertex v let K(v) be the product of M(u)−1 over the ancestors of v. Then K defines aweight called the Kraft weight. This leads to the following elementary result

Corollary 2. Let T be a reduced Michon tree with its Kraft weight K. The Hausdorff dimensionof (∂T, dK) is exactly one. In addition, the Kraft weight defines a probability measure on ∂Twhich coincides with the Hausdorff measure of (∂T, dK). In particular (∂T, dK) is f -embeddableif and only if T has a bounded number of children per vertex.

The proofs of the last three results will be the content of the following Sections.

5.2. Partitions by Closed and Open Sets.

Definition 3. A c-partition of the Cantor set C is a finite family P = (P1, P2, · · · , Pr) wherethe Pi’s are closed and open (clopen) subsets of C with union C and with empty pairwise inter-sections.

That c-partitions exist as a refinement of any open cover, is essentially equivalent to the definitionof a Cantor set. This is because the topology admits a basis of clopen sets. Hence every coveradmits a refinement made of clopen sets. By compactness, every clopen cover admits a finitesubcover. At last, the intersection or the union of two clopen sets are clopen sets as well.Therefore the minimal elements of the σ-algebra generated by a finite clopen cover are themselvesclopen and they make up a partition. This argument shows that any open cover admits a refiningc-partition. Conversely a compact space with this property is completely disconnected and, if ithas no isolated point, it is a Cantor set.

Proposition 6. Let T be a Michon tree with weight κ. The extremal vertices of a full finitesubtree make up a c-partition of C. Conversely, for any c-partition P of the ultrametric Cantorset (∂T, dκ), there is a full finite subtree Γ such that each element of P is a union of extremalvertices of Γ.

Page 11: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 11

Proof: (i) Let Γ be a full finite subtree of T. If v, w ∈ ∂Γ then neither of them is the descendantof the other, just because they are both extremal. Hence [v]∩[w] = ∅. Since Γ is full, the brothersof any extremal vertex are all vertices in Γ. Let x ∈ ∂T be an infinite path x = (vn)n∈N ∈ ∂T.Since v0 = ∅, there is a unique n0 ∈ N such that vn ∈ VΓ for n ≤ n0 and vn /∈ VΓ for n > n0.If vn0 were not an extremal vertex for Γ, then, since Γ is full, its child vn0+1 should be in Γ aswell, a contradiction. Therefore vn0 ∈ ∂Γ and x ∈ [vn0 ]. Consequently, the family {[v] ; v ∈ ∂Γ}is a c-partition.

(ii) The set{[v] ; v ∈ V} is a basis for the topology of ∂T. Given any clopen set P ⊂ ∂T, anypoint x ∈ P admits a neighborhood of the form [vx] ⊂ P . The family {[vx] ; x ∈ ∂T} is anopen cover of P . Since P is closed, it is compact so that a finite subcover can be extracted.But the intersection [v] ∩ [w] is either empty or, if not, one of the two [v], [w] is included in theother. In the latter case, remove the smallest from the family. Proceeding this way, leads to asubcover which is actually a c-partition of P by vertices of the Michon graph. Let now P be ac-partition of ∂T. Let VP be the set of vertices generating the c-partitions of the elements in P.Let also Γ be the finite subgraph of T made with vertices and edges of the finite paths joiningthe root to one of the vertices in VP. Then Γ is a finite subtree by construction. Actually, it isfull: for indeed if not, it has an extremal vertex v having one of its brother w not in VΓ. Butthis in not possible because, P being a cover, [w] is covered by P and thus has a non emptyintersection with some P ∈ P. Since [w] does not intersect any of the [u] with u ∈ VP, there isa contradiction with the construction of VP. Hence w must belong to VP as well. 2

Lemma 1. Let (X, d) be an ultrametric complete space. Then every open ball is closed andevery closed ball is open.

Proof: Let x ∈ X and let r > 0. The ball B(x; r) = {y ∈ X ; d(x, y) < r} is open. If z ∈B(x; r) then there is y ∈ B(x; r) with d(y, z) < r. Therefore d(x, z) ≤ max{d(x, y), d(y, z)} < r

so that z ∈ B(x; r). Conversely, let B(x; r) = {y ∈ X ; d(x, y) ≤ r}. If y ∈ B(x; r) and ifz ∈ B(y; r) then

d(x, z) ≤ max{d(x, y), d(y, z)} ≤ r ,so that z ∈ B(x; r). In particular B(y; r) ⊂ B(x; r) for all y ∈ B(x; r). Hence B(x; r) is open. 2

Lemma 2. Let (X, d) be an ultrametric compact space. Then for every open set U ⊂ X, thereis a clopen set PU ⊃ U such that diam(PU ) = diam(U).

Proof: For δ > 0 let U δ = {y ∈ X ; dist(y, U) < δ}. By definition U δ =⋃x∈U B(x; δ) which

implies U =⋂δ>0 U

δ. In particular, U δ is open. Since U is closed, it is compact and therefore

there are (x1, . . . , xn) in U such that U ⊂ B(x1; δ)∪ · · ·∪B(xn; δ) = PU ⊂ U δ. Since every openball is also closed, it follows that PU is clopen. Moreover diam(U) ≤ diam(PU ) ≤ diam(U δ).If now x, y ∈ U δ, there are x′, y′ ∈ U such that d(x, x′) < δ and d(y, y′) < δ. There-fore d(x, y) ≤ max{d(x, x′), d(x′, y′), d(y′, y)} ≤ max{δ, diam(U)}. In particular diam(U δ) ≤max{δ, diam(U)}. If δ < diam(U), it follows that diam(PU ) = diam(U). 2

Lemma 3. Let (C, d) be an ultrametric Cantor set. Then any finite cover by clopen set P =(P1, P2, · · · , Pn) admits a refinement Q = (Q1, Q2, · · · , Qn) such that Qj ⊂ Pj and Q is a c-partition.

Proof: The set Q1 = P1 \ (P2 ∪ · · · ∪ Pn) is clopen and contained in P1. It may happen thatQ1 = ∅. But in any case, C = Q1 ∪ P2 ∪ · · · ∪ Pn, while Q1 ∩ Pj = ∅ for j > 1. Hence replacing

Page 12: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

12 JEAN V. BELLISSARD, ANTOINE JULIEN

C by C1 = C \ Q1 and P by P1 = P \ {P1} the same argument follows. The Lemma is provedby induction. 2

5.3. Proof of Theorem 7. Let V be the set of vertices of the Michon tree T. Thanks to therepresentation of c-partitions provided by the Proposition 6, each clopen set P ⊂ ∂T can be

written as P =⋃lj=1[vj ] where (v1, v2, · · · , vl) ∈ V are vertices which are pairwise not ancestor

of each others. This means that [vi] ∩ [vj ] = ∅ if vi 6= vj . It follows that the least commonancestor v1 ∧ v2 ∧ · · · ∧ vl = vP provides a measure of the diameter of P namely

diam(P ) = κ(vP ) , vP =∧

v∈V ; [v]⊂P

v .

Let now P be a c-partition P. Several clopen sets of P might be associated with the same vertexvP . In particular ∑

P∈P;vP=v

diam(P )s ≥ κ(v)s .

In addition there might be clopen sets P, P ′ ∈ P such that vP is an ancestor of vP ′ . In such acase diam(P )s + diam(P ′)s ≥ κ(vP )s. Let then Q′ = {[vP ] ; P ∈ P}. Since [vP ] ⊃ P , it followsthat Q′ is an open cover. In view of the previous remark, it might happen that [vP ] ⊃ [vP ′ ].By removing the smaller set, it gives a subcover. Let then Q be the minimal subcover obtainedfrom Q′ in this way. Then Hs(P) ≥ Hs(Q′) ≥ Hs(Q). In particular, Q is a c-partition madeof vertices of the Michon graph. In addition, by construction, if diam(P) < δ it follows thatκ(v) < δ for all v ∈ V such that [v] ∈ Q. Hence the Proposition 6 implies that Q is defined bythe extremal vertices of a full finite subtree Γ ∈ Gδ(T). 2

5.4. Kraft’s Identity. Let T = (V,E, ∅, κ) be the reduced Michon tree of the ultrametricCantor set (C, d). For each vertex v ∈ V, let M(v) be the number of its children. If γ = (v0 =∅, v1, · · · , vn) denotes a finite path starting at the root, then M(γ) is defined by

M(γ) =n−1∏j=0

M(vj)

Proposition 7 (see [9]). If Γ is any full finite subtree of T then∑γ∈PΓ

1

M(γ)= 1 (Kraft’s identity)

where PΓ denotes the set of all maximal paths in Γ starting at the root.

Proof: The proof is based upon an elementary remark: if γ = (v0, v1, · · · , vn−1, vn) is a pathin PΓ, then the choice of vn can only be made among the children of vn−1. In particular, ifCh(v) denotes the set of children of v,

1

M(vn−1)

∑vn∈Ch(vn−1)

1 = 1 ,

since M(vn−1) is exactly the number of children of vn−1. Since Γ is full, it follows that thisidentity applies also to paths in Γ. To vary γ further, it is necessary to sum over the possible

Page 13: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 13

vertices vn−1, namely among the children of vn−2. Proceeding this way inductively gives theresult. 2

Remark 2. The usual Kraft identity, in coding theory, deals with binary tree graphs, represent-ing the possible strings of binary digits. Therefore M(v) = 2 for all vertex v. Then the previous

identity reads∑

γ∈PΓ2−|γ| = 1. Every maximal path can be identified with a code word. If no

pair of code words share the same prefix, then the family of these code words lead to a finitesubtree. This tree is full if the family of codewords is maximal among the set of codewords withno prefix in common. This prefix-free property makes this family of words recognizable whenread. In particular a recognizable set of code words must be a subset of PΓ for some full finitesubtree of the binary tree. Hence if W is the set of such codewords and if `(w) denotes thelength of the word w, the following inequality is necessary for recognizability∑

w∈W2−`(w) ≤ 1 (Kraft’s inequality)

2

5.5. Proof of Corollary 1. Let Γ be a full finite subtree of T. Then, using the Kraft identity(see Proposition 7), it follows that∑

v∈∂Γ

κ(v)s ≤ supv∈∂Γ

κ(v)sM(γv) ≤ c (θsM)|v| .

On the other hand, let Nδ be the smallest natural integer such that cθNδ ≤ δ. Then there is afull finite subtree Γ of T such that v ∈ ∂Γ ⇒ |v| ≥ Nδ. In particular, Γ ∈ Gδ(T) and therefore

Hsδ(∂T) ≤ c (θsM)Nδ .

Since Nδ → ∞ as δ → 0, it follows that if θsM < 1, then Hs(∂T) = 0, namely s ≥ dimH(∂T).This gives the result.

6. S-adic Systems

6.1. Definitions. This Section is devoted to the basic definitions for S-adic systems. Thepresentation uses the notations of Durand [12, 13]. Let A be a finite alphabet. Note A? the setof all finite words with letters in A. For w ∈ A?, |w| denotes its length, namely the number ofletters it contains. Consider S a finite set of morphisms

σ ∈ S : A(σ) −→ A?, with A(σ) ⊂ A.A S-adic system is a sequence (σn : An+1 → (An)?)n∈N ∈ SN, such that the morphisms arecomposable. It will be assumed that for all n, every letter in An appears in a word σn(b) forsome b ∈ An+1. For m > n the following notation will be used

σn,m = σn ◦ . . . ◦ σm−1 : Am → An.

It satisfies σn,m ◦σm,k = σn,k. A S-adic system is primitive if there exists some s0 > 0 such thatfor all r ∈ N, for all a ∈ Ar+s0 and all b ∈ Ar, the letter b appears in the word σr,r+s0(a). It iscalled proper if there exist two letters l and r in A such that for all σ ∈ S and all a ∈ A(σ), σ(a)begins by the letter l and ends by r. Furthermore, it will be assumed from this point on, that

Page 14: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

14 JEAN V. BELLISSARD, ANTOINE JULIEN

(5) limn→+∞

minc∈An

|σ1,n(c)| = +∞,

Hence, words of arbitrary long length can be obtained from iterating the substitutions on asingle letter. Given a S-adic system, there is a way to associate a subshift Ξ ⊂ AZ (it is thencalled a S-adic subshift). Let T be the shift operator on AZ

T (. . . x−1 · x0x1 . . .) = . . . x−1x0 · x1 . . . .

It is straightforward to extend the morphisms of S to AZ by concatenation

σ(. . . x−1 · x0x1 . . .) = . . . σ(x−1) · σ(x0)σ(x1) . . . .

It ought to be remarked that for all n, σ1,n(l) is a prefix of σ1,n+1(l). Similarly, σ1,n(r) is a suffixof σ1,n+1(r). Therefore, an element of AZ can be defined by

x =

(lim

n→+∞σ1,n(r)

)·(

limn→+∞

σ1,n(l)

),

where the dot separates the xi with i ≥ 0 on the right from the ones with i < 0 on the left.The subshift associated with the S-adic system is the closure in AZ (endowed with the producttopology) of the orbit of x under the shift. This subshift, Ξ, is endowed with the combinatorialdistance d, namely two sequences have distance (n+ 1)−1 whenever they coincide on a string ofradius n around the origin and do not coincide beyond. If the S-adic system is primitive, (Ξ, T )is minimal (see Durand [12, Lemma 7]).

Definition 4. A word x ∈ AZ is called linearly recurrent or linearly repetitive (LR for short)if there is a constant K such that for all n ∈ N, for all subwords w and w′ of x respectively oflength n and Kn, then w occurs in w′ as a subword.

In other words1, in a LR word, each finite subword repeat infinitely often and within a distancewhich varies linearly with their size. It is easy to see that a subshift generated by an LR wordis minimal and that every elements of the subshift is LR with the same constant. In this case,the subshift itself is called linearly recurrent. The following is due to Durand

Theorem 8 (Durand [13], Proposition 1.1). A subshift is S-adic primitive and proper if andonly if it is linearly recurrent.

The periodic case is trivial. In the non-periodic case, Durand’s construction of the S-adic systemassociated with a linearly recurrent subshift involves return words. The S-adic system he buildsin this case has the following property, which will be used in the proof of the result below (see [13,Section 4] for the construction, and [14, Definition 9, Lemma 17] for the properties of returnwords and codes).

Definition 5. Let (σn)n∈N be a S-adic system. It is said to have unique decomposition propertyif, given any element x in the subshift associated with Ξ, there is a unique decomposition of xas a concatenation

x = . . . σ1(a−1)σ1(a0)σ1(a1) . . . ,

where the index 0 of x is in the underlined word σ1(a0).

1No pun intended.

Page 15: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 15

6.2. Proof of Theorem 4. The periodic case is trivial (in this case, Ξ is finite). Using thetheorem of Durand cited above, it is enough to prove the following

Proposition 8. Let Ξ be a a primitive proper S-adic subshift with unique decomposition propertyand endowed with the combinatorial metric. Then, it is f -embeddable.

It will be convenient to describe an S-adic subshift in terms of a Bratteli diagram (see forexample [14]). A weight on the Bratteli diagram permits to define an ultrametric on the set ofits infinite path. It will be necessary to prove

– the ultrametric Cantor set associated with this Bratteli diagram is bi-Lipschitz homeo-morphic to the subshift with the combinatorial metric;

– the weight on the Bratteli diagram satisfies the hypothesis of Theorem 3 so that theCantor set associated with the diagram is embeddable.

The following result is needed.

Lemma 4 (Durand [12], Lemma 8). If the S-adic system generated by (σn)n∈N is primitive withconstant s0, there exists a constant K such that for all integers r, s, with s− r ≥ s0 and for allb, c in As+1

|σr,s+1(b)||σr,s+1(c)|

≤ K.

Let (σn : An+1 → An) be a primitive S-adic proper system with unique decomposition propertyand let l and r the letters associated with the properness. The Bratteli diagram is defined asfollows

– For all n ∈ N, the set of vertices Vn is in bijection with the alphabet An: for each a ∈ An,there is a va ∈ Vn.

– For all n ∈ N, an edge in En is a triple e = (va, l, vb) where va ∈ Vn, vb ∈ Vn+1 and l ∈ Nsuch that the letter a occurs in the word σn(b) in position l + 1. Hence, the number ofedges from va to vb is equal to the number of times the letter a appears in σn(b).

– If e = (va, l, vb) ∈ En its source is s(e) = va, its range is r(e) = vb and its label is l(e) = l.

Example 1. If σn(b) = labcar, then there are two edges e1, e2 from va to vb: one correspondsto the second letter and the other to the fifth letter of the word above. Their respective labelsare 1 and 4. 2

Definition 6. A path on the Bratteli diagram is a sequence of edges (ek)i≤k<j (with 1 ≤ i <j ≤ +∞), such that ek ∈ Ek for all k and r(ek) = s(ek+1). Let Πn denote the set of paths withi = 1, j = n (simply called “paths of length n”). Let Π denote the union of all Πn and let Π∞be the set of paths of infinite length (i = 1, j = +∞).

Because the σn’s are taken from a finite set of substitutions S the cardinality of the sets En andVn is uniformly bounded in n. Let

wn =(

min{|σ1,n(a)| ; a ∈ An})−1

.

It is a decreasing sequence which tends to 0 as n → ∞. The weight of a finite path γ will bedefined by wn+1 whenever γ has length n. This leads to a metric on Π∞ defined by

dw(x, y) = wn+1, where n is the length of the longest common prefix of x, y.

Page 16: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

16 JEAN V. BELLISSARD, ANTOINE JULIEN

Proposition 9. The subshift (Ξ, d), endowed with the combinatorial metric, and (Π∞, dw) arehomeomorphic though a bi-Lipschitz homeomorphism.

Proof: (i) Constructing a map Ξ→ Π∞. Let x = . . . x−1 ·x0x1 . . . in Ξ. Let v1 ∈ V1 be thevertex corresponding to the letter x0. By the unique decomposition property, x can be writtenin a unique way as the concatenation

x = . . . σ(x′−1)σ(x′0)σ(x′1) . . . ,

with (x′i)i∈Z ∈ (A2)Z and such that the letter of x of index 0 is in the underlined word σ(x′0).Therefore, if x′ is the word x′ = (x′i)i∈Z

(6) x = T kσ(x′),

This index k corresponds to an occurrence of the letter x0 in the word σ(x′0). That is, itcorresponds to an edge from v1 to v2, where v2 ∈ V2 is the vertex corresponding to x′0. Iterating

this process of “de-substitution” leads to construct a sequence of words x′, x′′, . . . , x(n) . . . anda corresponding sequence of edges e1, e2, . . . , en. In particular ψ(x) = (e1, e2, . . .) defines a mapΞ→ Π∞.

(ii) ψ is a bijection. The map ψ has an inverse φ : Π∞ → Ξ which will be built explicitly.Let γ = (e1, e2, . . .) be a path going through the vertices v1, v2, . . .. Given a finite word w =(w0, . . . , wl−1) ∈ A∗, let ε be a symbol not in A (the “empty” symbol). Define then w ∈ (A∪{ε})Zis the sequence

w = . . . ε ε ε · w ε ε ε . . . .

This is a way of seeing a word in A∗ as a (partially defined) element of AZ. For fixed n, thesequence

(7) φn(γ) = T l(e1) ◦ σ1 ◦ T l(e2) ◦ . . . ◦ σn−2 ◦ T l(en−1)+1(r σn−1(an) l

),

where l(e) denotes the label of the edge e, can be seen as an element of (A ∪ {ε})Z, namely asthe sequence

. . . εε σ1,n−1(r) σ1,n(an) σ1,n−1(l) εε . . . ,

where the letter of index 0 occurs in the underlined word. Its exact position is determined bythe labels of the edges. This implies

(8) [φn(γ)]i 6= ε for − |σ1,n−1(r)| ≤ i ≤ |σ1,n−1(l)|.By hypothesis, limn→+∞ |σ1,n−1(b)| = +∞ and this is true, in particular, whenever b ∈ {r, l}.Therefore, the limit as n tends to infinity of φn(γ) is an element of AZ, which is noted φ(x).Using the definitions of φ and ψ (see eq. (6) & (7)), it is straightforward that ψ ◦ φ = idΠ∞ .Therefore, ψ is onto and φ is one-to-one.Conversely, let x ∈ Ξ, and γ = ψ(x). It will be shown that φ(γ) = x. For all n, x has a uniquedecomposition

x = . . . σ1,n(b−1)σ1,n(b0)σ1,n(b1) . . . ,

Page 17: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 17

with bi ∈ An. So by definition of ψ and φ,

(9) φn(γ) = . . . ε σ1,n−1(r) σ1,n(b0) σ1,n−1(l) ε . . . ,

and the two words coincide on the word σ1,n(b0) (it appears at the same position). Furthermore,by properness of the S-adic system, σn(b−1) ends with the letter r while σn(b1) begins with theletter l. Therefore, σ1,n(b−1) has σ1,n−1(r) as a suffix and σ1,n(b1) has σ1,n−1(l) as a prefix. So forall i ∈ Z such that [φn(γ)]i 6= ε, one has [φn(γ)]i = xi. Taking a limit and using hypothesis (5),φ(γ) and x agree everywhere. Therefore, φ and ψ are inverse bijections of each other.

(iii) φ is bi-Lipshitz. Let γ, γ′ ∈ Π∞ be such that the n first edges of γ and γ′ coincide.Then, by definition, φn(γ) = φn(γ′). Thus φ(γ) and φ(γ′) coincide for all indices i satisfying−|σ1,n−1(r)| ≤ i ≤ |σ1,n−1(l)|. Using Lemma 4 and the definition of wn leads to

∀b ∈ An−1 , (wn−1)−1 ≤ |σ1,n−1(b)| ≤ K(wn−1)−1.

Since S is finite, C = maxσ∈S,a∈A(σ) |σ(a)| is well defined. Then, a word of the form σ1,n(b) =σ1,n−1 ◦ σn−1(b) is at most C times longer than the longest word of the form σ1,n−1(c). That is,using again Lemma 4, (wn)−1 ≤ CK(wn−1)−1. If [φ(γ)]i = [φ(γ′)]i for |i| ≤ K(wn−1)−1, then inparticular this inequality holds for |i| ≤ (wn)−1/C. So

d(γ, γ′) ≤ wn ⇒ d(φ(γ), φ(γ′)) ≤ Cwn.Conversely, let γ and γ′ coincide up to edge n, but differing on their (n + 1)-th one. Then, bydefinition of φ

φ(γ) = . . . σ1,n+1(b−1)σ1,n+1(b0)σ1,n+1(b1) . . .

φ(γ′) = . . . σ1,n+1(b′−1)σ1,n+1(b′0)σ1,n+1(b′1) . . .

where b0 6= b′0 and the letter of index 0 belongs to the word σ1,n+1(b0) (respectively σ1,n+1(b′0)).By the unique decomposition property, φ(γ) and φ(γ′) have to differ for indices i satisfying

−max{|σ1,n+1(b0)|, |σ1,n+1(b′0)|} ≤ i ≤ max{|σ1,n+1(b0)|, |σ1,n+1(b′0)|},that is for

−K(wn)−1 ≤ i ≤ K(wn)−1.

This proves that φ−1 is Lipschitz. 2

Proof of Proposition 9. In order to prove that (Ξ, d) is embeddable in Rp, it is sufficient toprove that (Π∞, dw) is embeddable. It suffices to remark that the corresponding Michon treehas the finite paths in Π∞ as vertices and edges given by pairs of path differing only by oneedge. By the finiteness of S, all paths in Πn have a bounded number of extensions to paths ofΠn+1 and this bound is independent of n. Hence the Michon graph has a bounded number ofchildren per vertex. So, the only thing left is to show that the sequence of weights (wn)n∈N onthe Bratteli diagram is bounded above by a decreasing geometric sequence.

First, remark that for all n, the alphabet An has at least two elements. If one of the An hadonly one element, the subshift Ξ would be periodic (this case is already ruled out, because thenthe embedding is trivial). Using primitivity, for all letter c ∈ As0+1, |σs0(c)| ≥ 2. By iteration,for all k and all c ∈ Aks0+1

Page 18: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

18 JEAN V. BELLISSARD, ANTOINE JULIEN

|σks0(c)| ≥ 2k .

So wks0 ≤ 2−k. Since (wn)n∈N is decreasing, wn ≤ Cλn with λ = 2−1/s0 < 1 and C is a constant.Thanks to Theorem 3 it follows that (Ξ, d) is f -embeddable.

6.3. About the Conjecture 1. The previous proof relies on the fact that linearly repetitivesubshifts have a good representation by Brattli diagrams. While they are not self-similar, thereare only a finite number of substitutions involved, which allows to generalize the methods of [22].In a recent article, Aliste and Coronel [1] provide a good description of any linearly repetitivetiling space as the set of paths on a Bratteli diagram. It is possible in the linearly repetitivecase, to have a Bratteli diagram such that the number of vertices and edges in each Vn or Enis bounded uniformly in n. They also have estimates for the size of the patches associatedwith paths of size n. These estimates are analogues of Durand’s Lemma 4 and allow to definereasonable weights on the Bratteli diagram. Therefore, the path space of this Bratteli diagram,endowed with the distance defined by the weight, is f -embeddable. However, there is no proofyet that the homeomorphism between this paths space and the tiling space is bi-Lipschitz.In Proposition 9, the bi-Lipschitz character of φ is proved through using equation (9). Thisrequires an estimate on the length of σ1,n(b0), of σ1,n−1(r) and of σ1,n−1(l). This is a quantitativeversion of a property known as “forcing the border”. It is unclear to the authors whether or notAliste–Coronel’s construction satisfies a similar quantitative border forcing property. Here is apossible analogue of the border forcing property that would be needed, restated in the contextof [1].

Definition 7. Let (σn : An+1 → An)n∈N be a sequence of substitutions in Rd where Ai is a setof proto-tiles (compact subsets of Rd which are the closure of their interior). Let C, λ > 1 betwo constant such that for all n ∈ N and all t ∈ An+1,

C−1λn ≤ rint(t) ≤ Rext(t) ≤ Cλn,where rint(t) and Rext(t) are the inside and outside radius of the tile t, respectively. The systemhas quantitative border forcing property if there is a C ′ such that, for all n, all tile t, and alltiling T , if T contains σ1,n(t), then the patch of T contained in the C ′λn-neighborhood of thepatch σ1,n(t) does not depend on T .

This property means that the neighborhood of σ1,n(t) is forced, up to a distance which iscontrolled. In a recent article, Giordano-Matui-Putnam-Skau [18], can describe a tiling spaceas an inductive system (a sequence of “zoomed-out” tilings, in the sense of [4]), such that thesize, and to some extend, the shape of the tiles involved is tightly controlled. In particular,the authors can avoid “flat tiles” in a certain sense. It is possible that their methods can beused to produce an inductive sequence associated with a tiling, which has a quantitative borderforcing property. If this could be done in a way which is compatible with Aliste and Coronel’smethods, the proof presented above could be adapted right away to the case of tilings of higherdimension. In the light of this discussion,it seems reasonable to expect that the Conjecture 1could be proved using the ideas above. However, the technical difficulties, namely those inducedby using GMPS’s construction, would be quite important.

7. Sturmian Sequences

Page 19: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 19

7.1. Definitions and notations. First, a few facts need to be introduced about Sturmiansequences and their coding. This section follows in part the presentation in [2], and proof forsome of the results cited below can be found there.

Definition 8. Given a sequence x = (xn)n∈Z ∈ AZ on the finite alphabet A, define its language:

Ln(x) = {finite words a1 . . . an which appear in x}, and L(x) =⋃n∈NLn(x).

Definition 9. Given x ∈ AZ, its complexity function px is defined by

∀n ∈ N, px(n) = Card(Ln).

When there is no risk of ambiguity, px(n) is just noted p(n).It is known that if for some n, px(n) ≤ n, then the word x is periodic. Sturmian sequencescan be defined using the complexity function: they are the aperiodic sequences which have thelowest possible complexity.

Definition 10. A sequence (xn)n∈Z is called Sturmian if its complexity function satisfies p(n) =n+ 1, and x is not eventually periodic.

In particular, a Sturmian sequence is a sequence on two letters (since p(1) = 2). These twoletters are noted 0 and 1.

Proposition 10. The frequency of the letter 1 in a Sturmian sequence x

freqx(1) := limn→+∞

Card{k ∈ [−n, n] ; xk = 1}2n+ 1

is well defined, and is an irrational number, noted α ∈ ]0, 1[.

Definition 11. The Sturmian sequence x is of type zero if the frequency of 1 is less than onehalf, and of type one if this frequency is more than one half.

It is easy to see that in a Sturmian sequence of type 1, the words 10, 01 and 11 may appear,but not the word 00. The same statements holds for sequences of type 0, just exchanging theletters 0 and 1.Given a Sturmian sequence x, there is naturally a subshift of {0, 1}Z associated with it. It is bydefinition the set of all sequences y such that L(y) = L(x). It can also be defined as the closureof the orbit of x in AZ (for the product topology). The two definitions are equivalent. It is ofcourse shift-invariant (hence the name “subshift”), and it is well known that it is minimal.Note that if x has frequency of ones equal to α, so do all the elements of its subshift. In particular,it makes sense to write that a Sturmian subshift is of type 0 or of type 1. Conversely, the subshiftassociated with x is exactly the set of all Sturmian sequences which have the same frequency ofones. Therefore, it makes sense to denote Ξ(α) the subshift associated with α ∈ (0, 1).Sturmian sequences can be recoded using the following substitutions. Define:

σ0 :

{0 7→ 01 7→ 10

and σ1 :

{0 7→ 011 7→ 1.

Proposition 11. For any Sturmian sequence x of type 0, there is a Sturmian sequence x′ suchthat either x = σ0(x′) or x = Tσ0(x′). (T is the shift operator, see Section 6.)For any Sturmian sequence x of type 1, there is a Sturmian sequence x′ such that either x =σ1(x′) or x = Tσ1(x′).

Page 20: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

20 JEAN V. BELLISSARD, ANTOINE JULIEN

Note Φ this re-coding map x 7→ x′. It is worth noting that if x, y are two elements of the sameSturmian subshift, then Φ(x) and Φ(y) belong to the same subshift. Non-periodicity of Sturmiansequences implies that for any Sturmian sequence y of type 0 (resp. 1), there is a k such thatΦk(y) is of type 1 (resp. 0).

Definition 12. Let x be a Sturmian sequence, and (Φn(x))n∈N be the sequence of re-codedSturmian sequences. By definition, for all n,

σb10 ◦ σb21 ◦ . . . ◦ σ

b2n+1

0 (Φ2n+1(x)) and σb10 ◦ σb21 ◦ . . . ◦ σ

b2n1 (Φ2n(x))

are in the same orbit as x. All the bn are positive, except maybe b0 = 0. The sequence (bn)n∈Nis called the multiplicative coding of the Sturmian sequence x.

All Sturmian sequences in a same subshift have the same multiplicative coding, and an acceptablemultiplicative coding determines uniquely a Sturmian subshift.

7.2. Partial fraction decomposition and multiplicative coding. The properties of a Stur-mian subshift are closely related to the partial fraction decomposition of the number α (whichis the frequency of ones in the subshift).Let α ∈ R be irrational. Then α can be written uniquely α = a0 + α0, with a0 ∈ Z andα0 ∈ (0, 1). The Gauss map G : [0, 1] → [0, 1], applied to α0, generates the continuous fractionexpansion

G(α) =1

α− a(α) , a(α) =

[1

α

],

where [x] denotes the integer part of x, namely the largest integer smaller than or equal to x.Hence

α = a0 +1

a1 + α1, a1 = a(α) , α1 = G(α) .

Iterating this formula gives rise to the continuous fraction expansion

α = a0 +1

a1 +1

a2 +1

a3 + · · ·+1

an + αn

, αn+1 = G(αn) , an = a(αn−1) n ≥ 1 .

The standard notation is

α = [a0; a1, a2, · · · , an, · · · ] ,and the an’s are called the partial quotients of α.

Definition 13. A number α has bounded type whenever, the sequence of its partial quotientsis bounded.

Having bounded type is an exceptional property. This is a theorem of Khintchine. See alsoLevy [26].

Theorem 9 (see [24]). For almost every α ∈ [0, 1] the sequence (an)n∈N∗ of partial quotients ofα is unbounded.

Page 21: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 21

One famous example of such a typical number is e = 2, 71828 · · · the continued fraction of whichwas computed by Leonhard Euler in 1737 [15] namely

e = [2, 1, 2, 1, 1, 4, 1, 1, 6, · · · , 1, 1, 2l, · · · ] a3l−1 = 2l , a3l−2 = a3l = 1 , l ≥ 1 .

Many properties of a Sturmian subshift are determined by the arithmetic properties of thenumber α associated with it. The following theorem is proved in Hedlund and Morse’s seminalpaper.

Theorem 10 (see [29]). Consider a Sturmian subshift Ξ ⊂ {0, 1}Z, of parameter α. Thissubshift is linearly repetitive (see definition 4) if and only if α has bounded type.

An immediate consequence of this theorem and theorem 4 is the following result.

Proposition 12. Let Ξ be a Sturmian subshift with parameter α, endowed with the combinatorialmetric. If α has bounded type, then Ξ is f -embeddable.

This result needs a converse statement: if α has not bounded type, then the associated Sturmiansubshift is not f -embeddable. This will be proved in section 7.3.One way to understand the deep links between the combinatoric properties of a Sturmian subshiftand the arithmetic properties of its parameter α is the following. A Sturmian sequence x can beseen as the coding of an orbit of the rotation of angle α on the circle: there is {I0, I1} a partitionof the circle, and s a point in the circle such that xn = 0 if and only if (s+α mod 1) ∈ I0. Therotation on the circle can be related to the partial fraction decomposition of α on the one hand,and on the multiplicative coding of x on the other hand, to get to following result.

Theorem 11 ((see [2]). Let x be a Sturmian sequence, and α = [0; a1, a2, . . .] be the frequency of1 in x. Then the multiplicative coding of x is given by the partial quotients of (1−α)/α = α−1−1.

It is straightforward that (1 − α)/α = [a1 − 1; a2, a3, . . .]. In particular, α has bounded type ifand only if the sequence of coefficients of the multiplicative coding is bounded.

7.3. Non embeddability of certain Sturmian subshifts. This section is devoted to theproof of the following result.

Proposition 13. Consider a Sturmian sequence x, with associated subshift Ξ and associatedparameter α. If α has unbounded partial quotients, then Ξ (with the combinatorial metric) isnot embeddable in a finite dimensional space.

A Sturmian subshift Ξ is well described combinatorially by the (bilateral) tree of words of itselements.

Definition 14. Given a Sturmian subshift Ξ, its un-reduced tree of words is defined as follows.

– For all n ≥ 0, define a sequence of refining partitions of Ξ by:

Pn = {[y−n . . . yn] ; y ∈ Ξ},where [y−n . . . yn] is the cylinder set of all words in Ξ which coincide with y on indices−n ≤ i ≤ n.

– The set of vertices of the tree is in bijection with the disjoint union of all the Pn. IfX ∈ Pn, the associated vertex is noted vX .

– There is an edge between vX and vY if and only if X ∈ Pn, Y ∈ Pn+1 and Y ⊂ X.– Add a vertex vΞ, and an edge between vΞ and the vertices vX for X ∈ P0.– The weight of the vertex vX is 1/(n+ 1) if X ∈ Pn (n ≥ 0), and 1 otherwise.

Page 22: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

22 JEAN V. BELLISSARD, ANTOINE JULIEN

The (reduced) tree of words is obtained from this tree by the reduction process defined in section 2.

One remark about this tree: if X ∈ Pn, then diam(X) ≤ n, for the combinatorial distance, withequality if and only if X is the non-trivial union of two distinct elements of Pn+1. Note that thevertices vX of such clopen sets X are exactly the vertices with two children: these are preciselythe ones which are not dropped by the reduction process.This leads to the following proposition.

Proposition 14. The boundary of the reduced tree of words associated with a Sturmian subshiftΞ is bi-Lipschitz homeomorphic to the subshift (endowed with the combinatorial metric).

Proof: The proof is almost tautological: given an infinite path in the tree, say γ, the sequenceof vertices (vXn)n∈N defines a decreasing sequence of compact sets Xn, the diameter of whichtends to zero. Therefore, its intersection is not empty and consists of a single element {x}.Define φ(γ) = x.Conversely, given x ∈ Ξ, there is a unique decreasing sequence of sets Xn ∈ Pn, such that forall n, x ∈ Xn (explicitly: Xn = [x−n . . . xn]). Then it defines an infinite path γ ∈ ∂T . Clearly,γ is the unique pre-image of x by φ.The fact that φ is bi-Lipschitz (and in particular, it is a homeomorphism) results from theremark above, on the diameters of the elements of Pn. 2

Lemma 5. Let y be a Sturmian sequence, assume that y = σbn0 (z), with z a Sturmian sequence.Then y contains the words 10bn1 and 10bn0.

Proof: If y = σbn0 (z), then y is of type 0 and z is of type 1. By iteration, it is straightforward

that σbn0 (0) = 0 and σbn0 (1) = 1(0bn). Since z is of type 1, it contains the words 10 and 11.Therefore, y contains the words:

1(0bn)1(0bn) and 1(0bn)0.

2

Proof of Proposition 13: Let x be a Sturmian sequence, with α the frequency of 1, andassume that α does not have bounded type. Then its multiplicative coding (bn)n∈N is an un-bounded sequence.Let y = Φb0+b1+...+bn−1(x), and z = Φbn(y). Without loss of generality, we assume that z is oftype 1, so that

y = σbn0 (z).

Then, using previous lemma, y contains the words 10bn1 and 10bn0. Therefore, for all 1 ≤ k ≤[bn]/2− 1, y contains the words 002k0 and 002k1.

Applying the substitution σ := σb10 ◦ . . . ◦ σbn−1

0 , the sequence x contains the words

σ(0)σ(0)2kσ(0) and σ(0)σ(0)2kσ(1).

Let a be the last letter of σ(0) and σ(1) (it is the same: 0 if b0 6= 0, 1 otherwise). Since σ(1)starts by 1 and σ(0) starts by 0, x contains the words

(10) aσ(0)kσ(0)k0 and aσ(0)kσ(0)k1.

Then, let Xk = [aσ(0)k · σ(0)k], where the dot separates the indices i ≤ 0 and i > 0. It is anelement of Pk|σ(0)|, where |σ(0)| is the length of σ(0). Let vk be the associated vertex. FromEquation (10), the vertices vk have two distinct children (in the un-reduced tree of words),therefore, they are elements of the reduced tree, and their weight is (k|σ(0)|)−1. In particular,

Page 23: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 23

this construction shows that there are two vertices u, v, such that the quotient of their weightsis [bn/2]− 1, and their distance in the tree is [bn/2]− 1.This construction can be done for all n. If (bn)n∈N is unbounded, this shows that the weightscannot satisfy the geometric decay condition of Theorem 3, and Ξ is not embeddable. 2

The results presented here (namely proposition 12 and 13) provide a proof of Theorem 5. It isactually possible to give a more precise version of it.

Theorem 12. A Sturmian subshift Ξ(α) is f -embeddable if and only if the irrational number αassociated with it has bounded type. In particular

(i) if α is a quadratic irrational, then Ξ(α) is f -embeddable;(ii) for almost every α ∈ (0, 1), the subshift Ξ(α) is not f -embeddable;

(iii) the boundary of Ξ(e) is not embeddable for e = 2.71828 . . ..

Proof: The first part of this result is theorem 5. Point (i) is a consequence of the fact thatquadratic irrational have an eventually periodic (hence bounded) partial fraction expansion.Point (ii) is a consequence of the theorem of Khintchine on continued fractions, and point (iii)is a consequence of the explicit formula for the partial fraction decomposition of e. 2

8. Random Trees

Random trees are good candidates to provide examples of Cantor sets with typical properties.In the history of random processes, the Galton-Watson process has certainly be a seminal exam-ple. Initially, Sir Francis Galton was concerned by the possible disappearance of family namesamong the British aristocrats in the nineteen century. He addressed this problem in a publishedarticle. Soon after, the reverend Watson proposed a method to attack the problem and provideda solution [35]. Unfortunately his prediction was that family names where all disappearing withprobability one, which was certainly not what Galton could observe. Galton encouraged Watsonto search further. It took up to the 1930’s and the work of Haldane and Steffensen before thecomplete solution was found (see [23] for a fascinating review). As it turns out, branching pro-cesses of the Galton-Watson type appears in many other situations. For instance the descriptionof the nuclear chain reaction or of electron emissions in a photomultiplier tubes can be describedfaithfully by such a process (see [19]).

The Galton-Watson process, as it is called today [19, 3] starts from a root and defines thenumber ξ∅ of offspring as an integer valued random variable with distribution p. Each offspringwill be represented as a vertex v of the first generation of the random tree and a new integervalued random variable ξv, describing the number of offspings of v, is drawn with probability pso that all the ξv’s, including the root, are stochastically independent. Each vertex v such thatξv 6= 0, will have ξv offsprings represented as vertices of the second generation. Proceeding in thesame way for all descendants of the root, the branching process is built inductively. The maindifficulty, for the purpose of the present paper, is that the resulting random tree has, in general, anon zero probability to be finite. Moreover, even if it is infinite, it has an infinite number of deadbranches, namely finite maximal paths. This is because, any vertex has a nonzero probabilityto have no offsprings or to have only one. The problem of pruning the tree to reduce it, leadsto some difficulties that are interesting from the probabilistic point of view but provides uselesscomplications as a nice illustration of the core result of the present article. For this reason,the model will be simplified by requiring that the probabilities p0 and p1, that the number ofoffspring be zero or one, respectively, vanish. So, from now on with probability one, the number

Page 24: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

24 JEAN V. BELLISSARD, ANTOINE JULIEN

of offsprings will be at least two. In this way, each vertex has at least two children, so that theresulting tree is already reduced. This is why this branching process will be called a ReducedRandom Tree.

8.1. Reduced Random Trees: a review. Let p = (pn)n≥2 be the probability supportedby [2,∞) ⊂ N describing the number of children of each vertex. Let Zn be the number ofdescendant at the generation n. Following Watson’s idea, it is convenient to introduce thegenerating function

Pn(x) =

∞∑l=2

Prob{Zn = l} xl

The construction of the random tree implies the following formula, where Vn denotes the set ofvertices at generation n,

(11) Zn+1 =∑v∈Vn

ξv .

Consequently the conditional probabilities are given by

Prob{Zn+1 = l |Zn = k} =∑

j1+···+jk=l

pj1 · · · pjk .

In particular, it shows that (Zn)n∈N∗ defines a Markov chain. Moreover

(12) Pn+1(x) = Pn(P (x)

), P (x) =

∞∑n=2

pn xn .

Since p is a probability, the series defining P converges for 0 ≤ x ≤ 1. In addition, P (1) = 1,m = P ′(1) = E(ξ) represents the average number of offsprings, if it exists. With the presentrestrictions, it follows that m ≥ 2. The function x ∈ [0, 1]→ P (x) ∈ [0, 1] is positive, increasingand convex, more generally all its derivatives, when they exist, are positive, meaning that P iscompletely monotone. From the recursion relation (12) it follows immediately that

Pn(x) = P ◦ P ◦ · · · ◦ P︸ ︷︷ ︸n

(x)

In particular, since P (1) = 1, it follows that E(Zn) = P ′n(1) = mn for all n. Hence the numberof descendants at generation n grows exponentially fast with the generation in the average.

Let now Fn be the sigma algebra generated by the variables ξv for v vertices of the generationsk ≤ n. A classical remark made by Doob [11], is that, thanks to the equation (11) defining theprocess, the family (Zn)n∈N∗ satisfies

E (Zn+1|Fn) = mZn .

Namely Wn = Zn/mn is a martingale. As shown earlier by Doob [10] this implies

Theorem 13 (See [19, 3]). Let p = (pn)n≥2 be the probability distribution for the number ξof offsprings such that the average E(ξ) = m and the variance Var(ξ) = σ2 are finite. Thenthe sequence Wn = Zn/m

n of random variables is a martingale with respect to the increasing

Page 25: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 25

sequence Fn of σ-algebras. In particular it converges almost surely to a random variable W suchthat

E(W ) = 1 , Var(W ) =σ2

m2 −m.

8.2. Proof of the Proposition 2. If the probability p has an infinite support, given anyinteger M ≥ 2, the probability PM that a given vertex has more than M children is non zero.The construction of the Random Reduced Tree, can be seen by associating inductively withany vertex of generation n a string (b1, b2, · · · , bn) of integers so that 1 ≤ bn ≤ ξb1,··· ,bn−1 . Inparticular, since ξv ≥ 2 for all v’s almost surely the subset W ⊂ V made of vertices for whichall bj ’s belong to {1, 2} is non empty and gives an infinite binary subtree. Since the randomvariables {ξv ; v ∈W} are i.i.d., it follows that , given M ≥ 2

Prob{ξv ≤M ; ∀v ∈ A} = (1− PM )#A .

In particular the probability that all vertices in W have ξv ≤M vanishes. 2

8.3. Random Weight. In order that the boundary of the rooted random tree T = (V,E, ∅)previously built becomes an ultrametric Cantor set, it is necessary to put a weight on eachvertex. The previous construction suggests that the weight itself be random and Markovian aswell. In order to do so, the following model of random weight is proposed: let (λv)v∈V be afamily of i.i.d in [0, 1] with common distribution ρ; then the weight κ(v) will be given by

κ(v) = λv κ(u) , if u = father of v , κ(∅) = 1 .

In order that this defines a good weight, it is required that λv 6= 0 with probability one, namelyρ{0} = 0. Moreover, in order that the weight converges to zero along any infinite paths withprobability one, it will be required that ρ{1} < 1. It will be convenient to use the followinggenerating function (Mellin transform)

h(s) =

∫ 1

0λsρ(dλ) .

It is easy to see that h is decreasing, logarithmically convex, namely h(ηs1 + (1 − η)s0) ≤h(s1)η h(s0)1−η for 0 < η < 1, and lims→∞ h(s) = ρ{1}.

8.4. Proof of Theorem 6. Thanks to Theorem 7, the Hausdorff dimension of the tree can becomputed from the following random variables

(13) Hs(Γ) =∑v∈∂Γ

κ(v)s , Hsn =∑v∈Vn

κ(v)s .

where Γ is a full finite subtree of the random T. In the following G will denote the set of finitefull subtrees of T. This set is ordered by the inclusion of the vertex sets. In addition, givingΓ,Γ′ ∈ G, then they admit a least upper bound Γ ∨ Γ′ and a greatest lower bound Γ ∧ Γ′, as

can be checked immediately. This suggests to define a new family of σ-algebras: let FΓ be theσ-algebra generated by the ξu’s and the λv’s where v ∈ VΓ and u ∈ VΓ \ ∂Γ. In particular, if Γ

is the full tree associated with generation n, then Fn will denote the σ-algebra generated by theξu’s with u vertex of generation k ≤ n − 1 and by the λv’s with v vertex of generation k ≤ n.With these notations, the following result holds

Page 26: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

26 JEAN V. BELLISSARD, ANTOINE JULIEN

Proposition 15. With the assumptions made previously on the construction of the randomrooted weighted tree (T,V, ∅, κ) the following results hold(i) For all s ≥ 0, the family Yn(s) = Hsn/mnh(s)n is a martingale with respect to the family(Fn)n∈N

of σ-algebras. In particular it converges almost surely to a positive random variable

Y (s) such that E(Y (s)) = 1.(ii) There is tm > sm defined as the unique solution of h(2s) = mh(s)2, such that

(a) if s < tm then Yn(s) converges almost surely to a constant,(b) if s = tm, then,

Var(Y (tm)) = 1− 1

m+σ2

m2,

(c) if s > tm the random variable Y (s) has not a finite second moment.(iii) If ρ{1} < m−1 and if sm is the unique solution of mh(s) = 1, then the sequence of random

variables Y (Γ) = Hsm(Γ) defines a martingale with respect to the family(FΓ

)Γ∈G

of σ-algebras.

In particular it converges almost surely to 1.

Proof: (i) From the definition of Hsn in eq. (13), it follows that

E (Hs1) = E

∑v∈Ch(∅)

λs

= h(s)E(ξ∅) = mh(s) .

Moreover,

E(Hsn+1 | Fn) = E

∑u∈Vn

κ(u)s∑

v∈Ch(u)

λsv

∣∣∣ Fn

= h(s)E

∑u∈Vn

κ(u)sξu

∣∣∣ Fn = mh(s)

∑u∈Vn

κ(u)s

= mh(s)Hsn .

In particular, this calculation shows that E(Yn+1(s) | Fn) = Yn(s). In particular it is a martingale

w.r.t. the family Fn of σ-algebras. Since Y1(s) = Hs/mh(s) it follows that E(Y1(s)) = 1.Therefore E(Yn(s)) = 1 for all n’s. The convergence of this family is the main result of themartingale theory [11].

(ii) The calculation of the variance will be done through the second moment of Hsn. By con-struction

E(

(Hsn+1)2 | Fn)

= E

∑u,u′∈Vn

κ(u)sκ(u′)s∑

v∈Ch(u)

∑v′∈Ch(u′)

λsvλsv′

∣∣∣ Fn .

Let the terms with u 6= u′ be considered first. Then, since the λv’s are independent for differentv’s, it follows that

Page 27: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 27

E

κ(u)sκ(u′)s∑

v∈Ch(u)

∑v′∈Ch(u′)

λsvλsv′

∣∣∣ Fn = h(s)2E

(κ(u)sκ(u′)sξuξu′

∣∣∣ Fn)= h(s)2m2κ(u)sκ(u′)s .

If now u = u′, this gives two terms: the first one are terms for which v 6= v′ and the other onesare for v = v′. The same type of calculation leads to

E

κ(u)2s∑

v 6=v′∈Ch(u)

λsvλsv′

∣∣∣ Fn = h(s)2(m2 + σ2 −m)κ(u)2s ,

E

κ(u)2s∑

v∈Ch(u)

λ2sv

∣∣∣ Fn = h(2s)mκ(u)2s .

Grouping these results, leads to

E(

(Hsn+1)2 | Fn)

= h(s)2m2 (Hsn)2 + {m(h(2s)− h(s)2

)+ σ2 h(s)2}H2s

n .

Averaging on both sides gives

Var(Hsn+1) ={m(h(2s)− h(s)2

)+ σ2 h(s)2

}mnh(2s)n .

It is worth noticing that, thanks to the definition of h, the Cauchy-Schwarz inequality givesh(s)2 < h(2s). This inequality is actually strict because ρ{1} 6= 1, so that λ is not almost surelyequal to one. Therefore

Var(Yn(s)) =

(1

m

h(2s)

h(s)2

)n (1− (1− σ2

m)h(s)2

h(2s)

).

It follows that, if s < sm, then h(s)m > 1. In addition an elementary calculation shows thatthe map g(s) = h(2s)/h(s)2 is monotone increasing, that g(0) = 1 and lims→∞ g(s) = ρ{1}−1.Therefore, there is a unique tm > 0 such that m = g(tm). Using the definition of sm, it is easyto show that sm < tm. Hence

(a) if s < tm, limn→∞Var(Yn(s)) = 0, implying that Yn converges almost surely to a constant;this constant can only be the common average, namely limn→∞ Yn(s) = 1,

(b) if s = tm, then the variance converges to a finite value

s = tm ⇒ limn→∞

Var(Yn(tm)) =

(1 +

σ2

m2− 1

m

),

(c) if s > tm, then the limiting random variable Y (s) does not have a finite second moment.

(iii) Let now Γ′ ⊂ Γ be two full finite subtrees of T. Then there is a decreasing sequence of fullfinite subtrees such that Γ′ ⊂ Γj ⊂ · · ·Γ1 ⊂ Γ0 = Γ, and such that Γi+1 is obtained from Γi bythe following procedure: each vertex v ∈ ∂Γi which is not in ∂Γ′ is removed and replaced by itsfather. It is clear that, if Γi is full, so is Γi+1. This leads to

Page 28: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

28 JEAN V. BELLISSARD, ANTOINE JULIEN

E(Hs(Γ) | FΓ1

)= E

∑u∈∂Γ1;Ch(u)∩∂Γ′=∅

κ(u)s∑

v∈Ch(u)

λsv

∣∣∣ FΓ1

+∑

u∈∂Γ∩∂Γ′

κ(u)s

Thanks to the definition of FΓ1 the r.h.s. becomes

E(Hs(Γ) | FΓ1

)= mh(s)

∑u∈∂Γ1;Ch(u)∩∂Γ′=∅

κ(u)s +∑

u∈∂Γ∩∂Γ′

κ(u)s .

In particular, if s = sm, namely if mh(s) = 1, this gives Y (Γ) = Hsm(Γ) so that

E(Y (Γ) | FΓ1

)= Y (Γ1) .

Proceeding inductively along the chain of Γi’s, this gives

E(Y (Γ) | FΓ′

)= Y (Γ′) .

Therefore the family (Y (Γ))Γ∈G is also a martingale w.r.t. the FΓ’s. The martingale theoremthen shows that it converges almost surely. Since the full tree with boundary Vn is a member ofthis family and since it has been shown that the variance converges to zero (because sm < tm),the family converges to a constant almost surely. 2

Proposition 16. Under the hypothesis of Proposition 15, the Hausdorff dimension of (∂T, dκ)is almost surely equal to sm.

Proof: Thanks to Proposition 15, Hsm(Γ) converges almost surely to 1. It follows that

Hsmδ = infΓ∈Gδ

Hsm(Γ) , ⇒ limδ↓0Hsmδ = 1 .

Consequently, Hsδ →∞ for s < sm and Hsδ → 0 for s > sm. Hence dimH(∂T, dκ) = sm. 2

Proposition 17. Under the hypothesis of Proposition 15, the Hausdorff measure of (∂T, dκ) atthe dimension s = sm exists almost surely and is a random probability.

Proof: In order to prove it, it is sufficient to consider the basis of clopen sets of the form [u]for u ∈ V. It boils down to consider

Hsm(Γ;u) =∑

v∈∂Γ;v�uκ(v)s .

a calculation similar to the one made in the proof of Proposition 15, shows that the family of{Hsm(Γ;u) ; Γ ∈ G , u ∈ VΓ} is also a martingale satisfying

E(Hsm(Γ;u)

∣∣ FΓ0

)= κ(u)sm ,

for all full finite subtree Γ0 with u ∈ ∂Γ0 and Γ ⊃ Γ0. Therefore, the martingale Theoremimplies that µ([u]) = limΓHsm(Γ;u) exists and that it is a random variable. Since the set ofvertices is countable, the set of probability zero on which the convergence does not hold can bechosen independently on u ∈ V. By construction∑

u∈∂Γ0

Hsm(Γ;u) = Hsm(Γ) ,

Page 29: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 29

showing that, after taking the limit,∑

u∈∂Γ0µ([u]) = 1. Since this is true for any full finite

subtree, the Proposition 6 shows that the same relation occurs if ∂Γ0 is replaced by any finitepartition of ∂T by clopen sets. Since clopen sets generates the σ-algebra of Borel sets, it showsthat µ defines a probability measure on ∂T. 2

Page 30: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

30 JEAN V. BELLISSARD, ANTOINE JULIEN

References

[1] J. Aliste, D. Coronel, “Tower systems for linearly repetitive Delone sets”, Ergod. Th. & Dynam. Sys., 31,(2011),1595-1618.

[2] P. Arnoux, “Sturmian Sequences”, Chap. 6 in N. Pytheas Fogg, Substitutions in dynamics, arithmetics,and combinatorics, Lecture Notes in Mathematics, Springer, (2002).

[3] K. B. Athreya, P.E. Ney, Branching processes, Die Grundlehren der mathematischen Wissenschaften, Band196. Springer-Verlag, New York-Heidelberg, (1972). Reprinted by Dover Publications, Inc., Mineola, NY,(2004).

[4] J. Bellissard, R. Benedetti, J.-M. Gambaudo, “Spaces of tilings, finite telescopic approximations and gap-labeling”, Commun. Math. Phys., 261, (2006), 1-41.

[5] J. Berstel, P. Seebold, “Sturmian Words”, Chap.2 in M. Lothaire, Algebraic Combinatorics on Words,Cambridge University press, (2002).

[6] L. M. Blumenthal, “New theorems and methods in determinant theory”, Duke Mathematical Journal, 2,(1936), 396-404.

[7] L. Brouwer, “On the structure of perfect sets of points”, Proc. Akad. Amsterdam, 12, (1910), 785-794.[8] P. Courrieu, “Straight monotonic embedding of data sets in Eucidean spaces”, Neural Networks, 15, (2002),

1185-1196.[9] T. M. Cover, J. A. Thomas, Elements of Information Theory, Second Edition, Wiley, (2006).

[10] J. L. Doob, “Regularity properties of certain families of chance variables”, Trans. Amer. Math. Soc., 47,(1940), 455-486.

[11] J. L. Doob, Stochastic processes, John Wiley & Sons, Inc., New York; Chapman & Hall, Limited, London,(1953).

[12] F. Durand, “Linearly recurrent subshifts have a finite number of non-periodic subshift factors”, Ergod. Th.& Dynam. Sys. 20, (2000), 1061-1078.

[13] F. Durand, “Corrigendum and addendum to ‘Linearly recurrent subshifts have a finite number of non-periodic factors’ ”, Ergod. Th. & Dynam. Sys. 23, (2003), 663-669.

[14] F. Durand, B. Host, C. Skau, “Substitutions, Bratteli diagrams and dimension groups”, Ergod. Th. &Dynam. Sys. 19, (1999), 953-993.

[15] L. Euler, “De fractionibus continuis dissertation”, Comm. Acad. Sci. Petr., 9, (1737), 98-137, 1744.Reprinted in Leonhardi Euleri Opera Omnia, Ser. I, Vol. 14, Leipzig, Germany: Teubner, pp. 187-215,(1924).

[16] K. Falconer, Fractal Geometry, Mathematical foundations and applications, John Wiley & Sons, Ltd.,Chichester, (1990).

[17] M. Frechet, “Les dimensions d’un ensemble abstrait”, Mathematische Annalen, 68, (1910), 145168.[18] T. Giordano, H. Matui, I. Putnam, C. Skau, “Orbit equivalence for Cantor minimal Zd-systems”, Inven-

tiones Mathematicae, 179 (2010), 119-158.[19] T. E. Harris, The theory of branching processes, Die Grundlehren der Mathematischen Wissenschaften, Bd.

119 Springer-Verlag, Berlin; Prentice-Hall, Inc., Englewood Cliffs, N.J. (1963). The theory of branchingprocesses (Corrected reprint of the 1963 original book), Dover Phoenix Editions, Dover Publications, Inc.,Mineola, NY, (2002).

[20] F. Hippert & D. Gratias Eds., Lectures on Quasicrystals, Editions de Physique, Les Ulis, (1994).[21] P. Indyk, J. Matousek, “Low-distorsion embeddings of finite metric spaces”, in Handbook of Discrete and

Computational Geometry, Chap. 8, p.177-196, Jacob E. Goodman, Joseph O’Rourke Eds, CRC Press,(2004).

[22] A. Julien, J. Savinien, “Embeddings of self-similar ultrametric Cantor sets”, Topology and Applications,158, (2011), 2148-2157.

[23] D. G. Kendall, “Branching processes since 1873”, J. London Math. Soc., 41, (1966), 385-406.[24] A. Khintchine, Continued Fractions, New York: Dover, (1997).[25] J. C. Lagarias, “Geometric models for quasicrystals I. Delone sets of finite type”, Discrete Comput. Geom.,

21, (1999), 161-191.[26] P. Levy, “Sur le developpement en fraction continue d’un nombre choisi au hasard”, (French) Compositio

Math., 3, (1936), 286-303.[27] C. A. Micchelli, “Interpolation of scattered data: Distance matrices and conditionally positive definite

functions”, Constructive Approximation, 2, (1986), 11-22.[28] G. Michon, “Les Cantors reguliers”, C. R. Acad. Sci. Paris Ser. I Math., 300, (1985), 673-675.

Page 31: Bi-Lipshitz Embedding of Ultrametric Cantor Sets Into Euclidean Spaces

BI-LIPSHITZ EMBEDDING OF ULTRAMETRIC CANTOR SETS INTO EUCLIDEAN SPACES 31

[29] M. Morse, G. A. Hedlund, “Symbolic Dynamics II. Sturmian trajectories”, American Journal of Mathe-matics, 62, (1940), 1-42.

[30] J. Pearson, J. Bellissard, “Noncommutative Riemannian Geometry and Diffusion on Ultrametric CantorSets”, Journal of Noncommutative Geometry, 3, (2009), 447-480.

[31] Sadun L., Topology of tiling spaces. University Lecture Series, 46. American Mathematical Society, Provi-dence, RI, 2008.

[32] I. J. Schoenberg, “Remarks to Maurice Frechet’s article ”Sur la definition axiomatique d’une classe d’espacedistancies vectoriellement applicable sur l’espace de Hilbert”, Ann. of Math., 36, (1935), 724732.

[33] I. J. Schoenberg, “On certain metric spaces arising from euclidean spaces by a change of metric and theirimbedding in Hilbert space”, Annals of Mathematics, 38(2), (1937), 787-793.

[34] I. J. Schoenberg, “Metric spaces and positive definite functions”, Transactions of the American Mathemat-ical Society, 44, (1938), 522-536.

[35] H. W. Watson, F. Galton, “On the Probability of the Extinction of Families”, Journal of the AnthropologicalInstitute of Great Britain, 4, (1875), 138-144.

Jean V. Bellissard, Georgia Institute of Technology, School of Mathematics, Atlanta GA30332-0160

E-mail address: [email protected]

Antoine Julien, Department of Mathematics and Statistics, University of Victoria, Victoria,BC, Canada V8W 3R4

E-mail address: [email protected]