supplementary information: predicting the phase diagram of … · o 2) e e e e e pa2 1 0 10 20 30...

8
Supplementary Information: Predicting the phase diagram of titanium dioxide with random search and pattern recognition Aleks Reinhardt, 1 Chris J. Pickard, 2, 3 and Bingqing Cheng 1,4,5, a) 1) Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom 2) Department of Materials Science & Metallurgy, University of Cambridge, 27 Charles Babbage Road, Cambridge, CB3 0FS, United Kingdom 3) Advanced Institute for Materials Research, Tohoku University, Sendai, Japan 4) TCM Group, Cavendish Laboratory, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0HE, United Kingdom 5) Trinity College, Trinity Street, Cambridge, CB2 1TQ, United Kingdom (Dated: 8 May 2020) S1 S1 S3 S5 S6 S8 CONTENTS S1. Overview of known titanium dioxide phases S2. New phases of titanium dioxide S3. Enthalpy of titanium dioxide phases at 0 K S4. Basin and thermodynamic entropies S5. ML predictions of chemical potentials S6. Phase diagram of titanium dioxide S1 OVERVIEW OF KNOWN TITANIUM DIOXIDE PHASES Titanium dioxide is one of the most broadly used metal oxides. 1,2 In addition to the most commonly occurring polymorphs of TiO 2 – rutile and anatase –, a large number of both ambient- and high-pressure phases have been observed and hypothesised, including brookite, 3 TiO 2 (B), 4 hollandite, 5 columbite, 6 baddeleyite, 7 TiO 2 (OI), 8 fluorite, 9 oxygen-distorted fluorite (Pca2 1 ), 10 ramsdellite 11 and cotunnite. 12 Many of the high-pressure phases may have interesting optical properties, 13 and the cotunnite phase has been suggested, somewhat controversially, 14 to be the hardest known oxide. 12 Although phases such as columbite are sometimes described as ‘high-pressure’, in the sense that they are synthesised at pressures higher than atmospheric pressure, they nevertheless remain metastable when the pressure is reduced, and so can be of practical use even at ambient pressure. 15 Hence it is important to understand not only the limits of thermodynamic stability, but also metastability. An illustration of these previously known phases can be found in Ref. 16. We use the Matsui–Akaogi (MA) empirical potential 17 of titanium dioxide, which was parameterised to reproduce some physical properties of rutile, anatase, brookite and columbite, such as their unit cell lattice parameters, densities, Ti–O distances, compressibilities and thermal expansivities. 17 In the context of the MA potential, we note that the oxygen-displaced fluorite (Pca2 1 ) structure is not (meta)stable, 16 and the ramsdellite and OI phases are somewhat different from the experimentally reported structures. 16 S2 NEW PHASES OF TITANIUM DIOXIDE In the random searches we found a total of 15 additional phases that have low enthalpies or high frequency of occurrence, following the ML pattern recognition schemes described in the main text. Those structures were first converted into a canonical representation (using Avogadro 18 ), and then considered in more detail. We illustrate the 15 new structures in Fig. S1. We label them with a standard format P-a-SYM, where P gives the pressure in gigapascals at which the structure was found in RSS, a is a lowercase letter purely used for labelling, and SYM gives the space group of the structure. a) Electronic mail: [email protected] Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is © the Owner Societies 2020

Upload: others

Post on 25-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

Supplementary Information: Predicting the phase diagram of titaniumdioxide with random search and pattern recognition

Aleks Reinhardt,1 Chris J. Pickard,2, 3 and Bingqing Cheng1, 4, 5, a)1)Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW,United Kingdom2)Department of Materials Science & Metallurgy, University of Cambridge, 27 Charles Babbage Road, Cambridge, CB3 0FS,United Kingdom3)Advanced Institute for Materials Research, Tohoku University, Sendai, Japan4)TCM Group, Cavendish Laboratory, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0HE,United Kingdom5)Trinity College, Trinity Street, Cambridge, CB2 1TQ, United Kingdom

(Dated: 8 May 2020)

S1

S1

S3

S5

S6

S8

CONTENTS

S1. Overview of known titanium dioxide phases

S2. New phases of titanium dioxide

S3. Enthalpy of titanium dioxide phases at 0 K

S4. Basin and thermodynamic entropies

S5. ML predictions of chemical potentials

S6. Phase diagram of titanium dioxide

S1 OVERVIEW OF KNOWN TITANIUM DIOXIDE PHASES

Titanium dioxide is one of the most broadly used metal oxides.1,2 In addition to the most commonly occurring polymorphsof TiO2 – rutile and anatase –, a large number of both ambient- and high-pressure phases have been observed and hypothesised,including brookite,3 TiO2(B),4 hollandite,5 columbite,6 baddeleyite,7 TiO2(OI),8 fluorite,9 oxygen-distorted fluorite (Pca21),10

ramsdellite11 and cotunnite.12 Many of the high-pressure phases may have interesting optical properties,13 and the cotunnitephase has been suggested, somewhat controversially,14 to be the hardest known oxide.12 Although phases such as columbite aresometimes described as ‘high-pressure’, in the sense that they are synthesised at pressures higher than atmospheric pressure, theynevertheless remain metastable when the pressure is reduced, and so can be of practical use even at ambient pressure.15 Hence it isimportant to understand not only the limits of thermodynamic stability, but also metastability. An illustration of these previouslyknown phases can be found in Ref. 16.

We use the Matsui–Akaogi (MA) empirical potential17 of titanium dioxide, which was parameterised to reproduce somephysical properties of rutile, anatase, brookite and columbite, such as their unit cell lattice parameters, densities, Ti–O distances,compressibilities and thermal expansivities.17 In the context of the MA potential, we note that the oxygen-displaced fluorite(Pca21) structure is not (meta)stable,16 and the ramsdellite and OI phases are somewhat different from the experimentally reportedstructures.16

S2 NEW PHASES OF TITANIUM DIOXIDE

In the random searches we found a total of 15 additional phases that have low enthalpies or high frequency of occurrence,following the ML pattern recognition schemes described in the main text. Those structures were first converted into a canonicalrepresentation (using Avogadro18), and then considered in more detail. We illustrate the 15 new structures in Fig. S1. We labelthem with a standard format P-a-SYM, where P gives the pressure in gigapascals at which the structure was found in RSS, a is alowercase letter purely used for labelling, and SYM gives the space group of the structure.

a)Electronic mail: [email protected]

Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics.This journal is © the Owner Societies 2020

Page 2: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S2

0.0

0.21

0.22

0.23

0.27

0.28

0.29

0.5

0.5

0.5

0.5

0.71

0.72

0.73

0.77

0.78

0.79

1.0

0-a-P21/c

a = 7.63 Ab = 4.50 Ac = 5.39 A

𝛼 = 𝛾 = 90°𝛽 = 102.5°

0.0

0.160.16 0.18

0.320.340.34

0.5

0.5

0.5

0.660.660.68

0.82 0.840.84

1.0

0-b-P21/c

a = 7.46 Ab = 4.78 Ac = 4.93 A

𝛼 = 𝛾 = 90°𝛽 = 102.1°

0.19

0.19

0.24

0.24

0.26

0.26

0.31

0.31

0.5 0.5

0.5 0.5

0.5

0.5

0.69

0.69

0.74

0.74

0.76

0.76

0.81

0.81

0-c-I41/a

a = b = 4.79 Ac = 10.25 A

𝛼 = 𝛽 = 𝛾 = 90°

0.33 0.33

0.33

0.33

0.33

0.67

0.67

0.67 0.67

0.67

0-d-P31m

a = b = 4.95 Ac = 4.23 A

𝛼 = 𝛽 = 90°𝛾 = 120.0°

0.03

0.03

0.03

0.140.14

0.17

0.17

0.25

0.25

0.25

0.25

0.33

0.330.36

0.36

0.47

0.47

0.47

0.53

0.53

0.53

0.640.64

0.67

0.67

0.75

0.75

0.75

0.75

0.83

0.830.86

0.86

0.97

0.97

0.97

0-e-C2/c

a = 11.13 Ab = 4.88 Ac = 6.98 A

𝛼 = 𝛾 = 90°𝛽 = 112.4°

0.5

0.5

0.5

0.5

0.5

0.5

0.5 0.5

0.5 0.5

0.5 0.5

0-f-C2/m

a = 14.21 Ab = 3.03 Ac = 4.49 A

𝛼 = 𝛾 = 90°𝛽 = 92.8°

0.0 0.0

0.0 0.0

0.16

0.16

0.34 0.34

0.34 0.34

0.35 0.35

0.35 0.35

0.49

0.49

0.5

0.5

0.66 0.66

0.66 0.66

0.84

0.84

0.85

0.99 0.99

0.99 0.99

0-g-Im2m

a = 3.00 Ab = 4.32 Ac = 14.03 A

𝛼 = 𝛽 = 𝛾 = 90°

0.23

0.27 0.27

0.5

0.73 0.73

0.77

40-a-Pnnm

a = 2.89 Ab = 4.20 Ac = 4.37 A

𝛼 = 𝛽 = 𝛾 = 90°

0.02

0.04

0.05

0.10

0.200.26

0.270.27

0.30.33

0.38

0.41

0.47

0.5

0.53

0.59

0.62

0.670.70

0.730.73

0.74

0.8

0.90

0.95

0.96

0.98

60-a-R3

a = b = c = 5.92 A𝛼 = 𝛽 = 𝛾 = 94.4°

0.02

0.14

0.15

0.24 0.26

0.35

0.36

0.48

0.5

0.52

0.64

0.65

0.74 0.76

0.85

0.86

0.98

60-b-P21/c

a = 4.55 Ab = 4.63 Ac = 7.08 A

𝛼 = 𝛾 = 90°𝛽 = 74.9°

0.05

0.060.06

0.23

0.23

0.23

0.41

0.42

0.44

0.56

0.580.59

0.77

0.77

0.77

0.940.94

0.95

60-c-Pc

a = 7.08 Ab = 4.55 Ac = 4.64 A

𝛼 = 𝛾 = 90°𝛽 = 106.1°

0.260.33

0.41 0.41

0.41 0.41

0.59

0.59

0.670.74 0.74

60-d-P62m

a = b = 5.10 Ac = 2.97 A

𝛼 = 𝛽 = 90°𝛾 = 120.0°

0.25

0.25

0.25

0.25

0.25

0.25

0.25

0.25 0.25

0.75

0.75

0.75

0.75

0.75

0.750.75 0.75

0.75

60-e-P21/m

a = 8.91 Ab = 2.98 Ac = 5.07 A

𝛼 = 𝛾 = 90°𝛽 = 90.6°

0.20 0.20

0.37

0.38 0.38

0.70

0.87 0.87

0.88

60-f-Pmn21

a = 3.03 Ab = 3.46 Ac = 4.59 A

𝛼 = 𝛽 = 𝛾 = 90°

0.06

0.08

0.25

0.25

0.26

0.28

0.29

0.41

0.43

0.57

0.59

0.71

0.72

0.74

0.75

0.75

0.92

0.94

60-g-P1

a = 4.53 Ab = 4.61 Ac = 7.16 A𝛼 = 104.9°𝛽 = 87.6°𝛾 = 93.3°

FIG. S1. Plan views of unit cells of the new crystal structures considered, as labelled. Relative heights above the plane are indicated as fractionsof the lattice parameter along the b axis; where no height is indicated, this corresponds to a relative height of 0 and 1. Titanium atoms are shownin cyan and oxygen atoms in red. Full crystal structures are available in the supporting data.

In fact, the phase labelled 60-d-P62m is the Fe2P phase of TiO2 that was previously considered at higher pressures than we havefocused on here;13 however, as we have not previously computed its free energy, we consider it here together with the other ‘new’phases.

Page 3: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S3

S3 ENTHALPY OF TITANIUM DIOXIDE PHASES AT 0 K

In Fig. S2, we show the enthalpy of each of the phases considered at 0 K using the MA potential with energy optimisation aswell as a machine-learning approach, as discussed in the main text. This figure is analogous to Fig. 3 of the main text, except thatit is shown at absolute zero, where entropy plays no role.

By comparing the position of the curves between this figure and Fig. 3 of the main text, we can note that the entropy cansignificantly affect phase behaviour; for example, the OI analogue phase is entropically disfavoured and so is shifted upwards asthe temperature increases.

In Fig. 3(a) of the main text, we report the chemical potentials of different TiO2 phases at 1600 K. We have chosen to focus on arelatively high temperature because this is where the entropy of the system becomes more important to capture correctly; however,the higher the temperature, the greater the risk that some solid phases will cease to be metastable. For example, the baddeleyitephase and several of the newly considered phases [namely 0-c-I41/a, 0-e-C2/c, 60-c-Pc and 60-e-P21/m] do not appear in Fig. 3(a)of the main text, because they are no longer metastable at such high temperatures. Baddeleyite, for example, spontaneouslyconverts into pyrite at approximately 1150 K when heated at 20 GPa. Although the most stable of the solid phases can remain atleast metastable with respect to the liquid phase up to several thousand kelvin, we have here chosen a temperature of 1600 K sincethe majority of the phases we have so far considered are still metastable, which thus provides a more challenging benchmark forthe ML approach.

For most phases shown in Fig. 3(a) and Fig. S2, the curves do not extend throughout the entire pressure range because thepolymorphs cease to be metastable with respect to other phases as a function of pressure. In this context, one particularly interestingtransition is that between rutile and the new phase 40-a-Pnnm (see Fig. S1), which are structurally particularly similar. Thetransition between them appears to be continuous as the system is pressurised: while the space group changes, there does not seemto be any change in density or enthalpy. In a sense, the 40-a-Pnnm polymorph is thus merely a high-pressure analogue of the rutilephase.

rutile

pyrite

OI analogue

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

C/2c

hollandite

ramsdellite analogue

0 10 20 30 40 50 60 70

0

10

20

P/GPa

rutile

pyrite

OI analogue

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluoriteC/2c

hollandite

ramsdellite analogue

0 10 20 30 40 50 60

0

10

20

P/GPa

rutile

pyrite

OI analogue

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

C/2c

hollandite

ramsdellite analogue

0-a-P21/c

0-b-P21/c0-c-I41/a

0-d-P31m

0-e-C2/c

0-f-C2/m

0-g-Im2m 40-a-Pn

nm

60-a-R3

60-b-P21/c60-c-Pc

60-d-P62m

60-e-P21 /m

60-f-Pmn21

60-g-P1

0 10 20 30 40 50 60 70

0

10

20

P/GPa

rutile

pyrite

OI analogue

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluoriteC/2c

hollandite

ramsdellite analogue

0-a-P21/c

0-b-P21/c

0-c-I41/a

0-d-P31m

0-e-C2/c

0-f-C2/m

40-a-Pnnm

60-a-R360-b-P21/c

60-c-Pc

60-d-P62m60-e-P21 /m

60-f-Pmn2160-g-P1

0 10 20 30 40 50 60

0

10

20

P/GPa

(a) energy minimisation (b) ML toolkit approximation(i) (i)

(ii) (ii)

Relativ

een

thalpy

/kcalm

ol−1

Relativ

een

thalpy

/kcalm

ol−1

FIG. S2. Enthalpy at 0 K using (a) direct energy minimisation and (b) a machine-learning prediction, expressed per formula unit and relative tothe pyrite phase. In each case, in panel (i), only the previously known phases are shown, and in panel (ii), new phases obtained from RSS areshown, with the structures from panel (i) repeated in grey. In (b), the error bar of potentially competitive phases is shown as a coloured band.Where a phase was not obtained at a given pressure from a random search, the marker is hollow and connected to other points by a dashed line.Note that the pressure axis covers different ranges in (a) and (b); dotted lines in (a) indicate the range of panel (b).

Page 4: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S4

In Fig. S3, we show the enthalpy at 0 K from density-functional theory using three different functionals. On the whole, theagreement between the three is remarkably good, with a root mean square deviation of the order of 1.2 kcal mol−1. The data arealso consistent with previous DFT-based calculations19–22 using a variety of functionals, suggesting that the DFT calculation isrelatively robust with respect to the choice of functional. None of the newly obtained phases appears to be absolutely favoured inenthalpy; however, a number of phases are certainly competitive and may well be metastable.

rutile

pyrite

OI

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

holland

ite

ramsdellite

Pca21

0 10 20 30 40 50 60 70

−30

−20

−10

0

10

20

P/GPa

rutile

pyrite

OI

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

holland

ite

ramsdellite

Pca21

0-a-P21/c0-b-P21/c

0-c-I41/a

0-d-P31m

0-e-C2/c

0-f-C2/m

0-g-Im2m 40-a-Pnnm

60-a-R3

60-b-P21/c

60-c-Pc60-d-P62m 60-e-P21 /m

60-f-Pmn21

60-g-P1

0 10 20 30 40 50 60 70

−30

−20

−10

0

10

20

P/GPa

rutile

pyrite

OI

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

holland

ite

ramsdellite

Pca21

0 10 20 30 40 50 60 70

−20

−10

0

10

20

P/GPa

rutile

pyrite

OI

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

holland

ite

ramsdellite

Pca21

0-a-P21/c

0-b-P21/c

0-c-I41/a

0-d-P31m

0-e-C2/c

0-f-C2/m

0-g-Im2m 40-a-Pnnm

60-a-R3

60-b-P21/c

60-c-Pc60-d-P62m 60-e-P21 /m

60-f-Pmn21

60-g-P1

0 10 20 30 40 50 60 70

−20

−10

0

10

20

P/GPa

rutile

pyrite

OI

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

holland

ite

ramsdellite

Pca21

0 10 20 30 40 50 60 70−30

−20

−10

0

10

20

P/GPa

rutile

pyrite

OI

columbite

cotunnite

anatase

TiO2(B)

brookite

baddeleyite

fluorite

holland

ite

ramsdellite

Pca21

0-a-P21/c0-b-

P21/c

0-c-I41/a

0-d-P31m

0-e-C2/c

0-f-C2/m

0-g-Im2m 40-a-Pnnm

60-a-R3

60-b-P21/c

60-c-Pc60-d-P62m 60-e-P21 /m

60-f-Pmn21

60-g-P1

0 10 20 30 40 50 60 70−30

−20

−10

0

10

20

P/GPa

(a) LDA

(b) PBE

(c) PBEsol

(i) (ii)

(i) (ii)

(i) (ii)

Relativ

een

thalpy

/kcalm

ol−1

Relativ

een

thalpy

/kcalm

ol−1

Relativ

een

thalpy

/kcalm

ol−1

FIG. S3. Enthalpies from density-functional theory at 0 K using the (a) LDA, (b) PBE and (c) PBEsol functionals. In each case, in panel (i), onlythe previously known phases are shown, and in panel (ii), new phases obtained from RSS are shown, with the structures from panel (i) repeatedin grey.

Page 5: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S5

S4 BASIN AND THERMODYNAMIC ENTROPIES

In Fig. 2 of the main text, we compared the entropy estimated from the basin of attraction, sb = kB ln(v)/n, and the thermodynamicentropy computed from the chemical potential, showing that the basin entropy estimate is slightly too large for low-enthalpystructures. For reference, in Fig. S4(a), we plot exactly the same data as in Fig. 2 of the main text, except that each point is labelledwith phase, temperature and pressure information, rather than the enthalpy.

Because the minimisation scheme used in RSS can increase the frequency of finding low enthalpy structures, we fittedaH + constant = sb − S over all data points to achieve a better estimate of S. With a = 8 × 10−5 K−1, we obtain a Pearsoncorrelation coefficient of r ≈ 0.7, and we show a comparison between the adjusted entropy sb − aH and S in Fig. S4(b).

(a)

(b)

FIG. S4. (a) A comparison between the basinentropy sb and the thermodynamic entropy Sper formula unit of TiO2. Each data point islabelled by the name of the structure. The colourscheme shows the pressure of the RSS and thefree-energy calculations. Data are shown for700 K (hollow markers) and 1600 K (solid mark-ers). (b) A comparison between the adjustedentropy sb−aH and S, where a = 8 × 10−5 K−1.As only the relative entropy under a given ther-modynamic condition between different phasesmatters, we centred the data about the origin.

Page 6: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S6

S5 ML PREDICTIONS OF CHEMICAL POTENTIALS

In the main text, we described the kernel ridge regression (KRR) approach. We used KRR to to estimate the enthalpy and basin entropy per formula units of TiO2 structures. Here we describe the detailed workflow for the KRR fitted on the enthalpy H data for structures generated from RSS at 0 GPa; the estimation of H or Sb at other pressures is essentially the same.

For the total of 3263 structures generated by RSS at 0 GPa, we first randomly partitioned them into a training set (90 % of the structures) and a test set (10 % of the structures). From the training set we select the M = 200 representative structures using the farthest-point sampling (FPS) approach. This approach ensures a near uniform and complete coverage of the hyper space of descriptors. In the fitting process of KRR, we set the regularisation matrix 𝛬 = 0.01𝐼N , where 𝐼N is an order N identity matrix. The left panel of Fig. S5 shows the outcome of such fitting. The fitting error in RMSE is about 0.8 kcal mol−1 in both the training and the test sets, and the correlation coefficients are both 0.99. The right panel shows the learning curve. It can be seen that the learning rate has not saturated; more data points can reduce the error further. Fig. S5 shows the analogous KRR for the prediction of the basin entropy sb. Notice that we plot the exponential of this entropy in the left panel. The fitting error in RMSE is about 0.2kB in both the training and the test sets, and the correlation coefficients are both 0.77. The ML predictions in Fig. S5 show larger scatter compared with the ones for enthalpy. This may be due to the systematic errors in the estimates of the actual sb because of the finite sampling and the finite system sizes, or it may be because descriptions of the equilibrium configurations cannot fully capture the basin volume. To estimate the uncertainty of the prediction, we used the subsampling technique.23 This method involves first creating an ensemble of models by randomly selecting a subset of the training data, and then computing the variance in the predictions of the models. In our case we created 8 models of size N = 2000, and compared the variance of the prediction on the test set with the absolute prediction errors of the test set. These procedures are implemented in the Python notebook included in the supporting data.

FIG. S5. Left: a comparison between the actual enthalpy and the predicted enthalpy per formula unit of TiO2 for both the training set and the test set. Right: the learning curve of KRR. We show the evolution of the test root mean square error (RMSE) as the number of training points increases.

After performing the KRR procedures for both enthalpy and basin entropy for each of the four pressures (0, 20, 40, 60 GPa) separately, we estimate the chemical potentials and show these results in Fig. S7. For the results at each pressure, the filled blue markers show the chemical potential at 0 K (i.e. the enthalpy) of all the structures found at that pressure, and the hollow blue markers show the ML prediction for the structures not found. For a given found structure, the error estimate of its enthalpy is given by the standard deviations of the enthalpies of all structures in the cluster to which the structure belongs. For the uncertainty in entropy, we assume a constant value of 0.2kB, because that is the value of the root mean squared error in S − Sb per formula unit. After that, we use h(0 K) − T sb to approximate its chemical potential, where h is the enthalpy per formula unit.

For structures that are not found, we first use the ML schemes outlined in the main text to estimate the enthalpy and basin entropy. The error in its enthalpy or entropy is estimated using the standard deviation of KRR models constructed using a subsampling technique.23 Finally we again use h(0 K) − T sb to estimate the chemical potential.

In order to gauge the quality of the ML approach, we have also computed the standard deviation of the difference in chemical potentials estimated using ML and thermodynamic integration. For the data reported in Fig. 3 of the main text, Spearman’s rank correlation coefficients between the two methods were 0.96 for those phases which were found in RSS (indicated by solid lines in Fig. 3) and 0.83 for those where the entropy was also estimated using a ML scheme (indicated by dashed lines in Fig. 3).

Page 7: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S7

FIG. S6. Left: a comparison between the basin volume (equal to exp(sb/kB)) per formula unit of TiO2 and the predicted values for both thetraining set and the test set. Right: the learning curve of KRR for the basin entropy sb , the unit is in kB. We show the evolution of the test rootmean square error (RMSE) as the number of training points increases.

FIG. S7. The chemical potentials at 1600 K (red markers) and at 0 K (blue markers) of the phases at four pressures. For clarity, we ordered the structures at each pressure according their chemical potentials at 0 K (i.e. the enthalpies). Solid markers denote structures found at the particular pressure, and hollow markers mean the converse. If a structure is not found, its entropy and enthalpy are predicted using the KRR ML method.

The standard deviation of the difference between the two averaged over all pressures was 1.1 kcal mol−1 for the former and 1.7 kcal mol−1 for the latter. These standard deviations are comparable to the error in the predictions themselves (see Fig. S5).

Page 8: Supplementary Information: Predicting the phase diagram of … · O 2) e e e e e Pa2 1 0 10 20 30 40 50 60 70 30 20 10 0 10 20 P/GPa pyrite I e e e O 2) e e e e Pa2 1 2 1/c 2 1

S8

S6 PHASE DIAGRAM OF TITANIUM DIOXIDE

rutile

pyrite

OIcotunnite

60-a-R3

0 500 1000 15000

20

40

60

T/K

rutile

pyrite

60-a-R3

0 500 1000 15000

20

40

60

T/K

(a) free-energy calculation (b) ML toolkit approximation

P/GP

a

0.02

0.05

0.040.10

0.200.26

0.270.27

0.3 0.33

0.38

0.41

0.47

0.5

0.53

0.59

0.62

0.67 0.70

0.730.73

0.74

0.8

0.90

0.95

0.96

0.98

0.16

0.160.34

0.34

0.5 0.5

0.5

0.5

0.66

0.660.84

0.84

0.190.31

0.31

0.5

0.69

0.69

0.81

FIG. S8. Pressure–temperature phase diagrams from (a) free-energy calculations and from (b) the ML toolkit approximation. Plan views of thestable crystal structures are also shown for illustration.

In Fig. S8, we estimate the phase diagram of TiO2 using both free-energy calculations and a machine-learning prediction. Thediagram in Fig. S8(a) also shows the previously computed phase diagram (Ref. 16), with a triple point between the pyrite, OI(analogue) and cotunnite phases. Using the new structure obtained from RSS, however, this triple point can be identified to bemetastable, and the 60-a-R3 phase is stable at sufficiently high pressures; this is denoted by the hatched region in the figure. Theagreement between the phase diagrams shown in panels (a) and (b) is rather remarkable. However, it should be borne in mind thatthis result is somewhat fortuitous: the 60-a-R3 phase is conveniently more stable than the OI and cotunnite phases, and the MLapproach would not have correctly predicted the triple point of the pyrite, OI and cotunnite phases. The prediction of the chemicalpotential of each phase, shown in the main manuscript, is a rather more rigorous test of the approach.

1J. H. Braun, A. Baidins, and R. E. Marganski, “TiO2 pigment technology:A review,” Prog. Org. Coat. 20, 105 (1992).

2N. Rahimi, R. A. Pax, and E. M. Gray, “Review of functional titanium oxides.I: TiO2 and its modifications,” Prog. Solid State Chem. 44, 86 (2016).

3E. P. Meagher and G. A. Lager, “Polyhedral thermal expansion in the TiO2polymorphs: Refinement of the crystal structures of rutile and brookite athigh temperature,” Can. Mineral. 17, 77 (1979).

4R. Marchand, L. Brohan, and M. Tournoux, “TiO2(B) A new form oftitanium dioxide and the potassium octatitanate K2Ti8O17,” Mater. Res. Bull.15, 1129 (1980).

5M. Latroche, L. Brohan, R. Marchand, and M. Tournoux, “New hollanditeoxides: TiO2(H) and K0.06TiO2,” J. Solid State Chem. 81, 78 (1989);T. Sasaki, M. Watanabe, and Y. Fujiki, “Structure of K1.0Ti8O16 andK0.0Ti8O16,” Acta Crystallogr. B49, 838 (1993).

6N. A. Bendeliani, S. V. Popova, and L. F. Vereshchagin, “New modificationof titanium dioxide obtained at high pressures,” Geochem. Int. 3, 387 (1966);I. E. Grey, C. Li, I. C. Madsen, and G. Braunshausen, “TiO2-II. Ambientpressure preparation and structure refinement,” Mater. Res. Bull. 23, 743(1988).

7H. Sato, S. Endo, M. Sugiyama, T. Kikegawa, O. Shimomura, and K. Kusaba,“Baddeleyite-type high-pressure phase of TiO2,” Science 251, 786 (1991);V. Swamy, L. S. Dubrovinsky, N. A. Dubrovinskaia, F. Langenhorst, A. S.Simionovici, M. Drakopoulos, V. Dmitriev, and H.-P. Weber, “Size effectson the structure and phase transition behavior of baddeleyite TiO2,” SolidState Commun. 134, 541 (2005).

8N. A. Dubrovinskaia, L. S. Dubrovinsky, R. Ahuja, V. B. Prokopenko,V. Dmitriev, H.-P. Weber, J. M. Osorio-Guillen, and B. Johansson, “Experi-mental and theoretical identification of a new high-pressure TiO2 polymorph,”Phys. Rev. Lett. 87, 275501 (2001).

9M. Mattesini, J. S. de Almeida, L. Dubrovinsky, N. Dubrovinskaia, B. Jo-hansson, and R. Ahuja, “High-pressure and high-temperature synthesis ofthe cubic TiO2 polymorph,” Phys. Rev. B 70, 212101 (2004).

10X.-F. Zhou, X. Dong, G.-R. Qian, L. Zhang, Y. Tian, and H.-T. Wang,“Unusual compression behavior of TiO2 polymorphs from first principles,”Phys. Rev. B 82, 060102 (2010).

11J. Akimoto, Y. Gotoh, Y. Oosawa, N. Nonose, T. Kumagai, K. Aoki, andH. Takei, “Topotactic oxidation of ramsdellite-type Li0.5TiO2, a new poly-

morph of titanium dioxide: TiO2(R),” J. Solid State Chem. 113, 27 (1994).12L. S. Dubrovinsky, N. A. Dubrovinskaia, V. Swamy, J. Muscat, N. M.

Harrison, R. Ahuja, B. Holm, and B. Johansson, “The hardest known oxide,”Nature 410, 653 (2001).

13H. Dekura, T. Tsuchiya, Y. Kuwayama, and J. Tsuchiya, “Theoretical andexperimental evidence for a new post-cotunnite phase of titanium dioxidewith significant optical absorption,” Phys. Rev. Lett. 107, 045701 (2011).

14D. Nishio-Hamane, A. Shimizu, R. Nakahira, K. Niwa, A. Sano-Furukawa,T. Okada, T. Yagi, and T. Kikegawa, “The stability and equation of statefor the cotunnite phase of TiO2 up to 70 GPa,” Phys. Chem. Miner. 37, 129(2010).

15M. Y. Manuputty, J. A. H. Dreyer, Y. Sheng, E. J. Bringley, M. L. Botero,J. Akroyd, and M. Kraft, “Polymorphism of nanocrystalline TiO2 preparedin a stagnation flame: Formation of the TiO2-II phase,” Chem. Sci. 10, 1342(2019).

16A. Reinhardt, “Phase behavior of empirical potentials of titanium dioxide,”J. Chem. Phys. 151, 064505 (2019).

17M. Matsui and M. Akaogi, “Molecular dynamics simulation of the structuraland physical properties of the four polymorphs of TiO2,” Mol. Simul. 6, 239(1991).

18M. D. Hanwell, D. E. Curtis, D. C. Lonie, T. Vandermeersch, E. Zurek,and G. R. Hutchison, “Avogadro: An advanced semantic chemical editor,visualization, and analysis platform,” J. Cheminformatics 4, 17 (2012).

19Z.-G. Mei, Y. Wang, S. Shang, and Z.-K. Liu, “First-principles study of themechanical properties and phase stability of TiO2,” Comput. Mater. Sci. 83,114 (2014).

20Z. Fu, Y. Liang, S. Wang, and Z. Zhong, “Structural phase transition andmechanical properties of TiO2 under high pressure,” Phys. Status Solidi B250, 2206 (2013).

21X. G. Ma, P. Liang, L. Miao, S. W. Bie, C. K. Zhang, L. Xu, and J. J.Jiang, “Pressure-induced phase transition and elastic properties of TiO2polymorphs,” Phys. Status Solidi B 246, 2132 (2009).

22T. Zhu and S.-P. Gao, “The stability, electronic structure, and optical propertyof TiO2 polymorphs,” J. Phys. Chem. C 118, 11385 (2014).

23F. Musil, M. J. Willatt, M. A. Langovoy, and M. Ceriotti, “Fast and accurateuncertainty estimation in chemical machine learning,” J. Chem. TheoryComput. 15, 906 (2019).