mathematical · 2019. 2. 12. · [ 1), and kolmorogov and fomin [ 1). the martingale and...

T R A N S L A T I O N S O F

MATHEMATICAL M O N O G R A P H S V O L U M E 4 7

M. B. Nevelson R. Z. Has'minskff

Stochastic Approximation and Recursive Estimation

fl| American Mathematical Society Providence, Rhode Island

10.1090/mmono/047

CTOXACTMMECKAH AnnPOKCMMAUMf l

M PEKYPPEKTHOE OUEHMBAHM E

M. B. HEBEJlbCOH , P . 3 . XACbMMHCKM W

M3AaTejibCTBo "MayKa " MocKB a 197 2

Trans lated fro m th e R u s s i a n b y

I srae l Progra m fo r S c i e n t i f i c T r a n s l a t i o n s

Trans la t ion e d i t e d b y B . Si lve r

2000 Mathematics Subject Classification. Primar y 62L20 ; Secondary 60J05 , 93E10 .

Abstract. Thi s boo k i s devote d t o sequentia l method s o f solvin g a

c lass o f problem s t o whic h belongs , fo r example , th e proble m o f findin g a

maximum poin t o f a functio n i f eac h measure d valu e o f thi s functio n con -

tains a rando m error . Som e basi c procedure s o f stochasti c approximatio n

are investigate d fro m a singl e poin t o f view , namel y th e theor y o f Marko v

processes an d martingales . Example s ar e considere d o f application s o f

the theorem s t o som e problem s o f estimatio n theory , educationa l theor y an d

control theory , an d als o t o som e problem s o f informatio n transmissio n i n

the presenc e o f invers e feedback .

Library of Congress Cataloging in Publication Data w ^m ay

Nevel'son, Mikhai l Borisovich . Stochast ic approximatio n an d recursiv e estimation .

(Translations o f mathematica l monograph s ; v . ^7 ) Translat ion o f Stokhasticheskai a approksimatfe£ 3

i rekurrentnoe oftenivanie. Bibliography: p. Includes index . 1. Stochast i c approximation . 2 . Estimatio n

theory. I . Kha s 'minskii, Rafai l Zalmanovich , jo in t author. H . T i t l e . I I I . Ser ies . QA27lf.NWl5 519.2 76-U8298 ISBN 0-8218-1597-0

© 197 6 b y th e America n Mathematica l Society . Al l right s reserved . Reprinted b y th e America n Mathematica l Society , 2000 .

The America n Mathematica l Societ y retain s al l right s except thos e grante d t o th e Unite d State s Government .

Printed i n th e Unite d State s o f America .

@ Th e pape r use d i n thi s boo k i s acid-fre e an d fall s withi n th e guideline s established t o ensur e permanenc e an d durability .

Visit th e AM S hom e pag e a t ht tp: / /www.ams.org /

10 9 8 7 6 5 4 3 0 4 0 3 0 2 0 1 0 0

TABLE OF CONTENTS Preface 1 Introduction 4 Chapter 1 . Element s of probability and martingales 1 1

1. Probabilit y 1 1 2. Rando m variables. 1 3 3. Conditiona l probabilities and conditional expectations 1 7 4. Independence . Produc t measures 2 0 5. Martingale s and supermartingales 2 2

Chapter 2. Discret e time Markov processes 2 9 1. Marko v processes 2 9 2. Marko v processes and supermartingales 3 2 3. Recursivel y define d processe s 3 4 4. Discret e model of diffusion . 3 7 5. Exi t of sample functions fro m a domain 3 8 6. Serie s of independent rando m variables 4 1 7. Convergenc e of sample functions 4 3 8. Teachin g pattern recognition 4 9

Chapter 3. Marko v processes and stochastic equations 5 3 1. Continuou s time Markov processes 5 3 2. Stochasti c differential equations . 1 5 7 3. Stochasti c integrals 5 9 4. Stochasti c differential equations . II 6 2 5. Ito' s formula 6 4 6. Supermartingale s 6 9 7. Existenc e of solutions in the large 7 0 8. Exi t from a domain. Convergenc e of sample functions 7 3

Chapter 4. Convergenc e of stochastic approximation procedures . 1 7 9 1. Th e Robbins-Monro procedure 7 9 2. Th e Kiefer-Wolfowitz procedur e 8 3 3. Continuou s procedures 8 6 4. Convergenc e of the Robbins-Monro procedure 8 8 5. Convergenc e of the Kiefer-Wolfowitz procedur e 9 4

ui

CONTENTS

Chapter 5. Convergenc e o f stochasti c approximatio n procedures . I I 10 1 1. Introductor y remark s 10 1 2. Genera l theorems 10 2 3. Auxiliar y result s (continuous time ) 10 6 4. Auxiliar y result s (discrete time ) 11 2 5. One-dimensiona l procedure s 11 8

Chapter 6. Asymptoti c normalit y o f the Robbins-Monr o procedure 12 3 1. Preliminar y remark s 12 3 2. Asymptoti c behavior of solution s 12 9 3. Investigatio n o f the process ^(r) . 13 2 4. Investigatio n o f th e process I2{f) 13 6 5. Asymptoti c normalit y (continuou s time) 14 0 6. Asymptoti c normalit y (discrete time ) 14 7 7. Convergenc e o f moments 15 4

Chapter 7 . Som e modifications o f stochasti c approximatio n procedure s 16 1 1. Statemen t o f th e proble m 16 1 2. Genera l theore m 16 2 3. Auxiliar y result s 16 5 4. Theorem s on convergence and asymptotic normality 16 7 5. Adaptiv e Robbins-Monr o procedure s 16 9 6. Asymptoti c optimalit y 17 5

Chapter 8 . Recursiv e estimatio n (discrete time ) 18 1 1. Th e Crame'r-Rao inequality. Efficienc y o f estimate s 18 1 2. Th e Cramer-Rao inequality i n the multidimensiona l cas e 18 6 3. Estimatio n of a one-dimensional paramete r 18 9 4. Asymptoticall y efficien t recursiv e procedure 19 4 5. Estimatio n of a multidimensional parameter 19 7 6. Estimatio n with dependent observation s 20 2

Chapter 9. Recursiv e estimation (continuous time ) 20 7 1. Th e Cramer-Ra o inequalit y 20 7 2. Applicatio n o f th e Robbins-Monro procedur e 21 0 3. Time-dependen t observation s 21 2 4. Som e applications 21 3 5. A modification 22 0

Chapter 10 . Recursiv e estimation with a control parameter 22 1 1. Statemen t o f th e problem 22 1 2. Asymptoticall y optima l recursiv e plan 22 3 3. Tw o examples... . 22 6 4. Continuou s case 22 9

Notes on the literatur e 23 3 Bibliography 23 7 Subject Index 24 2 Main Notation 24 4

IV

This page intentionally left blank

NOTES ON THE LITERATURE

Chapter 1

The proo f o f al l theorem s cite d i n § § 1 - 4 ma y b e foun d i n Kolmogoro v [ 1 ] , Halmos

[ 1), an d Kolmorogo v an d Fomi n [ 1). Th e martingal e an d supermartingale concept s ar e du e

to Doob . Theorem s S. l an d 5.l ' an d thei r proof s ar e take n fro m Doo b [ 1 ] .

Chapter 2

§ 1. Fo r mor e detail s abou t propertie s o f Marko v processes an d thei r transitio n func -

tions, see Felle r [ 1 ], Dynkin [ 1 ], Loev e [ 1J, an d others .

§2. Theore m 2. 1 i s essentiall y a special cas e o f Doob' s theore m (Doo b [1 ) ) o n th e

stopping o f a supermartingale. Kolomogorov' s inequalit y fo r supermartingale s ma y als o b e

found i n Doo b [ 1 ].

§3. Theorem s 3. 1 an d 3. 2 ar e wel l known . See , fo r example , Gihman an d Skoroho d

HI. §5. Fo r related results , see, fo r example , AB R [ 1 ] , Chapter IV .

§6. Theorem s 6. 1 an d 6. 2 wer e prove d b y Kolomogoro v [1 ] i n a more genera l setting .

§7. Theore m 7. 1 i s conceptuall y simila r t o Theore m 8 i n Chapte r 4 o f AB R [ 1 ] .

§ 8. Fo r a detailed discussio n o f th e question s relate d t o thi s formulatio n o f th e prob -

lem, see AB R [ 1 ], Chapter 5 .

Chapter 3

§ 1. Fo r mor e detail s abou t th e definitio n an d propertie s o f a continuous Marko v

process, see Dynki n [ 1 ].

§2. Th e limi t passag e fro m equatio n (2.1 ) t o a continuou s Marko v process was con -

sidered b y Bernstei n [ 1 ], who, among othe r things , prove d a theorem o n th e limi t behavio r

of th e transitio n probabilit y a s h -* 0 .

§§3 an d 4 . Thi s definitio n o f th e stochasti c integra l i s du e t o ItS . Fo r a proo f o f

Theorem 4.1 , see, fo r example , Gihma n an d Skorohod [ 2 ] .

§7. A proo f o f Theore m 7. 1 ma y b e foun d i n Has'minski T [ 1 ] .

§8. Simila r results ar e presente d in th e authors ' paper [ 1 ] .

Chapter 4

§ 1. A s mentione d i n th e text , s.a . procedure s fo r locatio n o f th e root s o f regressio n

equations wer e first suggeste d i n th e 19S 1 pape r o f Robbin s an d Monr o [ 1 ] , who prove d

convergence i n mea n squar e o f th e procedur e (S.l ) unde r certai n assumptions . Thi s resul t

233

234 NOTES O N TH E LITERATUR E

was generalize d b y Wolfowit z [ 1 ], Blu m [ 1J, Chun g [ 1 ], an d others . Blu m [ 1 ), i n particular ,

presented th e firs t proo f o f convergenc e wit h probabilit y 1 . Theore m 1. 1 an d th e metho d o f

proof use d i n th e tex t ar e du e t o Gladyse v [ 1 ]. Multidimensiona l R M procedure s wer e intro -

duced b y Blu m [ 2 ] . Blu m was th e firs t autho r t o appl y th e theor y o f supermartingale s t o

investigate th e convergenc e o f s.a . procedures .

§2. Th e procedure (2.5 ) wa s propose d b y Kiefe r an d Wolfowit z [1 ] i n 1952 . Blu m

[ 2 ] , Derma n [ 1 ] , Burkholde r [1 ] an d other s studie d th e procedur e i n detai l and , i n particular ,

proved tha t i t i s convergen t wit h probabilit y 1 . N o accoun t i s given her e o f mor e genera l s.a .

procedures suc h a s those propose d i n Burkholde r [ 1 ] , Dvoretzk y [ 1 ] , etc . Mor e detaile d

bibliographies ma y b e foun d i n th e revie w article s o f Dvoretzk y [ 1 ] , Fabia n [ 3 ] , Sakriso n

[ 3 ], Schmettere r [ 1J, Logino v [ 1 ], an d others .

§ 3. Th e continuou s analo g o f th e R M procedur e wa s introduced b y Drim l an d Nedom a

[ 1 ] . The y prove d th e convergenc e o f th e procedur e unde r fairl y genera l assumption s o n th e

stochastic processe s appearin g on th e righ t o f th e equation . Unde r certain assumptions ,

Sakrison [ 1 ] prove d th e convergenc e o f th e continuou s analog of th e Kiefer-Wolfowit z pro -

cedure. Th e continuou s procedure s considere d her e wer e define d i n Has'rninsk u [1 ] an d

Nevel'son an d Has'minsk u [ 1 ].

§4. Th e firs t convergenc e theorem s fo r multidimensiona l R M procedures wer e prove d

by Blu m [ 2 ] ; see als o ABR [ 1 ] , Schmetterer [1 ] an d furthe r bibliograph y cite d there . Theo -

rem 4. 4 i s similar t o th e above-mentione d resul t o f Gladyse v [ 1 ]; Theore m 4. 5 i s du e t o

Braverman an d Rozonoe r (se e ABR [ 1 ]). Th e formulatio n o f th e las t theore m raise s th e

question a s to whethe r on e ca n als o weaken th e conditio n Lfl 2(f) < « > in th e proo f tha t

X(t) - + XQ &J . I t turn s ou t tha t thi s i s indee d possible . Fo r example , i n som e case s i t i s

sufficient t o deman d tha t £o"(f ) < °° for som e n > 0 (althoug h on e mus t the n impos e mor e

stringent condition s o n th e "noise") . Th e theorem s prove d her e fo r continuou s R M proced -

ures ar e du e t o th e author s [ 1 ] .

§ 5. Convergenc e theorem s fo r discret e multidimensiona l K W procedures wer e prove d

by Blu m [ 2 ] , Dupa£ [1 ] an d others. Fo r th e continuou s case , see Nevel'so n an d Has'minsk u

[ 1 ] .

Chapter 5

The conjectur e tha t a K W procedure canno t converg e wit h positiv e probabilit y t o

minimum point s o f th e regressio n functio n wa s advance d b y Fabia n [ 1 J, [ 2 ] . Th e result s

presented her e ar e based o n Has'minsk u (1 ] an d Nevel'son [ 1 ] , [ 2 ] . Th e tw o last-mentione d

papers als o prov e mor e genera l theorems. Se e als o Krasulin a [ 1 ] .

Chapter 6

§1 . Theore m 1. 1 fo r th e one-dimensiona l cas e i s proved , e.g. , i n Loev e [ 1 ]; for th e

multidimensional case , see Sack s [ 1 ].

§2. Condition s fo r th e validit y o f th e estimat e EX 2{t) = 0 ( l / f ) a s t - * » , wher e X(f)

is a discret e one-dimensiona l R M process , were firs t obtaine d b y Chun g [ 1 ]. A n analo g o f

Lemma 2. 1 fo r discret e multidimensiona l R M processes was prove d b y Sack s [ 1 ] unde r th e

additional assumptio n E\G(t, x, t j ) | 2 < c < « . Th e fac t tha t (2.9 ) i s a consequence o f (2.8 )

follows fro m a well-known lemm a of Chun g [ 1 ] .

§§3 an d 4 . Lemm a 3. 1 wa s prove d b y Has'minski f [ 2 ] . Lemm a 4. 3 i s du e t o Sack s

NOTES O N TH E LITERATUR E 235

§5. Theore m 5. 3 fo r M > 0 i s prove d i n Has'minski T [ 2 ] . Th e asymptoti c normalit y

of a R M procedure wa s established , unde r certai n assumptions , b y Holev o [ 1 ].

§6. Theore m 6. 1 wa s prove d b y Sack s under th e followin g additiona l assumptions :

a) th e matri x B ma y b e reduced t o diagona l for m b y a similarity transformatio n involvin g

an orthogona l matrix ; b ) th e nor m o f th e matri x A(t, x) i s bounde d fo r t > 1 an d x € Ej.

Theorem 6. 3 i s new. Asymptoti c normalit y theorem s fo r othe r s.a . procedure s wer e prove d

by Fabia n [ 5 ] , Derman [ 1 ] , Burkhoide r [ 1 ] , DupaS [1 ] an d others . Th e "truncation "

method i n asymptoti c normalit y proof s fo r s.a . procedure s was firs t applie d b y Hodge s an d

Lehman [ 1 ] .

§ 7. Th e firs t convergenc e theorem s fo r moment s o f s.a . processe s wer e prove d b y

Chung [ 1 ] i n th e one-dimensiona l case . Fo r the multidimensiona l case , the resul t o f Theore m

7.2 i s wel l know n (see , fo r example , Schmettere r [ 1J) fo r X > *ta . Bu t i f Conditio n (a ) o f

§ 7 hold s with A < Via, the onl y resul t know n t o u s o n convergenc e o f moment s i s tha t o f

Sakrison [ 2 ] . Constructin g a certai n modificatio n o f th e R M procedure , Sakrison prove d

that th e matri x o f secon d moment s o f th e proces s *Ji(X{i) — XQ) converges t o th e correspon -

ding covarianc e matri x o f a normal law , provided al l moment s E|G(f , x, (J))P t p = 1 , 2 , . . . ,

are bounded .

Chapter 7

§ § 1 - 4 . Alber t an d Gardne r [1 ] establishe d convergenc e an d asymptoti c normalit y

for th e procedur e (1.1 ) an d a more genera l on e i n whic h th e facto r a{t) i s allowe d t o depen d

on pas t observations . Thes e author s als o considere d a multidimensional modificatio n o f (1.1) .

Recently, Cslb i [ 1 ] , [2 ] ha s prove d convergenc e o f truncate d procedure s fo r a broad rang e

of truncations .

§5. Ther e i s a considerable literatur e o n th e optima l choic e o f parameter s fo r R M

procedures an d thei r adaptiv e modifications . Amon g others , we mentio n Chun g [ 1 ],

Dvoretzky [ 1 ] , Keste n [ 1 ] , Cypkin [ 1 ] , [2],an d Stratonovic ' [ 2 ] . Ou r approac h i s base d

on th e wor k o f Vente r [ 1 ] , who prove d Theore m 5. 1 unde r slightly mor e restrictiv e assump -

tions.

§6. Theore m 6. 1 slightl y generalize s a result o f Vente r [ 1 ] ; Theorem 6. 2 i s new .

Chapter 8

§1. Theore m 1. 1 follow s fro m result s o f Chapma n an d Robbin s [1 ] an d Kaga n [ 1 ] .

Theorem 1. 2 wa s prove d b y Ibragimo v and Has'minski i [ 2 ] . Thes e paper s als o deriv e inequal -

ities fo r biase d estimates .

§2. Theore m 2. 1 an d it s analo g fo r biase d estimate s wer e prove d b y Ibragimo v an d

Has'minskii [ 2 ] .

§ 3. Th e us e o f R M procedure s fo r parametri c estimatio n i s discusse d i n Alber t an d

Gardner [ 1 ], wher e a larger clas s o f estimates , no t necessaril y formin g Marko v processes , i s

considered.

§§4 an d 5 . Theorem s 4. 1 an d 4. 5 wer e prove d b y Sakriso n [ 2 ] , [3 ] unde r stronge r

conditions (an d without th e convergenc e o f distributions) . Othe r optimalit y criteri a wer e

considered b y Alber t an d Gardne r [ 1 ] , Cypkin [ 1 ] , and Stratonovic ' [ 2 ] , but thei r procedure s

generally depen d o n pas t observation s (see , fo r example , th e procedur e (0.9 ) i n th e Intro -

duction).

236 NOTES ON THE LITERATURE

Chapter 9

§1. Fo r inequality (1.7) , see, for example, Van Trees [1]. §2. A procedure equivalent to (2.4) for m linear in the parameter (m(*0) = mxg ,

where m is a nonsingular matrix) was considered by Holevo [1] . H e also considered the case that m is not a square matrix but mm* is nonsingular. Fo r the nonlinear case, Holevo proposed a procedure more complicated than (2.4). Unde r certain assumptions, he established asymptotic normality o f the estimation procedure.

§3. Recursiv e procedures similar to (3.1) ma y be derived from the Bayesian approach, using linearization. Fo r linear systems, similar estimation procedures may be found in Kal-man and Bucy [1] , Lipcer and Sirjaev [1] , and elsewhere.

§4. Th e problem of estimating x0 base d on observations of type (4.7) (estimation o f linear regression coefficients) ha s been considered from other points of view in many papers.

§5. Wit h suitable initial conditions, the procedure (5.3) , (5.4) for a linear function m{t, x) = m{t)x coincide s with the Kalman-Buc y optimal linear filtering equations . I n other words, this system is satisfied, in particular, by the conditional expectation of XQ given Y(s), s < f , provide d the a priori distribution of XQ is Gaussian.

BIBLIOGRAPHY

M. A . AIZERMAN , fe. M. BRAVERMA N AN D L L ROZONOE R (ABR ) 1. The method of potential functions in machine learning theory, "Nauka" , Moscow ,

1970. (Russian ) ARTHUR E . ALBER T AN D LELAN D A . GARNDER , JR .

1. Stochastic approximation and nonlinear regression, MI T Press , Cambridge, Mass., 1967. M R 3 6 #7261 .

SERGE BERNSTEI N 1. Principes de la theorie des equations differentiates stochastiques, Trud y Fiz.-Mat .

Inst. Steklov . Otd . Mat . S (1934) , 95-124 . DAVID BLACKWEL L AN D M . A . GIRSHIC K

1. Theory of games and statistical decisions, Wiley , Ne w York ; Chapman an d Hall , London, 1954 . M R 16 , 1135 .

JULIUS R . BLU M 1. Approximation methods which converge with probability one, Ann . Math. Statist .

25 (1954) , 382-386 . M R 15 , 973 . 2. Multidimensional stochastic approximation methods, Ann . Math . Statist . 25 (1954) ,

737-744 . M R 16 , 382 . D. L. BURKHOLDER

1. On a class of stochastic approximation processes, Ann . Math . Statist . 27 (1956) , 1044-1059. M R 1 9 , 7 1 .

DOUGLAS G . CHAPMA N AN D HERBERT ROBBIN S 1. Minimum variance estimation without regularity assumptions, Ann . Math. Statist .

22 (1951) , 581-586 . M R 13 , 367 . K. L CHUN G

1. On a stochastic approximation method, Ann . Math. Statist . 2 5 (1954) , 463—483 . MR 16 , 272 .

HARALD CRAME R 1. Mathematical methods of statistics, Princeto n Univ . Press , Princeton , N . J. , 1946 .

MR 8 , 39 . S. CSIBI

1. Simple and compound processes in iterative machine learning, Lecture s delivere d a t the Internationa l Summe r School , Trieste , 1971 .

2. Learning optimal decision functions recursively, Secon d Internat . Sympos . Infor-mation Theor y (Tsahkadsor , 1971) , Abstract s o f papers , Izdat . Akad . Nauk SSSR , Moscow, 1971 , pp . 2 9 2 - 2 9 4. CC A 7 #12990 .

JA. Z. CYPKI N 1. Adaptation and learning in automatic systems, "Nauka" , Moscow , 1968 ; English

transl., Academic Press , New York , 1971 . 2. Foundations of the theory of learning systems, "Nauka" , Moscow , 1970 ; English

transl., Academi c Press , New York , 1973 . CYRUS DERMA N

1. An application of Chung's lemma to the Kiefer-Wolfowitz stochastic approximation procedure, Ann . Math. Statist . 2 7 (1956) , 532-536 . M R 17 , 1218 .

A. G. D'JACKOV AND M. S. PlNSKER 1. 77ie optimal linear method of transmission over a memoryless Gaussian stationary

channel with complete feedback, Problem y Peredac i Informaci i 7 (1971) , vyp. 2 , 3 8 - 4 6 = Problem s of Information Transmissio n 7 (1971), 123-129 . M R 46 #8741 .

237

238 BIBLIOGRAPHY

J. L. DOOB 1. Stochastic processes, Wiley , New York ; Chapma n an d Hall , London, 1953 . M R

15 ,445 . MlLOSLAV DRIM L AN D JlR I NEDOM A

1. Stochastic approximations for continuous random functions, Trans . Second Pragu e Conf. Information Theory , Publ . House Czechoslova k Acad . Sci. , Prague , 1960 ; Academic Press , New York , 1961 , pp . 145-158 . M R 2 3 #A4171 .

VACLAV DUPA C 1. On the Kiefer-Wolfowitz approximation method, Casopi s Pest . Mat. 82 (1957) , 4 7 -

75. (Czech ; Russian an d Englis h summaries ) M R 19 , 693 . 2. A dynamic stochastic approximation method, Ann . Math. Statist . 3 6 (1965) , 1 6 9 5 -

1702. M R 3 3 #1939 . ARYEH DVORETZK Y

1. On stochastic approximation, Proc . Thir d Berkele y Sympos . Math . Statist.and Prob -ability (1954-55) , vol . I , Univ . o f Californi a Press , Berkeley , Cal. , 1956 , pp . 3 9 -55. M R 18 , 946 .

E B . DYNKIN 1. Markov processes, Fizmatgiz , Moscow , 1963 ; English transl. , Springer-Verlag, Berlin ,

Academic Press , New York , 1965 . M R 3 3 #1886 , 1887 . VACLAV FABIA N

1. Stochastic approximation methods, Czechoslova k Math . J . 1 0 (85 ) (1960) , 123 — 159. M R 22 # 4 0 9 1 .

2. A new one-dimensional stochastic approximation method for finding a local mini-mum of a function, Trans . Third Pragu e Conf . Informatio n Theory , Statistica l Decision Functions , Rando m Processe s (Liblice , 1962) , Publ . House Czech . Acad . Sci., Prague , 1964 , pp . 85-105 . M R 2 9 #6569 .

3. A survey of deterministic and stochastic approximation methods for the minimi-zation of functions, Kybernetik a (Prague ) 1 (1965) , 499-523 . (Czech ; English summary) M R 3 2 #5378 .

4. On the choice of design in stochastic approximation methods, Ann . Math . Statist . 39 (1968) , 457-465 . M R 3 7 # 1 0 3 8 .

5. On asymptotic normality in stochastic approximation, Ann . Math. Statist 3 9 (1968) , 1327-1332. M R 3 7 #6984 .

A. A . FEL'DBAU M 1. Optimal control systems, 2n d rev . ed. , "Nauka" , Moscow , 1966 ; English transl . o f

1st ed. , Academi c Press , New York , 1965 . M R 3 5 #2656 ; 3 2 #9075 . WILLIAM FELLE R

1. An introduction to probability theory and its applications. Vol . II , Wiley, Ne w York, 1966 . M R 3 5 #1048 .

G. M . FIHTENGOL' C 1. A course of differential and integral calculus. Vol . II , 4th ed. , Fizmatgiz , Moscow ,

1959; Germa n transl. , VE B Deutsche r Verla g de r Wissenschaften , Berlin , 1966 . MR 3 9 # l b .

1.1. GlHMAN AND A. V. SKOROHOD 1. Introduction to the theory of random processes, "Nauka" , Moscow, 1965 ; English

transl., Saunders , Philadelphia , Pa. , 1969 . M R 3 3 #6689 ; 4 0 # 9 2 3 . 2. Stochastic differential equations, "Naukov a Dumka" , Kiev , 1968 ; English transl. ,

Springer-Verlag, Ne w York , 1972 . M R 4 1 #7777 ; 4 9 #11625 . E G . GLADYSE V

1. On stochastic approximation, Teor . Verojatnost . i Primenen . 1 0 (1965) , 297 -30 0 = Theor . Probabilit y Appl . 1 0 (1965) , 275-278 . M R 3 2 #3184 .

PAUL R . HALMO S 1. Measure theory, Va n Nostrand , Princeton , N . J. , 1950 . M R 11 , 504 .

BIBLIOGRAPHY 239

R.Z.HAS'MINSIUI 1. Stability of systems of differential equations under random perturbations of their

parameters, "Nauka" , Moscow , 1969 . (Russian ) M R 41 #3925 . 2. The behavior of the processes of stochastic approximation for large time-variables,

Problemy Peredac i Informaci i 8 (1972) , vyp. 1 , 81-91 = Problem s o f Informatio n Transmission 8 (1972) , 6 2 - 7 0 . M R 4 8 # 1 3 2 1 .

J. L . HODGES , JR., AND E . L . LEHMAN N 1. Two approximations to the Robbins-Monro process, Proc . Third Berkele y Sympos .

Math. Statist , an d Probabilit y (1954-1955) , vol . I , Univ . of Californi a Press , Berke -ley, Ca!. , 1956 , pp . 95 -104 . M R 18 , 947 .

A. S. HOLEVO 1. Estimates for the drift parameters for a diffusion process by the method of sto-

chastic approximation, Studie s i n th e Theor y o f Self-adjustin g System s (V. G . Sragovic, editor) , Vycisl . Centr Akad . Nau k SSSR , Moscow , 1967 , pp . 179-200 . (Russian) M R 3 8 #1800 .

I. A . IBRAGIMO V AN D R . Z . HAS'MINSK H 1. The asymptotic behavior of certain statistical estimates in the smooth case. I : In-

vestigation of the likelihood ratio, Teor . Verojatnost . i Primenen . 1 7 (1972) , 4 6 9 -486 = Theor . Probabilit y Appl . 1 7 (1972) , 445-462 . M R 4 6 #10106 .

2. Information inequalities and superefficient methods, Dokl . Akad . Nau k SSSR 20 4 (1972), 1300-130 2 = Sovie t Math . Dokl . 1 3 (1972) , 821-824 . M R 4 7 # 2 7 3 3 .

3. On the approximation of statistical estimators by sums of independent variables, Dokl. Akad . Nau k SSS R 21 0 (1973) , 1273-127 6 = Sovie t Math . Dokl . 1 4 (1973) , 883-887 . M R 4 9 #10020 .

KlYOSI IT O 1. On a formula concerning stochastic differentials, Nagoy a Math . J . 3 (1951) , 55 -65 .

MR 13 , 363 . KlYOSI IT O AN D MAKIK O NlSI O

1. On stationary solutions of a stochastic differential equation, J . Math . Kyot o Univ . 4 (1964) , 1-75 . M R 3 1 # 1719 .

A.M.KAGAN 1. On the theory of Fisher's information quantity, Dokl . Akad. Nau k SSS R 15 1

(1963), 277-27 8 = Sovie t Math . Dokl . 4 (1963) , 991-993 . M R 2 7 #2032 . R. E . KALMA N AN D R . S . BUC Y

1. New results in linear filtering and prediction theory, Trans . ASME Ser . D . J . Basi c Engrg. 83 (1961) , 9 5 - 1 0 8 . M R 3 8 #3076 .

HARRY KESTE N 1. Accelerated stochastic approximation, Ann . Math . Statist . 29 (1958) , 4 1 - 5 9 . M R

20 # 3 7 1 . J. KlEFE R AN D J . WOLFOWIT Z

1. Stochastic estimation of the maximum of a regression function, Ann . Math . Statist . 23 (1952) , 462-466 . M R 14 , 299 ; erratum, M R 14 , 1278 .

A. N . KOLMOGORO V 1. Grundbegriffe der Wahrscheinlichkeitsrechnung, Springer-Verlag , Berlin, 1933 ; Eng-

lish transl. , Chelsea, Ne w York , 1950 ; 2nd ed. , 1956 . M R 11 , 374 ; 18 , 155 . A. N . KOLMOGOROV AN D S . V. FOMI N

1. Elements of the theory of functions and of functional analysis, 2n d rev . ed. , "Nauka", Moscow , 1968 ; English transl . o f 1s t ed. , vols . 1 , 2 , Grayloc k Press , Al -bany, N . Y. , 1957 , 1961 . M R 3 8 #2559 ; 19 , 44; 22 #9565a .

N. N . KRASOVSK U 1. Certain problems in the theory of stability of motion, Fizznatgiz , Moscow , 1959 ;

English transl. , Stability of motion. Application of Lyapunov fs second method to differential systems and equations with delay, Stanfor d Univ . Press , Stanford, Cal. , 1963. M R 2 1 #5047 ; 2 6 # 5 2 5 8 .

240 BIBLIOGRAPHY

T. P . KRASULIN A 1. The Robbins-Monro process in the case of several roots, Teor . Verojatnost . i

Primenen. 1 2 (1967) , 386-39 0 = Theor . Probabilit y Appl . 1 2 (1967) , 333-337 . MR 3 5 # 5 0 8 5 .

L M . LiBKIND 1. On the probability of large deviations for a quadratic functional of a stationary

Gaussian sequence, Problem y Peredac i Informaci i 5 (1969) , vyp . 2 , 7 2 - 7 8 = Prob -lems o f Informatio n Transmissio n 5 (1969) , no. 2 , 56—60 .

R. S. LlPCER AND A. N. SlRJAEV 1. Nonlinear filtering of Markov diffusion processes, Trud y Mat . Inst. Steklov . 10 4

(1968), 135-18 0 = Proc . Steklov Inst . Math . 10 4 (1968) , 163-21 8 (1971) . M R 41 #6342 .

2. The absolute continuity with respect to Wiener measure of measures that corre-spond to processes of diffusion type, Izv . Akad . Nau k SSS R 3 6 (1972) , 847-88 9 = Math . USS R Izv . 6 (1972) , 839-882 . M R 4 7 # 1 1 1 9 .

MICHEL LOEV E 1. Probability theory, 2n d ed. , Va n Nostrand , Princeton , N . J. , 1960 . M R 2 3 #A670 .

N . V . LOGINO V 1. Methods of stochastic approximation, Avtomat . i Telemeh . 2 7 (1966) , no . 4 , 1 8 5 -

204 = Automat . Remot e Contro l 2 7 (1966) , 706 -728 . I. G. MALKIN

1. Theory of stability of motion, 2n d rev . ed., "Nauka" , Moscow , 1966 ; Englis h transl. o f 1s t ed. , AE C Transl . #3352 , U . S . Atomic Energ y Commission , Oa k Ridge, Tenn. , 1958 . M R 3 4 #6228 ; erratum, 43 , p . 1695 .

M. B . NEVEL'SO N 1. Certain properties of continuous procedures of stochastic approximation, Teor .

Verojatnost. i Primenen . 1 7 (1972) , 310-31 9 = Theor . Probabilit y Appl . 1 7 (1972) , 298-308 . M R 4 5 #6124 .

2. The convergence of continuous and discrete Robbins-Monro procedures in the case of several roots of the regression equation, Problem y Peredac i Informaci i 8 (1972) , vyp. 3 , 48—57 = Problem s o f Informatio n Transmissio n 8 (1972) , 215—223. M R 48 #1320 .

M B . NEVEL'SO N AN D R . Z . HAS'MINSK H 1. Continuous procedures of stochastic approximation, Problem y Peredac i Informaci i

7 (1971) , vyp. 2 , 58-6 9 = Problem s o f Informatio n Transmissio n 7 (1971) , 1 3 9 -148. M R 4 8 #1319 .

2. On the convergence of the moments of the Robbins-Monro procedure, Avtomat . i Telemeh. 1973 , no. 1 , 96 -10 0 = Automat . Remot e Contro l 3 4 (1973) , 8 5 - 9 0 .

M. S. PlNSKER 1. The probability of error in block transmission in a memoryless Gaussian channel

with feedback, Problem y Peredac i Informaci i 4 (1968) , vyp. 4 , 3—1 9 = Problem s of Informatio n Transmissio n 4 (1968) , no. 4 , 1—14 .

C. RADHAKRISHN A RA O 1. Linear statistical inference and its applications, Wiley , New York , 1965 . M R

36 #4668 . HERBERT ROBBIN S AN D SUTTO N MONR O

1. A stochastic approximation method, Ann . Math . Statist . 2 2 (1951) , 400—407 . MR 13 , 144 .

JEROME SACK S 1. Asymptotic distribution of stochastic approximation procedures, Ann . Math. Statist .

29 (1958) , 373-405 . M R 2 0 #4886 . DAVID J . SAKRISO N

1. A continuous Kiefer-Wolfowitz procedure for random processes, Ann . Math. Statist . 35 (1964) , 590-599 . M R 2 8 #4650 .

BIBLIOGRAPHY 241

2. Efficient recursive estimation; application to estimating the parameters of a co-variance function, Internat . J . Engrg . Sci . 3 (1965) , 461-483. M R 3 1 #6306 .

3. Stochastic approximation: A recursive method for solving regression problems, Advances i n Communication Systems : Theor y an d Application s (A . V . Balakrishnan , editor), Vol . 2 , Academic Press , New York , 1966 , pp . 51-106 .

J. PlETE R M SCHALKWIJ K 1. Recent developments in feedback communication, Proc . IEE E 5 7 (1969) , 1242 —

1249. J. PlETE R M . SCHALKWIJ K AN D T . KAILAT H

1. A coding scheme for additive noise channels with feedback. I : No bandwidth con-straint, IEE E Trans . Informatio n Theor y IT-1 2 (1966) , 172-182 . M R 4 0 # 3 9 9 1 .

L SCHMETTERE R 1. Multidimensional stochastic approximation, Multivariat e Analysis , I I (Proc . Secon d

Internat. Sympos. , Dayton , Ohio , 1968 ) (P . R . Krishnaian , editor) , Academi c Press , New York , 1969 , pp . 443-460 . M R 4 1 #2844 .

C. E . SHANNO N 1. A mathematical theory of communication, Bel l Syste m Tech . J . 27 , (1948) , 3 9 7 -

423, 623-656 . M R 10 , 133 . A. V . SKOROHO D

1. Studies in the theory of random processes, Izdat . Kiev . Univ . Kiev , 1961 ; English transl., Addison-Wesley , Reading , Mass. , 1965 . M R 3 2 #3082a,b .

R. L STRATONOVI C 1. Conditional Markov processes and their application to the theory of optimal control,

Izdat. Moskov . Univ. , Moscow, 1966 ; English transl. , Amer . Elsevier , Ne w York , 1968. M R 3 3 # 5 3 9 1 ; 3 6 #4912 .

2. Does there exist a theory of synthesis of optimal adaptive, self-learning, and self-adaptive systems'*. Automat , i Telemeh . 1968 , no . 1 , 96-107 = Automat . Remot e Control 2 9 (1968) , 83 -92 . M R 3 9 #5210 .

HARRY L VA N TREE S 1. Detection, estimation and modulation theory. Par t I , Wiley, New York , 1968 .

J. H . VENTE R 1. An extension of the Robbins-Monro procedure , Ann . Math . Statist . 3 8 (1967) ,

181-190 . M R 3 4 #5225 . M. T . WASA N

1. Stochastic approximation, Cambridg e Univ . Press , New York , 1969 . M R 4 0 # 9 7 5 . J. WOLFOWIT Z

1. On the stochastic approximation method of Robbins and Monro, Ann . Math . Statist . 23 (1952) , 4 5 7 - 4 6 1 . M R 14 , 299 .

JOHN M. WOZENCRAF T AN D IRWI N MARK JACOB S 1. Principles of communication engineering, Wiley , Ne w York , 1965 .

K. S . ZlGANGIRO V 1. Transmission of messages in a binary Gaussian channel with feedback, Problem y

Peredaci Informaci i 3 (1967) , vyp. 2 , 98—10 1 = Problem s o f Informatio n Trans -mission 3 (1967) , no . 2 , 7 4 - 7 8 .

SUBJECT INDE X

Admissible control , 22 1 asymptotically optimal , 23 0

Asymptotic efficiency, 18 6 normality, 12 3 variance, 18 5

Borel set , 1 2 Chapman-Kolmogorov relation , 3 0 Condition (A) , 3 6

(B), 7 2 (a), 15 5

Control parameter , 22 1 Convergence

almost sure , 1 5 in distribution , 1 5 in probability , 1 5

Diffusion coefficient, 3 8 matrix, 6 7

Distribution o f a vector, 1 4 Drift

coefficient, 3 8 vector, 6 7

Estimate, 18 1 asymptotically efficient , 186 , 21 0

strongly, 185 , 188 , 20 9 weakly, 186 , 18 8

asymptotically unbiased , 18 2 efficient, 185 , 18 8 maximum likelihood , 5 recursive, 190 , 21 0 strongly consistent , 18 2 unbiased, 18 2 weakly consistent , 18 2

Expectation, 1 5 conditional, 17 , 1 8

relative t o a random vector , 1 9 finite, 1 5

Fisher information , 18 2 matrix, 18 6

Function Borel, 1 3 characteristic (o f a set), 1 7 distribution, 1 4 Ljapunov, 8 9 measurable, 1 3

Gaussian whit e noise , 6 3 Generating operator , 3 1

differential, 6 7 Independent

events, 2 0 random variable an d a-algebra , 2 0 sets o f rando m vectors , 2 0 0-algebras, 20 vectors, 2 0

Inequality Cebysev's, 1 5 CrameV-Rao (information) , 18 2 Kolmogorov's (fo r supermartingales) , 33 -3 4

Information content, 6 , 18 2 Fisher, 18 2 inequality, 18 2 matrix (Fisher) , 18 6

Integral, Lebesgue , 1 4 Itd's formula fo r stochasti c differentials , 66—6 7 Kiefer-Wolfowitz procedure , 83ff . Lebesgue integral , 1 4 Lemma

Fatou's, 1 6 Ljapunov's, 13 3

Martingale, 22 , 23 , 6 9 Measurable space , 1 2 Measure, 1 2

absolutely continuous , 1 6 complete, 1 2 finite, 12 Lebesgue, 1 2 probability, 1 3 a-finlte, 1 2

Observation process , 20 7 Plan o f experiment , 22 1

asymptotically optimal , 22 3 Probability, conditional , 17 , 1 8

relative t o a random vector , 1 9 Probability space , 1 3 Product

of measures , 2 1 of spaces , 2 0

Random variable, 1 3 independent o f th e future , 6 1 vector-valued, 1 3

242

SUBJECT INDE X 243

Recursive procedure , 19 0 truncated, 16 2

Regularity condition , 7 1 Regularization, 3 1 Robbins-Monro procedure , 4 , 79ff . Sample functio n o f a stochastic process , Separating hype r plane, 5 0 Set o f elementar y events , 1 2 o-algebras, 1 2

Borel, 1 2 generated b y a random vector , 1 9 imbedding o f on e i n another , 1 2 intersection of , 1 2 minimal, 1 2

Signalling rate , 21 6 Space o f elementar y events , 11 , 1 2 Stable

matrix, 13 2 point, 10 6

Standard Wiene r process , 5 5 with value s i n £ j , 5 6

Statistic, 18 1 Stochastic

approximation procedure s 4 , 79ff . Kiefer-Wolfowitz, 83ff . Robbins-Monro, 4 , 79ff .

differential (Ito) , 6 3 equivalence, 1 4

of stochasti c processes , 5 4 strong, 6 2

integral (Ito's) , 6 0 22, 5 3 process , 5 3

discrete, 2 2 Markov, 30 , 5 4

homogeneous, 5 4 measurable, 5 3 regular, 7 1 separable, 5 3 stationary (i n th e narro w sense) , 14 4

Supermartingale, 23 , 6 9 Theorem

Fubini's, 2 1 Ito's, 6 6 Kolmogorov's, 5 6 Lebesgue's, 1 6 Radon-Nikodym, 1 6

Transition densit y o f a Markov process , 5 4 Transition function , 30 , 5 4

homogeneous, 3 1 temporally homogeneous , 5 4

Transition probabilit y o f a Markov process , 2 9

MAIN NOTATIO N

11*11 = yfcijbff- nor m of matri x B = (tyy)) , p . 12 5

C2: clas s o f function s V(t, x) continuousl y differentiabl e wit h respec t t o t an d twic e con -

tinuously differentiabl e wit h respec t t o x, p . 5 7

C2' se t o f function s V(x) € C 2 wit h bounde d second-orde r partia l derivatives , p . 9 2

Dr: domai n o f definitio n o f generatin g operator , p . 3 1

D£ ' x ' : domai n o f definitio n o f generatin g operato r L a t poin t (t, x), p . 3 1

Eji euclidea n /-space , p . 1 2

7;: monoton e famil y o f a*lgebras , pp . 36 , 5 9

/(x): Fishe r informatio n matrix , pp . 182 , 18 6

/ : identit y matrix , p . 12 4

L: generatin g operator , pp . 31 , 6 7

1*2fo b]: se t o f measurabl e rando m function s /(f , u>) , t € [a, b], a ; e ft, ^-measurabl e fo r

every f , suc h tha t / £ / (f , w)d f < « • with probabilit y 1 , p . 5 9

)lf: a-algebr a o f event s generate d b y rando m variable s X(u), u < t, pp . 30 , 7 3

U: closur e o f se t U, p . 16 2

U€(B): e-neighborhoo d o f se t B , p . 3 9

U€.RW = K €<*> n <* : W < * * ' P - 4 0

K€(B) = EJ\U €{B), p . 3 9

1: domai n o f value s o f unknow n paramete r x, p . 18 1

X(t) ~9l(m , 5) : asymptoti c normalit y o f X(t) 9 p . 12 3

*W = <y x y w ) ,P-202 2: domai n o f value s o f contro l paramete r z , p . 22 1 Vg/Cx): vector wit h coordinate s \f(x + ce/ ) - / ( x — ce/)]/2c, p . 8 6

dv/bx: matri x wit h element s dty/dxy , p . 10 7

r^: firs t exi t tim e fro m domai n G , pp . 32 , 7 3

4>(B): clas s o f function s define d o n p . 4 0

244

mathematical · 2019. 2. 12. · [ 1), and kolmorogov and fomin [ 1). the martingale and...

Documents