Flow navigation by smart particles via Reinforcement Learning. Luca Biferale, Dept. Physics, INFN & CAST, University of Rome ‘Tor Vergata’. PIML2018, SANTA FE. CREDITS: SIMONA COLABRESE (TOR VERGATA UNIV., ROME, IT); ANTONIO CELANI (ICTP, TRIESTE, IT); KRISTIAN GUSTAVSSON (GOTHENBURG UNIV., SWEDEN)

Post on 15-Jul-2020

Page 1: Luca Biferale, Dept. Physics, INFN & CAST, University of Rome ...cnls.lanl.gov/external/piml/Luca Biferale.pdf

Flow navigation by smart particles via Reinforcement Learning

Luca Biferale
Dept. Physics, INFN & CAST
University of Rome ‘Tor Vergata’
PIML2018, SANTA FE

CREDITS: SIMONA COLABRESE (TOR VERGATA UNIV., ROME, IT); ANTONIO CELANI (ICTP, TRIESTE, IT); KRISTIAN GUSTAVSSON (GOTHENBURG UNIV., SWEDEN)

Page 2:

- PARTICLES IN COMPLEX FLOWS I: SMART INERTIAL PARTICLES

- PARTICLES IN COMPLEX FLOWS II: SMART MICROSWIMMERS

- Flow navigation by smart microswimmers via reinforcement learning

S Colabrese, K Gustavsson, A Celani, L Biferale, Physical Review Letters 118 (15), 158004, 2017

- Smart Inertial Particles

S Colabrese, K Gustavsson, A Celani, L Biferale, arXiv preprint arXiv:1711.05853, 2017

- Finding efficient swimming strategies in a three-dimensional chaotic flow by reinforcement learning

K Gustavsson, L Biferale, A Celani, S Colabrese, The European Physical Journal E 40 (12), 110, 2017

Page 3:

PARTICLES IN COMPLEX FLOWS I: INERTIAL PARTICLES

Drag: Stokes time
β < 1: heavy particles
β > 1: light particles

Preferential concentration!

Light (heavy) particles accumulate inside (outside) highly vortical regions

Maxey, J. Fluid Mech. 174, 441 (1987); Falkovich et al., Phys. Rev. Lett. 86, 2790 (2001)

[Figure panels: heavy, light]
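The statement that light (heavy) particles move into (out of) vortical regions can be illustrated with a small numerical sketch. The equation of motion is not legible in this transcript, so the code assumes the commonly used simplified model dV/dt = β Du/Dt - (V - u)/τ with β = 3ρ_f/(ρ_f + 2ρ_p); the rigid-rotation flow, the parameter values, and the function names are all illustrative choices, not taken from the talk.

```python
# Minimal sketch under an ASSUMED model (not read from the slides):
#   dV/dt = beta * Du/Dt - (V - u) / tau
# in a rigid-rotation flow u = i * omega * z, with 2D positions stored
# as complex numbers z = x + i*y.


def beta_from_densities(rho_particle: float, rho_fluid: float) -> float:
    """Density parameter beta = 3*rho_f / (rho_f + 2*rho_p) (assumed definition)."""
    return 3.0 * rho_fluid / (rho_fluid + 2.0 * rho_particle)


def final_radius(beta: float, tau: float = 0.1, omega: float = 1.0,
                 dt: float = 1e-3, steps: int = 5000) -> float:
    """Integrate one particle with explicit Euler steps.

    The particle starts at radius 1 with the local fluid velocity.
    Returns its final distance from the vortex centre.
    """
    z = 1.0 + 0.0j
    v = 1j * omega * z
    for _ in range(steps):
        u = 1j * omega * z        # fluid velocity at the particle position
        dudt = -omega**2 * z      # fluid acceleration Du/Dt (centripetal)
        acc = beta * dudt - (v - u) / tau
        z += dt * v
        v += dt * acc
    return abs(z)


heavy = beta_from_densities(rho_particle=1000.0, rho_fluid=1.0)  # beta << 1
light = beta_from_densities(rho_particle=0.001, rho_fluid=1.0)   # beta close to 3

r_heavy = final_radius(heavy)
r_light = final_radius(light)
print(f"beta = {heavy:.4f}: final radius {r_heavy:.2f}")
print(f"beta = {light:.4f}: final radius {r_light:.2f}")
```

In this toy flow the heavy particle drifts away from the vortex centre and the light one spirals towards it, which is the mechanism behind preferential concentration.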

Page 4:

[Figure panels: tracer, bubble, heavy]

Particle trapping in three-dimensional fully developed turbulence, L.B., G Boffetta, A Celani, A Lanotte, F Toschi, Physics of Fluids 17 (2), 021701

Page 5:

DESERT STORMS

RAIN INITIATION

PESTICIDE SPREADING

DIESEL ENGINE INJECTIONS

TURBOMACHINES

Coherent structures and extreme events in rotating multiphase turbulent flows, L.B., F Bonaccorso, IM Mazzitelli, MAT van Hinsberg, AS Lanotte, ... Physical Review X 6 (4), 041036 (2016)

Lagrangian properties of particles in turbulence, F Toschi, E Bodenschatz, Annual Review of Fluid Mechanics 41, 375-404 (2009)

Page 6:

$N_a$ actions (densities)

$R = \Omega^3$

OBSERVATION: DISCRETIZED VORTICITY LEVELS

$\pi_n : s_i \to a_j$

$\pi_n \to \pi_{n+1} \to \cdots \to \pi_{\mathrm{opt}}$

TARGET
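The map from a discretized vorticity observation to one of the $N_a$ density actions can be sketched as a lookup table. The thresholds, the three β values standing in for the densities, and the policy entries below are hypothetical placeholders chosen for illustration; the talk does not list them.

```python
from bisect import bisect_right

# HYPOTHETICAL thresholds splitting the vorticity magnitude into 3 levels.
VORTICITY_EDGES = [0.5, 1.5]

# HYPOTHETICAL action set: N_a = 3 density parameters the particle can select.
ACTIONS = [0.0, 1.0, 3.0]


def observe_state(vorticity_magnitude: float) -> int:
    """Return the index i of the discretized vorticity level s_i."""
    return bisect_right(VORTICITY_EDGES, vorticity_magnitude)


def act(vorticity_magnitude: float, policy: list) -> float:
    """Apply the policy pi: s_i -> a_j and return the selected density parameter."""
    state = observe_state(vorticity_magnitude)
    return ACTIONS[policy[state]]


# Placeholder policy: policy[i] is the action index chosen in state s_i.
policy = [0, 1, 2]

for w in (0.2, 1.0, 2.0):
    print(f"vorticity {w}: state s{observe_state(w) + 1}, action {act(w, policy)}")
```

Training then amounts to replacing the placeholder entries of `policy` step by step until they stop changing.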

Page 7:

Page 8:

TRAINING: Q-LEARNING ALGORITHM

QUALITY MATRIX AT STEP n → $Q_n(s_i, a_j)$

EXPECTED DISCOUNTED FUTURE RETURN IF ACTION $a_j$ is taken after observation of state $s_i$:

$$Q_n(s_i, a_j) = R_n + \gamma R_{n+1} + \gamma^2 R_{n+2} + \gamma^3 R_{n+3} + \cdots = \sum_{t=n}^{\infty} \gamma^{t-n} R_t$$

MYOPIC → $\gamma = 0$; FAR-SIGHTED → $\gamma = 1$

GREEDY POLICY AT STEP n:

$$\pi_n : a = \arg\max_{a'} Q_n(s, a')$$

        a1     a2     a3
s1     1.2    0.3    0.1
s2     2.2    4.3   10.1
s3     2.0    8.1    2.0

$\pi_n$: s1 → a1, s2 → a3, s3 → a2
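As a small worked example, the greedy policy can be read off the quality matrix by taking the largest entry in each state's row, and the discounted return can be summed directly. The matrix entries are the ones shown on this slide; the function names and the reward sequence are illustrative.

```python
# Quality matrix from the slide: rows are states s1..s3, columns are actions a1..a3.
Q = [
    [1.2, 0.3, 0.1],   # s1
    [2.2, 4.3, 10.1],  # s2
    [2.0, 8.1, 2.0],   # s3
]


def greedy_policy(q):
    """For each state, return the index of the action with the largest Q value."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


def discounted_return(rewards, gamma):
    """Sum gamma**k * R_{n+k} over a finite list of future rewards."""
    total = 0.0
    for k, reward in enumerate(rewards):
        total += gamma**k * reward
    return total


policy = greedy_policy(Q)
for i, j in enumerate(policy, start=1):
    print(f"s{i} -> a{j + 1}")

rewards = [1.0, 2.0, 3.0]  # illustrative reward sequence
print("myopic (gamma = 0):     ", discounted_return(rewards, 0.0))
print("far-sighted (gamma = 1):", discounted_return(rewards, 1.0))
```

With γ = 0 only the immediate reward contributes, while with γ = 1 all future rewards count equally.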

Page 9:

$S$

$R'$

$Q(s, a)$

$$Q(s, a) \leftarrow Q(s, a) + \alpha \left[ R' + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$$

OLD OBSERVATION: $s$; NEW OBSERVATION: $s'$

$\pi_n \to \pi_{n+1}$ ?
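The update rule on this slide translates almost line by line into code. This is a minimal sketch with a tabular Q stored as a list of rows; the function name `q_update` and the numbers in the example are illustrative.

```python
def q_update(q, s, a, reward, s_next, alpha, gamma):
    """One Q-learning step: Q(s,a) <- Q(s,a) + alpha*[R' + gamma*max_a' Q(s',a') - Q(s,a)].

    q is a list of rows (one row per state), modified in place.
    Returns the updated value of Q(s, a).
    """
    target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])
    return q[s][a]


# Illustrative two-state, two-action table.
Q = [
    [0.0, 0.0],  # old observation s
    [1.0, 2.0],  # new observation s'
]
new_value = q_update(Q, s=0, a=1, reward=1.0, s_next=1, alpha=0.5, gamma=0.9)
print(new_value)  # approximately 1.4
```

Here the target is 1.0 + 0.9 × 2.0 = 2.8, and with α = 0.5 the entry moves halfway from 0 towards it. The policy changes only if this update alters which action has the largest value in the row of $s$.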

Page 10:

CHANGING RADIUS bn

Page 11:

TRAINING

NO EXPLORATION

Page 12:

TRAINING + EXPLORATION (ε-greedy)
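The difference between training without exploration and ε-greedy training comes down to how the action is chosen at each step. A minimal sketch, assuming a tabular row of Q values for the current state; the function name and the optional `rng` argument are illustrative.

```python
import random


def epsilon_greedy(q_row, epsilon, rng=random):
    """Pick an action index for one state.

    With probability epsilon choose a uniformly random action (exploration);
    otherwise choose the action with the largest Q value (greedy).
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return max(range(len(q_row)), key=q_row.__getitem__)


q_row = [2.2, 4.3, 10.1]
rng = random.Random(0)  # seeded generator so repeated runs give the same draws

greedy_action = epsilon_greedy(q_row, epsilon=0.0, rng=rng)
explored = [epsilon_greedy(q_row, epsilon=1.0, rng=rng) for _ in range(10)]
print("greedy action index:", greedy_action)
print("fully random draws: ", explored)
```

Setting `epsilon=0.0` recovers the no-exploration case; a small positive value lets the particle occasionally try actions that the current quality matrix rates poorly.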

Page 13:

OPTIMAL ACTIONS

EXAM

Page 14:

TIME-DEPENDENT FLOW

Page 15:
Page 16:
Page 17:
Page 18:
Page 19:
Page 20:

Clustering and turbophoresis in a shear flow without walls, F De Lillo, M Cencini, S Musacchio, G Boffetta, Physics of Fluids 28 (3), 035104 (2016)

Page 21:
Page 22:

$k_a = z \uparrow$

Page 23:
Page 24:

OPTIMAL STRATEGY

Page 25:
Page 26:
Page 27:

Reddy, G., Celani, A., Sejnowski, T. J., & Vergassola, M. (2016). Learning to soar in turbulent environments. Proceedings of the National Academy of Sciences, 201606075.

Page 28:
Page 29:

- Flow navigation by smart microswimmers via reinforcement learning

S Colabrese, K Gustavsson, A Celani, L Biferale, Physical Review Letters 118 (15), 158004

- Smart Inertial Particles

S Colabrese, K Gustavsson, A Celani, L Biferale, arXiv preprint arXiv:1711.05853

- Finding efficient swimming strategies in a three-dimensional chaotic flow by reinforcement learning

K Gustavsson, L Biferale, A Celani, S Colabrese, The European Physical Journal E 40 (12), 110

CREDITS: SIMONA COLABRESE (TOR VERGATA UNIV., ROME, IT); ANTONIO CELANI (ICTP, TRIESTE, IT); KRISTIAN GUSTAVSSON (GOTHENBURG UNIV., SWEDEN)