parallel library for study of astroparticles plisa: a...

19
Parallel Library for Identification and Study of Astroparticles pLISA: a parallel Library for Identification and Study of Astroparticles and its application to KM3NeT Cristiano Bozza - University of Salerno and INFN “Gruppo collegato di Salerno” for C.B.*, C. De Sio**, R. Coniglione *** The new Era of Multimessenger Astrophysics Groningen, March 28th 2019 * University of Salerno and INFN “Gruppo collegato di Salerno” ** Formerly at University of Salerno and INFN “Gruppo collegato di Salerno” *** INFN Laboratori Nazionali del Sud - Catania

Upload: others

Post on 06-Feb-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

pLISA: a parallel Library for Identification and Study of Astroparticles and its

application to KM3NeTCristiano Bozza - University of Salerno and INFN “Gruppo collegato di Salerno”

for C.B.*, C. De Sio**, R. Coniglione ***The new Era of Multimessenger Astrophysics

Groningen, March 28th 2019* University of Salerno and INFN “Gruppo collegato di Salerno”** Formerly at University of Salerno and INFN “Gruppo collegato di Salerno”*** INFN Laboratori Nazionali del Sud - Catania

Page 2: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

Motivations for pLISA• Leverage parallel computing based on GPUs

• Primary particle reconstruction for event-based detectors

• Set up “continuously training farms”

2Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 3: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

pLISA structure• Set of Python scripts (conveniently used in Jupyter notebooks)

• Scikit Learn (data managed by image handling techniques)

• TensorFlow(differentiable programming and machine learning)

• Keras (high-level neural network software)

3Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 4: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

pLISA concepts• Detector representation

• Suitable for Convolutional Neural Networks (CNNs)• Must be mappable to a rectangular grid in 1,2,3,4,5,... dimensions

• CNN stack does not support more than 4 dimensions as of today (hope to increase in the future)

• Piecewise geometry definition also possible

4Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 5: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

pLISA concepts• Event representation

• Spatial discretisation (implied in discretised detector structure)

• Time discretisation (“hits” in sensing elements are binned in time, by count or deposited energy)

5Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

tE

x,y,z

Page 6: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

pLISA concepts• Processing and Outputs

6Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Input“images”

Local feature maps

Pooling,downsampling

Invariantfeatures,generalization

Dense layer(s)

Output

No prerequisite reconstruction output - depends on no other softwareCNN could instead help/support standard reconstruction algorithms with first guess

Page 7: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

• KM3NeT Water Cherenkovin deep sea

• Building block: 115 DUs

• Detection Unit (DU):18 evenly spaced DOMs

• Digital Optical Module (DOM):31 PMTs (+ other instruments)

Application to KM3NeT

7Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 8: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

+

Phase1 (fully funded - deploy 2019)

KM3NeT 2.0 Letter of Intent:arXiv:1601.07459 and

J.Phys. G43 (2016) 084001

Phase 2 (partially funded - deploy 2019/21)

• Astroparticle Research withCosmics in the Abysskm3 size building blocks (2 )z DOM spacing 36 m

• Oscillation Research withCosmics in the Abyss1 building blockz DOM spacing 9 m

Application to KM3NeT

8Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 9: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

Application to KM3NeT - ARCA

9Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Deviation from regularised structure can be introduced later as a next-order correction

• XYZ detector regularisation• exactly 90m spaced in (X,Y)• exactly 36m spaced in Z• Regularised detector

contained in Lattice

Page 10: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

Application to KM3NeT - ARCA• Event representation

• Reduce data sparsity: ignore PMT orientation andconsider only DOM

• Force DOM positions onto regularised lattice

• DOM mapped to a position in 16×15×18 lattice

• Hit time binned to 12 ns bin 75 bins / event

• Matrix structure of any event (TXYZ): [75×16×15×18]4D image (or 3D movie)

10Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 11: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

Application to KM3NeT - ARCA• 258,879 events• νeCC and νμCC

• Training no longer than a few hours• Slow models also tested (training for several days) but discarded (no real improvement)

11Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 12: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

• Colour code: hits/DOM/time bin

• XY found to be irrelevant forthis task

KM3NeT - Up/Down-going classification

12Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Up-going

Down-going

Page 13: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

• cos θz estimated by network• Performance depends on

track length in detector

KM3NeT - Direction(Z) & Up/Down-going classification

13Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Down-going KM3NeT PRELIMINARYKM3NeT PRELIMINARY

RMS(cos θz,est - cos θz,true) = 0.001 for traditional algorithms but using quality cuts, 0.002 for CNN - but no specific optimization done, and works on all events

Page 14: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

KM3NeT - Event type (νeCC/νμCC) classification• XY shape and TZ evolution analysed separately in the first stages,

merged at later stage after extraction of features

14Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

νμCC νeCC

Page 15: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

KM3NeT - Event type (νeCC/νμCC) classification• Classification efficiency found to depend on track length in detector• Energy dependency milder

15Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

KM3NeT PRELIMINARY

Page 16: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

• Events are not pre-classified with respect to particle ID

• The CNN does not have any other information than hits

• νeCC and νμCC handled together and mixed in training/test/validation samples

KM3NeT - ν Energy estimation

16Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

Page 17: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

KM3NeT - ν Energy estimation• Good linearity• Could improve

with more statistics athigh energy

17Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

KM3NeT PRELIMINARY

Page 18: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

Conclusions• pLISA is a convenient framework to flexibly develop neural networks for astroparticle

identification and study

• Based on widespread technologies

• Usable output can already be produced

• Can already provide a cross-check for“traditional” reconstruction algorithms

18Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno

https://baltig.infn.it/bozza/plisa/

Page 19: Parallel Library for Study of Astroparticles pLISA: a ...multi-messenger.asterics2020.eu/Documents/presentations/Bozza_Cristiano.pdf · 4D image (or 3D movie) Cristiano Bozza - University

Parallel Library forIdentification andStudy of Astroparticles

• Improve and clean up library API

• Provide support for problems with higher dimensionality

• Application side: try to include all neglected ingredients for real situations, including efficiency,irregular shapes, noise, detector distortion, etc.

Outlook

19Cristiano Bozza - University of Salerno and INFN Gruppo Collegato di Salerno