us atlas sctpm machine learning and atlas paolo calafiura what is ml: data-driven statistical...

22
US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML : data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic hardware evolution pushes us towards computing with many simple elements Applications: many! Today will focus on Neuromorphic computing for low-power scalable tracking Deep Neural Networks for data analysis, data movement 1

Upload: aubrie-wilcox

Post on 29-Dec-2015

220 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Machine Learning and ATLAS

Paolo Calafiura

What is ML: data-driven statistical modeling of complex systems

Why now: GPU/MIC/FPGA/Neuromorphic hardware evolution pushes us towards computing with many simple elements

Applications: many! Today will focus on

• Neuromorphic computing for low-power scalable tracking

• Deep Neural Networks for data analysis, data movement

1

Page 2: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Motivating Example: Tracking

During Run 4 we have to process ~60M tracks/s (~20x Run 2)

I/O will likely constrain us to run full tracking on line, and to write only an xAOD-like format

Guesstimating x86 cost & performance evolution, we should find x2-5 CPU offline, (much) more @ Point 1

Surely we can parallelize our way out of trouble?

2

Page 3: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Why is it so hard to do particle tracking in parallel?

Algorithms: Iterative (propagation, fitting), irregular (combinatorial searches with lots of branch points)

Data: sparse (hits), non-local access (B-field integration)

Can ML allow us to train Neural Networks (NN) that use regular, trivial algorithms, and a naturally data parallel approach?

3

Page 4: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Neural Networks:Computing with simple elements

‘neuron’

by themselves, limited functional repertoire.

Simple computing elements…

(Kristofer Bouchard, LBNL)

Page 5: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Feed-forward NN: Classification

1

‘neuron’

2

as a network, learn to perform diverse functions

Simple computing elements…

Flow of information

Classification

Page 6: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Convolutional NN: Feature Extraction

Feature Extraction Classification

‘neuron’

60

Simple computing elements…

as a network, learn to perform diverse functions

Page 7: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Recurrent NN: Time-varying Functions

F(t)

‘neuron’ Simple computing elements…

Feature Extraction Classification Time-varying Functions

Dynamics

as a network, learn to perform diverse functions

Page 8: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Tracking Kaggle Challenge

Inspired by the very successful Higgs Challenge

Competition among ML experts:

• Problem: Given a list of space-points produce a list of track candidates

• Figure of merit: efficiency for given fake rate and CPU budget (still under discussion)

More next week during C&S plenary

8

Page 9: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

LHCb Trigger Retina Processor

FPGA implementation1mus trackingOffline-quality performanceCertainly good enough for seeding

Track parameter space22K bins, one “receptor” per bin

Page 10: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Neuromorphic Computing

“Spikey” from Electronic Visions group in Heidelberg

Qualcomm’s NPU’s for robots.

SpiNNaker’s 1B neuron machine

Stanford’s Neurogrid

Intel’s concept design...

IBM’s TrueNorth

(Peter Nugent, LBNL)

Page 11: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

IBM TrueNorth

• 1 million programmable neurons• 256 million synapses• 4096 neurosynaptic cores• Uses 70mW per chip• 5.4 billion transistors• Spiking rate >1000Hz

A single chip can process color video in real-time while consuming 176,000 times less energy than a current Intel chip performing the exact same analysis. Note the Intel chip can not do this analysis in real-time and is in fact 300 times slower! Merolla+ Science (2014)

Page 12: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Neuromorphic Kalman Filters (LBNL LDRD FY16 proposal)

Paolo Calafiura (CRD), Kristofer Bouchard (Life Sciences),David Donofrio (CRD), Rebecca Carney (Physics),

Maurice Garcia-Sciveres (Physics), Craig E. Tull (CRD)

Implement Kalman filters on neuromorphic chips for low-power, high-throughput, real-time data processing

Brain-machine interfaces Charged particle tracking

Page 13: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Deep Learning for Data Analysis

Peter Sadowski (UCI)

Page 14: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Optimizing Higgs Detection

Peter Sadowski (UCI)

Page 15: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Optimizing Higgs Detection

Peter Sadowski (UCI)

Page 16: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

dianahep (NSF S2I2 project) Improve ML tools in ROOT

Page 17: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Machine Learning and Data Management

Vast and growing amount of data on user access patterns. Combine engineered and learned features to:

• Pre-fetch data and pre-allocate resources

• Optimize data clustering and replication

• Suggest related data sets

17

Page 18: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Conclusions

• ML will be predominant in Run 4 analysis (wager #1)• Deep neural networks in tracking, jet reco, and clustering will

allow us to exploit GPUs and FPGAs, and possible neuromorphic architectures (wager #2)

• 2025 grad-students will wonder why we wrote all that C++ junk instead of training a few good networks (wager #3)

ML learning experts are not formed overnight (and command high 6-digits salaries, so tend to disappear fast)

US ATLAS should start developing ML expertise by supporting pilot projects in all relevant areas, e.g. data analysis, reconstruction (tracking), and data/job management

18

Page 19: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Thanks

• Kristofer Bouchard• David Clark• Kyle Cranmer• Maurice Garcia-Sciveres• Peter Nugent• Peter Sadowski• Tracking Kaggle group• …

]

Page 20: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Backup

Page 21: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Possible Tracking Network

F(t)

Seeding Selection Fitting

Dynamics

Track

Candidates

Page 22: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic

US ATLAS SCTPM

Kalman Filters and Recurrent NNs

Classic Data Assimilation algorithm (1960, NASA)Iteratively track evolution of a dynamic system

StateData

KalmanFilter

Dynamics

Data

Dyn

State

RecurrentNeuralNetwork