Transcript
Page 1: A Java Toolbox for the Analysis of MassIve Data STreams using Probabilistic Graphical ...people.cs.aau.dk/~tdn/papers/EDFposter.pdf · 2017-07-31 · A Java Toolbox for the Analysis

A Java Toolbox for the Analysis ofMassIve Data STreams using

Probabilistic Graphical Models

DescriptionWe present a Java toolbox for scalable probabilistic machine learning.

Probabilistic machine learning

Model your problem using a flexi-ble probabilistic language based ongraphical models. Then, fit it withdata using a Bayesian approach tohandle modelling uncertainty.

H x

Multi-core and distributed processing

AMIDST provides tailored parallel anddistributed implementations of Bayesianparameter learning for batch and stream-ing data. This processing is based onflexible and scalable message passing al-gorithms.

Main Features

Probabilistic Graphical Scalable inference Data StreamsModels

Specify your model usingprobabilistic graphical mod-els with latent variables andtemporal dependencies

Perform inference on yourprobabilistic models withpowerful approximate andscalable algorithms.

Update your models whennew data is available. Ap-propriate for learning fromdata streams.

WEKA

Large-scale Data Researchers Interoperability

Use your defined models toprocess massive data sets ina distributed computer clus-ter using Flink or Spark.

Flexible toolbox for re-searchers performing theirexperimentation in machinelearning.

Leverage existing function-alities and algorithms by in-terfacing to other softwaretools.

Real world and highly complex use-cases

Risk prediction in creditoperations

Recognition of trafficmaneuvers

Risk prediction in credit opera-tions. Financial data is collectedcontinuously and reported on amonthly basis. We explore it asan evolving classification prob-lem and co. This work has beenperformed in collaboration withone of our partners, the Spanishbank BCC.

Prototype models for early recog-nition of traffic maneuver inten-tions. Data is continuously col-lected by car on-board sensorsgiving rise to a large and quicklyevolving data stream. This workhas been performed in collabo-ration with one of our partners,DAIMLER.

Identifying global trends in the financial sector

Individual trends forfinancial indicatorsWe can identify seasonal

drifts and gradual changes.

{E[H1,t]}−credit{E[H2,t]}−income{E[H3,t]}−expenses{E[H4,t]}−balance{E[H5,t]}−mortgages{E[H6,t]}−loans

Global trend forfinancial indicatorsHighly correlated with

unemployment rate (UR).Pearson’s correlationcoefficient is 0.96.

Apr

200

7

Aug

200

7

Dec

200

7

Mar

200

8

Jul 2

008

Oct

200

8

Feb

200

9

Jun

2009

Sep

200

9

Jan

2010

Apr

201

0

Aug

201

0

Dec

201

0

Apr

201

1

Sep

201

1

Jan

2012

Jun

2012

Oct

201

2

Mar

201

3

Jul 2

013

Dec

201

3

{UR}{E[Ht]}

Team

Andres Masegosa Ana M. Martınez Darıo Ramos-Lopez Rafael Cabanas Thomas D. Nielsen Helge Langseth Antonio Salmeron Anders L. MadsenDeveloper Developer Developer Developer Scientific Member Scientific Member Scientific Member Scientific Member

NTNU Aalborg University University of Almeria Aalborg University Aalborg University NTNU University of Almeria HUGIN EXPERT S/A

Academic and Industrial partners Contact

• Visit http://amidst.github.io/toolbox/ tosign up to join the growing AMIDST community ordownload the software.

The AMIDST project has received funding from the European Union’s Seventh FrameworkProgramme for research, technological development and demonstration under grant agreement no

619209. More information on amidst.eu.

Top Related