the open analytics platform - knimeknime restful web service client nodes •rest client nodes: get...

123
© 2016 KNIME.com AG. All Rights Reserved. What’s new Bernd Wiswedel

Upload: others

Post on 21-Apr-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved.

What’s new

Bernd Wiswedel

Page 2: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 2

What’s new…

• 2+1 feature releases in the last year: (3.0), 3.1, 3.2

• Changes documented online…

Page 3: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 3

What’s new pages and YouTube

Page 4: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 4

Changelog …

Page 5: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 5

Changelog …

New nodes/features by version:

Version # new nodes/sets # features

v2.12 27 53

v3.0 3 28

v3.1 14 21

V3.2 18 60

Version # new nodes/sets # features

v2.12 27 53

v3.0 3 28

v3.1 14 21

V3.2 18 60

Page 6: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 6

Outline

Interactive feature demos

… by the team

Page 7: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 7

Outline

• Workbench & User Interface

• Analytics / Mining

• PMML - Standardizing predictive models

• Streaming Executor

• Linked Data & Semantic Web

• KNIME Server & Cloud Products

• KNIME Big Data Extensions

Page 8: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 8

Workbench & User Interface

Page 9: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 9

Workflow Coach!

• Three options– Based on community

statistics– Analyze your

workspace*– From local server**

* Requires KNIME Personal Productivty License** Requires KNIME Server LicenseFor more information contact us: [email protected]

Page 10: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 10

Automated installation of features

• If you open a workflowwith a missing node

• The correct plugin isautomatically proposedfor installation

Page 11: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 11

Easier import/export of workflows

New File Extensionfor workflows and

groups Use the KNIME protocol for

starting workflows

Page 12: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 12

Analytics / Mining

Page 13: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 13

Analytics / Mining

• Trees and Tree Ensembles

• (Forward) Feature Selection

• Deep Learning in KNIME: DeepLearning4J Extension

Page 14: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 14

Analytics / MiningTrees and Tree Ensembles

Greg Landrum

Page 15: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 15

The general idea is to take advantage of the “wisdom of the crowd”: combining predictions from a large number of weak predictors leads to a more accurate predictor. This is called ”bagging”.

15

Typically: for classification the individual models vote and the majority wins; for regression, the individual predictions are averaged

X

…1

5 2

2 9 6 7

4

2 7

6 8 9 3

1

7 6

3 9 5 7

y

P1 P2 Pn…

KNIME’s Tree Ensemble models

Page 16: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 16

How does bagging work?

Pick a different random subset of the training data for each model in the ensemble (bag).

1

5 2

2 9 6 7

4

5 7

2 8 9 3

1

7 6

3 9 5 7

Build tree Build treeBuild tree

Page 17: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 17

An extra benefit of bagging: out of bag estimation

Allows testing the model using the training data: when validating, each model should only vote on data points that were not used to train it

X2

…1

5 2

2 9 6 7

4

2 7

6 8 9 3

1

7 6

3 9 5 7

y2OOB

P1 P2 Pn…

X1

…1

5 2

2 9 6 7

4

2 7

6 8 9 3

1

7 6

3 9 5 7

P1 P2 Pn…

y1OOB

Page 18: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 18

Random Forests

• Bags of decision trees, but an extra element of randomization is applied when building the trees: each node in the decision tree only “sees” a subset of the input columns, typically 𝑁.

• Random forests tend to be very robust w.r.t. overfitting (though the individual trees are almost certainly overfit)

• Extra benefit: training tends to be much faster

1

5 2

2 9 6 7

Build tree

Page 19: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 19

Gradient Boosting

• Another algorithm for creating ensembles of decision trees

• Starts with a tree built on a subset of the data

• Builds additional trees to fit the residual errors

• Typically uses fairly shallow trees

• Can introduce randomness in choice of data subsets (“stochastic gradient boosting”) and in variable choice.

Page 20: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 20

Trees and Tree Ensembles: Changes “under the hood”

• Support of binary splits for nominal attributes

• Missing value handling

• Support of byte vector data (high-dimension count fingerprints)

• Code optimization

– Runtime

– Memory

Page 21: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 21

Trees and Tree Ensembles: New nodes

• Gradient Boosting

– Also based on tree ensembles

– Boosting: Improving an existing model by adding a new model

– Shallow trees

• Random Forest Distance

– Distance measure induced by a random forest

– Based on proximity

Page 22: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 22

Demo: Tree Ensembles

Page 23: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 23

Gradient Boosting dialog

Page 24: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 24

Wine-quality prediction workflow

White wine results

Page 25: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 25

Wine-quality prediction workflow

Red wine results

Page 26: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 26

Random Forest distance workflow

Page 27: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 27

Similarity search dialog

Page 28: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 28

Input table

Page 29: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 29

Nearest Neighbors

Page 30: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 30

Nearest Neighbors

Page 31: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 31

Analytics / MiningFeature Selection

Page 32: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 32

Feature Selection

• “Feature selection is the process of selecting a subset of relevant features (variables, predictors) for use in model construction” [Wikipedia]

• Why?

– Better generalization

– Simplification of the model

– Shorter training times

• Dozens of methods to do that…

Page 33: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 33

Feature Selection

• Backward Feature Elimination:Start with full feature set, iteratively remove ‘worst’ feature

• Forward Feature Selection:Start with empty feature set, iteratively add ‘best’ feature

Page 34: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 34

Feature Selection nodes

• Same loop structure as former Backward Feature Elimination nodes

• Different strategies

– Forward selection

– Backward elimination

• Uses Flow Variable as score

– Flexibility

• Preconfigured meta nodes for both strategies

Page 35: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 35

Analytics / MiningDeep Learning in KNIME: DeepLearning4J Extension

Christian Dietz

Page 36: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 36

What is Deep Learning?

• State-of-the-art algorithms for learning tasks on images, videos, text or sound

• Multi-Layer Neural Networks

• Regression and Classification, Unsupervised Learning, Reinforcement Learning, …

Page 37: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 37

What is DeepLearning4J?

• Open-source Deep Learning framework

• Supports state-of-the-art network architectures

• GPU/CPU support

• Distributed computations on Apache Spark and Hadoop

• Word2Vec for Text Mining

Page 38: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 38

Deep Learning in KNIME

• Integration of DeepLearning4J

• Visually assemble networks using KNIME nodes

• Integrates with other KNIME extensions, e.g. KNIME Image Processing / KNIME Text Mining

• Networks can be trained and executed on GPU and CPU

Page 39: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 39

Basic Deep Learning workflow

Start Network

Create Architecture

Learn on Data

Create Predictions

Page 40: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 40

Celebrity Face Recognition

• Problem Description

– Recognize faces of celebrities in web images.

• Solution

– Image classification using a state-of-the-art deep convolutional network architecture (AlexNet).

Page 41: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 41

MSRA-CFW: Data set of celebrity faces

• 27210 faces of 20 celebrities

Page 42: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 42

Workflow

Page 43: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 43

Workflow

Page 44: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 44

Workflow

Page 45: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 45

Workflow

Page 46: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 46

Workflow

Page 47: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 47

Workflow

Page 48: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 48

Workflow

Page 49: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 49

Active Learning

• Labs Extension

• Involve user to construct training data set

• Workflow loop to query and label ‘interesting’ data points

• Used user-labeled data set on remaining data

Page 50: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 50

Active Learning (example from Node.Pedia)

Page 51: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 51

Statistics nodes

• Several new useful statistic nodes in KNIME Labs.

• Thanks to Bob Muenchen(University of Tennessee).

• Work in progress! We are still adding nodes.

• Missing anything?See R integration

Page 52: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 52

R Integration

• Rewrite of infrastructure

– Significantly faster

– Concurrent execution

• No change of usage model

Page 53: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 53

PMML - Standardizing Predictive Models

Page 54: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 54

What is PMML?

• Predictive Model Markup Language

• XML based standard for predictive models

• KNIME can export most of its models as PMML

• To consume 3rd-party models, a scoring engine such as Zementis Adapa/UPPI is more suitable

Page 55: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 55

PMML Creation in KNIME

• Special port for PMML models

• Supported by most KNIME learners

– Decision Trees, Neural Nets, …

– Ensembles

• Also used for Preprocessing

– Normalizing, Binning, Missing Values, …

• Modular PMML

– Built step by step parallel to the data flow

Page 56: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 56

Demo: Modular PMML

Page 57: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 57

Decision Tree to Ruleset

• Transforms a decision tree to a PMML ruleset model

– Easier to interpret

• Also outputs rules as a KNIME table

– Easier to export & deploy

– Can be manipulated using standard KNIME nodes

Page 58: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 58

Applying Rulesets

• New node: Rule Engine (Dictionary)

– Input: data and ruleset table

– Output: Results and optional PMML model

• Import rules from other sources

• Mix rules from multiple sources

Page 59: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 59

Streaming Executor

Page 60: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 60

Streaming

• Default Execution

Page 61: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 61

Streaming

• Streaming Execution

Page 62: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 62

Streaming

• Row-wise

• Process, pass & forget→ Faster with less I/O overhead

• Concurrent execution

Page 63: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 63

Demo: Streaming Executor

Page 64: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 70

Streaming – Pros & Cons

Advantages

• Less I/O overhead (process, pass & forget)

• Parallelization

Disadvantages

• No intermediate results, no interactive execution

• Not all nodes can be streamed

Page 65: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 71

Streaming – Streamed nodes

• More than 100 nodes

• Text Processing nodes

• Image Processing nodes

• …

Page 66: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 73

Semantic Web / Linked Data

Tobias Kötter

Page 67: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 74

Semantic Web

• Access the wealth of Semantic Web from within KNIME

• Create your own Semantic Web with the Memory Endpoint

• Read/write support for Semantic Web file formats

• Manipulate triple stores via SPARQL

• Usage model similar to database integration

Sponsored development byBoehringer-Ingelheim,

Germany

Page 68: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 75

Wrap-up – KNIME Analytics PlatformUtility Nodes & Integrations

Page 69: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 76

KNIME RESTful Web Service Client Nodes

• REST Client nodes: Get / Post / Put / Delete Resources

• Follow-up of famous KREST community extensions

• Integrating with KNIME’s XML/JSON Processing nodes

• Powerful Configuration

• Extensible (e.g. custom auth types)

Follow-up on KRESTcommunity extension

Page 70: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 77

KNIME Tableau Integration

• Tableau: Popular (commercial) visualization and dashboard application

• New KNIME nodes to:

– Write native Tableau files (TDE)

– Send data to Tableau server

Thanks for testing and feedbackgoes to Forest Grove

TechnologyKNIME‘s partner in Australia

Page 71: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 78

KNIME Server

Jon Fuller

Page 72: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 79

KNIME Software

Page 73: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 80

KNIME Server

Shared Repositories Access Management Web Enablement

Flexible Execution

Page 74: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 81

KNIME Server Improvements

• WebPortal enhancements

• Advanced scheduled execution

• Extended REST API

• For the sysadmins…

– KNIME Server installer (Windows/Linux)

– Administration pages – make admin easy

• Distribute license files for Analytics Platform extensions

E.g. Big Data Extensions, Personal Productivity (incl. Workflow Difference)

Page 75: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 82

Advanced scheduled execution

Page 76: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 83

KNIME WebPortal enhancements (JavaScript Views)

• “Paged Table” view to page-wise scroll through data supportingsorting, search and selection

Sponsored development byGenentech, USA

Page 77: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 84

WebPortal templates

Page 78: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 85

KNIME Server – Extended REST API

• Integrate KNIME Server functionality with IT infrastructure

• Execute workflows, check server status, and more

• See Blog Posts for detailed tutorials:– https://www.knime.org/blog/giving-the-knime-server-a-rest

– https://www.knime.org/blog/the-knime-server-rest-api

Page 79: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 86

Execute workflow via REST API

• Add Quickforms to define workflow API

Page 80: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 87

Workflow Automation

Page 81: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 88

Demo: Workflow Automation

Page 82: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 89

Workflow orchestration via REST API

• Calling a remote workflow

Part of the PersonalProductivity Extensions

Page 83: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 90

For the Sysadmins…

Page 84: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 91

KNIME Server Installer

Step-by-step guided Server installation (Windows and Linux)

Page 85: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 92

KNIME Server – Admin made easy

• KNIME Administrator is often not a KNIME Analytics Platform user

• Make tasks like user administration easier

• Get an overview of the KNIME Server health

Page 86: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 93

KNIME Server – Admin made easy

Go to theadministration

portal

Page 87: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 94

KNIME Server – Admin made easy

Page 88: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 95

KNIME Server – Admin made easy

Page 89: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 96

KNIME Server – Admin made easy

Page 90: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 97

KNIME Server – Admin made easy

Page 91: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 98

KNIME Server – Admin made easy

Page 92: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 99

KNIME Server – License Distribution

• License files no longer required in client installation

• Checked out from KNIME Server

• Centrally managed, less configuration required

Page 93: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 105

KNIME Workflow Difference

Page 94: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 106

Demo: KNIME Workflow Difference

Page 95: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 107

KNIME Workflow Difference - Summary

• Identifies changes in workflow-structure

• Aligns workflows to identify differences

• Part of KNIME Personal Productivity Extensions

• KNIME Server license includes Personal Productivity license

Page 96: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 108

KNIME in the Cloud

Page 97: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 109

KNIME Cloud Analytics Platform

• Get started quickly

• Bring your analytics to your cloud hosted data

• Scale your analytics workflow up to 32 cores and 448 Gb RAM

Page 98: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 110

KNIME Cloud Analytics Platform

https://azure.microsoft.com/en-us/marketplace/partners/knime/cloud-analytics-platform/

Page 99: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 111

KNIME Cloud Analytics Platform - Launch

Page 100: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 112

KNIME Cloud Analytics Platform - Launch

Page 101: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 113

KNIME Cloud Analytics Platform - Launch

Page 102: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 114

KNIME Cloud Analytics Platform - Launch

Page 103: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 115

KNIME Cloud Analytics Platform - Connect

Page 104: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 116

KNIME Cloud Analytics Platform - Connect

Page 105: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 117

KNIME Cloud Analytics Platform - Connect

Page 106: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 118

KNIME Cloud Analytics Platform - Connect

Page 107: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 119

Database Integration and KNIME Big Data Connectors

Tobias Kötter

Page 108: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 120

Database Integration - Recap

• Visually assemble complex SQL statements

• Connect to almost all JDBC-compliant databases

• Preconfigured nodes to connect to various databases

• Harness the power of your database within KNIME

Page 109: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 121

New Database Nodes

• Database Pivot

• Database Nummeric-/Auto-Binner and Apply-Binner

• Database Sampling with support for stratified sampling

• Parameterized Database Query

• Python Script (DB)/(Hive)

Page 110: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 122

KNIME Big Data Connectors - Recap

• Package required drivers/libraries for HDFS, Hive, Impala access

• Performs operations on Hadoop

• Extends the open source database integration

• Preconfigured connectors

Page 111: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 123

KNIME Big Data Connectors

• Support for Kerberos secured cluster

• Improved driver handling

• New nodes:

– httpFS Connector

– webHDFS Connector

Page 112: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 124

KNIME Spark Executor

Page 113: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 125

KNIME Spark Executor - Recap

• Based on Spark MLlib

• Scalable machine learning on Hadoop

• Algorithms for

– Classification (decision tree, naïve bayes, …)

– Regression (logistic regression, linear regression, …)

– Clustering (k-means)

– Collaborative filtering (ALS)

– Dimensionality reduction (SVD, PCA)

Page 114: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 126

• Usage model and dialogs similar to existing nodes

• Spark nodes start and manage Spark jobs

• No coding required

Familiar Usage Model

Page 115: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 127

• Spark RDDs as input/output format

• Data stays within your cluster

• No unnecessary data movements

• Several input/output nodes e.g. Hive, hdfs files, …

In-Hadoop Processing

Page 116: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 128

Combine with Existing KNIME Nodes

Page 117: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 129

Let KNIME Control Your Spark Jobs

Page 118: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 130

47 Spark Nodes and Counting

Page 119: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 131

KNIME Spark Executor

• Kerberos secured cluster support

• Easier installation

• Supports Spark version 1.2, 1.3, 1.5 and 1.6

Page 120: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 132

• More than 170 Mio rows with energy usage data from smart meters

• Uses KNIME Analytics Platform, Big Data Connectors and Spark Executor to forecast energy consumption

Use Case: Smart Meter Analysis

Page 121: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 133

Demo: KNIME Spark Executor

Page 122: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 134

Summary

• Constantly improving, also thanks to feedback of customers/partners/community

• Questions / Interested in demo / comments? Talk to us in the breaks / at the booth

Page 123: The Open Analytics Platform - KNIMEKNIME RESTful Web Service Client Nodes •REST Client nodes: Get / Post / Put / Delete Resources •Follow-up of famous KREST community extensions

© 2016 KNIME.com AG. All Rights Reserved. 135

The KNIME® trademark and logo and OPEN FOR INNOVATION® trademark are used by KNIME.com AG under license from KNIME GmbH, and are registered in the United States.

KNIME® is also registered in Germany.