smila in cubrik
DESCRIPTION
SMILA Unified Information Access Architecture extended in CUbRIK, illustrated by Ralph Traphoener (Empolis Information Management GmbH)TRANSCRIPT
CUbRIK Summer School 2014
CUbRIK Summer School 0
Introducing SMILA
Unified Information Access Architecture
Ralph Traphoener
Empolis Information Management GmbH
CUbRIK Summer School 2014Content.
CUbRIK Summer School 2014
Creating structure.
CUbRIK Summer School 2014
Bridging the gap.
CUbRIK Summer School 2014
Systematic.
CUbRIK Summer School 2014Dynamic.
CUbRIK Summer School 2014
Need for speed.
CUbRIK Summer School 2014
SMILA
Solr
OntologyService
SimpleFile
Objectstore
SimpleClusterConfig
…
SMILA is an
extensible framework
for building Big Data and/or search solutions
to access and processunstructured information
SMILA is …
CUbRIK Summer School 2014
CUbRIK Summer School 2014
Information Factory.
CUbRIK Summer School 2014
„Mapping fromunstructured datato structured datasets will be a key
Web Squaredcompetency.“
Tim O‘Reilly and John Battelle
CUbRIK Summer School 2014
Lorem.
CUbRIK Summer School 2014
Guinea Pig
Empolis senior developer
Java/JavaScript background
Used SMILA once before
… but different use case
Is not a SMILA comitter
CUbRIK Summer School 2014
CUbRIK Summer School 2014
Crawl the
seeds
Extractcontent
Extractproject
Extractcategory
NERCUbRIKCrowd
Index all facets
CUbRIK Summer School 2014
CUbRIK Summer School 2014
BPEL Designer
1/10/2011 CUbRIK Presentation 16
CUbRIK Summer School 2014
Synchronous and Asynchronous
Bla
ck
bo
ard
Indexation
REST API
ZooKeeper
REST API
Search
Workflow
Worker A
Worker B
Worker C
Worker D
Job Management
Pipeline
Pipelet X
Pipelet Y
Pipelet Z
BPEL
OSGI
ObjectStore
CUbRIK Summer School 2014
OSGi
Java (Runtime Environment)
OSGi
Bundles Services
SMILA Job
Manager
Task
Manager
WebCrawler
Worker
n
JobHandler
ZooKeeper
Service
...
org.eclipse.smila.jobmanager
org.eclipse.smila.taskmanager
...
CUbRIK Summer School 2014Interfaces
• Your own Software
• proprietary
• asset
• cost
• Open APIs
• No Lock-In
• Protection ofInvestments
• Protection ofIntellectual Property
CUbRIK Summer School 2014
SMILA
Solr
OntologyService
SimpleFile
Objectstore
SimpleClusterConfig
…
SMILA is an
extensible framework
for building Big Data and/or search solutions
to access and processunstructured information
SMILA is …
CUbRIK Summer School 2014
IAS
Smartfinder
Text MiningEngine
Distributed
Objectstore
Node/Cluster Control
…
Information Access System (IAS)
The Empolis IAS is the
semantic platform for value added knowledge
management solutions
CUbRIK Summer School 2014
Add more meaning.
CUbRIK Summer School 2014
Choose the patternyou like.