24
CUbRIK Summer School 2014 CUbRIK Summer School 0 Introducing SMILA Unified Information Access Architecture Ralph Traphoener Empolis Information Management GmbH

SMILA in CUbRIK

Embed Size (px)

DESCRIPTION

SMILA Unified Information Access Architecture extended in CUbRIK, illustrated by Ralph Traphoener (Empolis Information Management GmbH)

Citation preview

Page 1: SMILA in CUbRIK

CUbRIK Summer School 2014

CUbRIK Summer School 0

Introducing SMILA

Unified Information Access Architecture

Ralph Traphoener

Empolis Information Management GmbH

Page 2: SMILA in CUbRIK

CUbRIK Summer School 2014Content.

Page 3: SMILA in CUbRIK

CUbRIK Summer School 2014

Creating structure.

Page 4: SMILA in CUbRIK

CUbRIK Summer School 2014

Bridging the gap.

Page 5: SMILA in CUbRIK

CUbRIK Summer School 2014

Systematic.

Page 6: SMILA in CUbRIK

CUbRIK Summer School 2014Dynamic.

Page 7: SMILA in CUbRIK

CUbRIK Summer School 2014

Need for speed.

Page 8: SMILA in CUbRIK

CUbRIK Summer School 2014

SMILA

Solr

OntologyService

SimpleFile

Objectstore

SimpleClusterConfig

SMILA is an

extensible framework

for building Big Data and/or search solutions

to access and processunstructured information

SMILA is …

Page 9: SMILA in CUbRIK

CUbRIK Summer School 2014

Page 10: SMILA in CUbRIK

CUbRIK Summer School 2014

Information Factory.

Page 11: SMILA in CUbRIK

CUbRIK Summer School 2014

„Mapping fromunstructured datato structured datasets will be a key

Web Squaredcompetency.“

Tim O‘Reilly and John Battelle

Page 12: SMILA in CUbRIK

CUbRIK Summer School 2014

Lorem.

Page 13: SMILA in CUbRIK

CUbRIK Summer School 2014

Guinea Pig

Empolis senior developer

Java/JavaScript background

Used SMILA once before

… but different use case

Is not a SMILA comitter

Page 14: SMILA in CUbRIK

CUbRIK Summer School 2014

Page 15: SMILA in CUbRIK

CUbRIK Summer School 2014

Crawl the

seeds

Extractcontent

Extractproject

Extractcategory

NERCUbRIKCrowd

Index all facets

Page 16: SMILA in CUbRIK

CUbRIK Summer School 2014

Page 17: SMILA in CUbRIK

CUbRIK Summer School 2014

BPEL Designer

1/10/2011 CUbRIK Presentation 16

Page 18: SMILA in CUbRIK

CUbRIK Summer School 2014

Synchronous and Asynchronous

Bla

ck

bo

ard

Indexation

REST API

ZooKeeper

REST API

Search

Workflow

Worker A

Worker B

Worker C

Worker D

Job Management

Pipeline

Pipelet X

Pipelet Y

Pipelet Z

BPEL

OSGI

ObjectStore

Page 19: SMILA in CUbRIK

CUbRIK Summer School 2014

OSGi

Java (Runtime Environment)

OSGi

Bundles Services

SMILA Job

Manager

Task

Manager

WebCrawler

Worker

n

JobHandler

ZooKeeper

Service

...

org.eclipse.smila.jobmanager

org.eclipse.smila.taskmanager

...

Page 20: SMILA in CUbRIK

CUbRIK Summer School 2014Interfaces

• Your own Software

• proprietary

• asset

• cost

• Open APIs

• No Lock-In

• Protection ofInvestments

• Protection ofIntellectual Property

Page 21: SMILA in CUbRIK

CUbRIK Summer School 2014

SMILA

Solr

OntologyService

SimpleFile

Objectstore

SimpleClusterConfig

SMILA is an

extensible framework

for building Big Data and/or search solutions

to access and processunstructured information

SMILA is …

Page 22: SMILA in CUbRIK

CUbRIK Summer School 2014

IAS

Smartfinder

Text MiningEngine

Distributed

Objectstore

Node/Cluster Control

Information Access System (IAS)

The Empolis IAS is the

semantic platform for value added knowledge

management solutions

Page 23: SMILA in CUbRIK

CUbRIK Summer School 2014

Add more meaning.

Page 24: SMILA in CUbRIK

CUbRIK Summer School 2014

Choose the patternyou like.