22
WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect

WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

WEX Overview(Part II)Eric Smith – Watson Explorer Solution Architect

Page 2: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Topics

• New Insights from Unstructured Data

• Tailoring WEX to your Environment

• Connecting WEX to other Analytic Tools

2

Page 3: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

What is Unstructured Data?

3

News Articles Email Social Media

Page 4: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

4

WEX

Page 5: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

5

Page 6: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

6

Page 7: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Watson Knowledge Studio• Cloud based, machine learning

solution for developing new domain knowledge for Watson tools

• Information stored in knowledge

graphs

• Runtime environment for fine tuning

and refining annotations

• Leverage in Watson Explorer or

Alchemy Language applications

IBM Confidential

Page 8: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Watson Knowledge Studio Clip

8

Page 9: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

NHTSA Demo with WKS Annotator

9

Page 10: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Ontolection Trainer

• Provides a true machine learning

capability for creating an ontology

• ML algorithms are executed against a text corpus of data

• Output is leveraged within a

search collection to enable query expansion.

• Enhances Natural Language

Querying

10

Page 11: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Tailoring WEX to Your Environment

11

Page 12: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Analytic Components

Content Miner

Content Analytics

Admin Console

Analytics Infrastructure

Control Monitor

Configuration

Security Scheduler Logging

Websphere(Embedded or Enterprise)

360

Admin

360 Info

App

Foundational Infrastructure

Control Monitor ConfigurationSecurity Scheduler Logging

Crawlers

Content

Preparation(Text Analytics)

1

Indexer

2

Advanced

Analytics

& Search(runtime)

CrawlersConversion

Pipeline

1

Indexer

2

Search(runtime)

3

360 Info App User

Business Analysts &

Data Scientists

Data Sources

Foundational Components

4

4

3

UIMA

Domain Expert

Content

Analytics

Studio

Integrations with

REST

API

Annotator Admin

Console for

Foundational

Components

IBM Master Data

Mgmt

IBM

Counter

Fraud

BigInsights

BI Reporting

IBM Product

Integrations

Websphere(Embedded or Enterprise)

WEX Advanced Edition

Page 13: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

WEX Functional Architecture

On- Premise

Connector

Admin UI

Document Text

ExtractionIndexing

Application

Builder

On Premise

Annotation

Watson CloudWatson

Services

WEX Conversion Pipeline

Watson

Services

13

Page 14: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

14

Development EnvironmentLink to System Requirements: http://www-

01.ibm.com/support/docview.wss?uid=swg27045727

DE

V U

sers

Number of Server(s) CPUs Cores For Memory (GB) Storage

Each Server Each Server Each server

WEX FC Development

-18 64 3 TB

• Up to 3 to 6 TB of data

• No High-availability -failover

• RHEL Linux• On-Premise

• Application Builder

• WAS Liberty Profile

• Result aggregation

• Display rendering

• *Can be a VM

WEX AC

(Content

Analytics)

Developme

nt Server

WEX FC

Developme

nt Server

• NLP Annotation• Content Mining• *Can be a VM

64-bit x86, IBM POWER7, IBM POWER8, or IBM Z System

64-bit (AMD64 or Intel 64) x86 system

Normal flow (Primary)

Data replication (DI)

Fail-over flow

Normal flow (Secondary)

Annotator Flow

Page 15: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Normal flow (Primary)

Data replication

Load B

ala

ncer

• 15TB of data (10% structured)

• Projected index size:• Structured (1.5TB)• Unstructured (2 TB)

• High-availability - failover

• 8 Queries/second

• Indexing• Query service

Engine

Layer

• Crawling• Connectors

• Indexing failover

Crawl/Index Layer

• Clustering• Federated Search

• Result aggregation• Display rendering• App Builder

Application

Layer

Type of Server CPUs Cores For Memory (GB) Storage

Each Server Each Server Each server

Application 8 32 200 GB

Engine 16 64 2 TB

Crawler 16 32 1TB

Page 16: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

16

WEX EE Production Environment – 3 Tier Architecture

HW

Load B

ala

ncer

Number of Server(s) CPUs Cores For Memory (GB) Storage

Each Server Each Server Each server

Application -6

Engine- 6

16

32

128

128

500 GB

3 TB

Data – 6 32 64 3 TB

Normal flow (Primary)

Data replication (DI)

Fail-over flow

Normal flow (Secondary)

• Up to 14 TB of data

• High-availability -failover

• 7 Queries/second• RHEL Linux

• Indexing• Query

Routing• Search

Results

Engine

Layer

• Crawling and Indexing

• Data Refreshing

Data Layer

• User Interface• Integration

Layer

Application

Layer

64-bit (AMD64 or Intel 64) x86

system

64-bit (AMD64 or Intel 64) x86 system

64-bit (AMD64 or Intel 64) x86 system (Can be

VMs)

Page 17: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Questions to Ask

• What is the use case?

– Search, Analytics, Both?

• How much data?

• What kind of data?

• Data Growth?

• Usage?

• Interface Options?

17

Page 18: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Connecting WEX to Other Analytic Tools

18

Streams

Page 19: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Product Integrations

19

MDM* InfoSphere

BigInsights*StreamsFileNet P8

WebSphere Portal

I2 Analyst Notebook

Cognos

SPSS

Page 20: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

© 2017 International Business Machines Corporation 20

Natural Language Understanding – Augmented Indexes

Extract metadata automatically to improve exploration without building any annotators!

Page 21: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

© 2017 International Business Machines Corporation 21

Watson Discovery Service – Runtime Integration

Bring curated news and blog content that has been augmented by Alchemy in context into a Watson Explorer application

Page 22: WEX Overview (Part II) - IBM...WEX Overview (Part II) Eric Smith – Watson Explorer Solution Architect Topics • New Insights from Unstructured Data • Tailoring WEX to your Environment

Questions?

22