30
New Analytical Architectures March 21, 2013 Casey Kiernan • [email protected] Blog • www.the-data-platform.com Why Classic Data Warehousing Approaches Miss the Mark with Big Data

New Analytical Architectures for Big Data

Embed Size (px)

DESCRIPTION

Why Classic Data Warehousing Architectures miss the mark with the New Analytics

Citation preview

Page 1: New Analytical Architectures for Big Data

New Analytical Architectures

March 21, 2013

Casey Kiernan • [email protected]

Blog • www.the-data-platform.com

Why Classic Data Warehousing Approaches Miss the Mark with Big Data

Page 2: New Analytical Architectures for Big Data

Doug Cutting“Hadoop is the kernel of a new Distributed Data OS”

“The Future is Data”

Page 3: New Analytical Architectures for Big Data

Transactional

Communities

Personal

Data has Changed

> Trailing Indicators

> Reach/Influence

> Interactive

> Analytics has Changed

Can the Data Warehouse Architecture adapt?

Page 4: New Analytical Architectures for Big Data

The World as I See it

“Data” is the Platform

Page 5: New Analytical Architectures for Big Data

New DataClutch Analytics

Wink Eller

Page 6: New Analytical Architectures for Big Data

My Mountain Bike

Guidance

PerformanceRate of ClimbCalories BurnedMiles ObtainedTotal ClimbedElapsed Time

Current, Average, Max Values

Data CollectionSpeed / Trip Miles

Data CollectionCadence / RPM

Data Collection Heart Rate

Data Collection AltitudeTemperatureTime

Data Architecture - on a Local Wireless Network (ANT+ Protocol)

as a Data Platform

Page 7: New Analytical Architectures for Big Data

“Personal” Ride Analytics

…is this a Data Warehouse?

Page 8: New Analytical Architectures for Big Data

Beha

vior

s

Content

Progression of B

ehaviors

New Data Behaviors (individual actions) > Content > Time

Time Varia

nce

Page 9: New Analytical Architectures for Big Data

9

Gui

danc

e

Data

Meaningful

Massive

New Data More is Better…

Page 10: New Analytical Architectures for Big Data

BUSINESS INTELLIGENCEOLAP / DATA WAREHOUSE

OLTP / TRANSACTIONSDATA.

“Business” Analytics - Classic “DW”

Answers the question: What are our most profitable Products?

Page 11: New Analytical Architectures for Big Data

11

What will Happen?What did Happen?

StrategicTactical TrendingOperational Reporting

Months Weeks Weeks Months Years

Classic “Business” AnalyticsGood for Reporting, Forecasting

Descriptive/Trending Analytics

Page 12: New Analytical Architectures for Big Data

New“Personal” Analytics

Answers the question: Show me a good movie to watch!

DATA.

SELF-SERVICEGUIDANCE

BEHAVIOURS

Page 13: New Analytical Architectures for Big Data

StrategicTactical TrendingOperational Reporting

13

What will Happen?What did Happen?

Months Weeks Weeks Months Years

What is Happening RIGHT NOW!

“Personal” Analytics“Right Now” is a very important time-frame!

Predictive/Prescriptive Analytics

Page 14: New Analytical Architectures for Big Data

14

Page 15: New Analytical Architectures for Big Data

15

Page 16: New Analytical Architectures for Big Data

16

Ordering App

Data WarehouseOLTP to OLAP

Mapping

OLAP / ReportsFacts/DimensionsFinancial App

Master Data

BusinessAnalyst

What are our most Profitable Products?

Stag

ing

“Business” Analytical ArchitectureClassic “DW” Data Flow - Uni-Directional, Latent,…

Business Metrics, KPI, YTD ReportingFacts &

Dimensions

Page 17: New Analytical Architectures for Big Data

17

Application / UX

AnalyticsData

“Personal” Analytical Architecture

DataAnalysts

Analytical CapabilitiesScoring/Ranking, Recommendations,Natural Language Processing, Relevancy, Classification, Optimization, Collaborative Filtering,Personalization,Digital Attribution,…

“New” Data Flow - Iterative, Specialized, Extensible, plug & play Analytics, near real-time [Some components are open-source]

What movie should I watch tonight?

Page 18: New Analytical Architectures for Big Data

18

Published Analytics “Read” Performance

App Persistence“State” PersistencePersistence/Analytics

Mass Data StorageBehaviors / “Write” Performance

PersonalizedRecommendations

Personalization,Preferences, State

End-User ExperienceBrowser, Tablet,

Mobile,…Self-Service Application

“Personal Analytics” Data Architecture

Analytics EnginesPluggable

Data Scientists

“New” Data Flow – Detailed View of Components

Social SignalsRSS/Facebook/…

Page 19: New Analytical Architectures for Big Data

SALLY LIKES TACOS

HOW DO WE MODEL THIS DATA?

Let’s get personal…

Page 20: New Analytical Architectures for Big Data

Classic “DW” Data Model

Page 21: New Analytical Architectures for Big Data

OBJECT PREDICATE (Score) SUBJECT

SALLY LIKES (143) TACOS

MARY LIKES (200) TACOS

THE_TACO_SHOP MENU_ITEM TACOS

SALLY LIKES (125) THE_TACO_SHOP

SALLY CITY VENICE BEACH

THE_TACO_SHOP CITY VENICE BEACH

SALLY FRIEND (187) MARY

“Triples” - Directed (Weighted) Acyclic GraphModeling Social Data

Reach and Influence

Collaborative Filtering

Page 22: New Analytical Architectures for Big Data

Analyzing Relationships Reach and Influence

Page 23: New Analytical Architectures for Big Data

How important is Social?

Install ghostery.comShows you who is actively watching you surf the web! Lots of people!!!

Page 24: New Analytical Architectures for Big Data

Signals – The Core of New Data

SocialPersonalContent

Time

Mixture of Proprietary and Public Data

Page 25: New Analytical Architectures for Big Data

26

Published AnalyticsHbase

App PersistenceCassandra, Riak,…Persistence/Analytics

Data-Center or Cloud

Mass Data StorageHadoop

PersonalizedRecommendations

Personalization,Preferences, State

End-User ExperienceBrowser, Tablet,

Mobile,…Self-Service Application

Specialization of Data Technologies

AnalyticsR, Mahout, Pig

The New “Analytical Application” Architecture“New” Data Flow – Specialized Technology Choices

Page 26: New Analytical Architectures for Big Data

p. 27

Published Analytics

HBase

PersistenceRiak

Mass Data StorageBehaviors / “Write” Performance

Hadoop / AWS

Self-Service Application A

Analytics EnginePluggable

Data Scientists

Analytics EnginePluggableAnalytics Engine

Pluggable

Published Analytics

MySQL

PersistenceCassandra

Self-Service Application B

Servicing Multiple Analytical SystemsUsing Shared Analytical Mas- Storage

Page 27: New Analytical Architectures for Big Data

Integrating the Architectures

28

Data WarehouseOLTP to OLAP Mapping

OLAP / Reports

BusinessAnalystSt

agin

g

AppOnly Financial Events ($$$) cross the threshold(and are recorded into) the Data Warehouse

App

App

“Local” Events stay Local (they are analyzed locally)

“Personal” Analytics Stack + Classic “DW” Stack

Not all DATA Belongs in the Data Warehouse!

Page 28: New Analytical Architectures for Big Data

Classic DW New Analytics

Scope Enterprise Application

Analytics Trailing: OLAP Predictive: Machine LearningSentiment Analysis, Recommendations, Personalization, Natural Language Processing, Classification, Clustering, Optimization, Collaborative Filtering,Digital Attribution,…

Actionable? Loosely Coupled Tightly Coupled Analytics Embedded in Application

Data Structures Facts/Dimensions(Requires a DW)

Semantic Data, Graph / Triples, Observations, Direct Signals

Knowledge Expert Business Analyst Data Scientist

Technology Stack Vendor Driven ($$$) Open-Source

Architecture Scale-Up Scale-Out (or in the Cloud)

Classic DW Vs. the New AnalyticsThe Shift from “Business” Analytics to “Personal” Analytics

Page 29: New Analytical Architectures for Big Data

New Signals + New Analytics = New Scenarios

Data

Signals

Social

Location

Personal

Behaviors

Transactions

Content

Time

New Analytics

Recommendations,Natural Language

Processing, Relevancy,

Classification, Optimization, Collaborative

Filtering,Digital

Attribution,…

NewScenariosCustomer

Engagement, Customer Loyalty / Attrition / Retention, Fraud, Risk Analysis,

Intent, Customer Personalization

Page 30: New Analytical Architectures for Big Data

Thank You!

[email protected]: www.the-data-platform.com