31
The Evolution of Data Warehouse Automation Barry Devlin, 9sight Consulting April 16, 2015

The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

  • Upload
    lamnhan

  • View
    222

  • Download
    1

Embed Size (px)

Citation preview

Page 1: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

The Evolution of Data Warehouse Automation

Barry Devlin, 9sight Consulting

April 16, 2015

Page 2: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

2

Sponsor

Page 3: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

3

Speakers

Mark Budzinski

President,

WhereScape

Barry Devlin Founder & Principal,

9sight Consulting

Page 4: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Copyright © 2015 9sight Consulting, All Rights Reserved

Dr Barry Devlin

Founder & Principal

9sight Consulting

The Evolution of Data Warehouse Automation

TDWI Webinar

16 April 2015

Page 5: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Dr. Barry Devlin

5 Copyright © 2015, 9sight Consulting

Founder and Principal

9sight Consulting, www.9sight.com

Dr. Barry Devlin, founder and principal of 9sight Consulting

(www.9sight.com), is among the foremost authorities on

business intelligence (BI), big data and beyond. He is a

founder of data warehousing, having defined its first

architecture in 1985. A respected visionary and thought-

leader in the evolving data industry, Barry has authored two

ground-breaking books: the classic "Data Warehouse--from

Architecture to Implementation" and “Business

unIntelligence--Insight and Innovation Beyond Analytics

and Big Data” (bit.ly/BunI_Book) in 2013.

With over 30 years of experience in IT, until 2007 with IBM

as a consultant, manager and distinguished engineer,

Barry provides strategic consulting and thought-leadership

to buyers and vendors of BI and Big Data solutions. He is

an associate editor of TDWI's Journal of Business

Intelligence, and a regular keynote speaker, teacher and

writer on all aspects of information creation and use.

Barry operates worldwide from Cape Town, South Africa.

Email: [email protected]

Twitter: @BarryDevlin

Page 6: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

35 years of evolution of BI needs: ever bigger, always faster and increasingly complex

6 Copyright © 2014, 9sight Consulting

1985

1995

2005

2015

1990

2000

2010

Consolidating reporting

across business lines

Access to closer

to real-time data

E-Commerce converges

operational-informational

Web logs offer view of

interactions – not just

transactions

Social media data offers

sentiment and relationships for

marketing – predictive

analytics

Devices on Internet of

Things reveal individual

behaviors and measures -

instantaneous analytics

Data mining and

basic BI querying

“Big Data

Eclipse”

Page 7: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Gaining value from social media and the Internet of Things depends on “(sm)all data” management.

The need for data warehousing

continues to grow despite “data lakes” – Core business information

– Consistent and integrated

business management

Business needs change faster

than ever – Short iteration projects

– Ongoing dev/maint process

Business and IT must work together – Biz-tech ecosystem

7 Copyright © 2015, 9sight Consulting

Business

Information

Technology

Page 8: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Functional drivers

Consistency across sources

Cleanliness of base data

“Single version of the truth”

All increasingly needed in an

always-connected, information-

overloaded world

8

The layered Data Warehouse has been a constant since the early ’90s.

Copyright © 2014, 9sight Consulting

Data marts

Enterprise data warehouse

Me

tad

ata

Data

warehouse

Operational systems

“An architecture for a business

and information system”,

B. A. Devlin, P. T. Murphy,

IBM Systems Journal, (1988)

Page 9: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Working across the layers

Project-specific business

needs for data and function

Enterprise data model

and database design

ETL transformation

and cleansing (x 2)

“Data archaeology”

9

It has also long presented development challenges.

Copyright © 2014, 9sight Consulting

Data marts

Enterprise data warehouse

Me

tad

ata

Data

warehouse

Operational systems

Automation is mandatory

Page 10: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Big Data support adds to the development challenge.

Pillar architecture supports

multiple data types – Data Warehouse + Operational

Systems = Process-mediated

– IoT = Machine-generated

– Social media = Human-sourced

More development challenges – Diverse Big Data sources

– Access to data at source vs. Load

to Data Warehouse

– Assimilation of context and

relationships across pillars

– Data governance

10 Copyright © 2015, 9sight Consulting

Transactions

Human-

sourced (information)

Machine-

generated (data)

Process-

mediated (data)

Context-setting (information)

Assimilation

Transactional (data)

Events Measures Messages

Instantiation

Reification

Automation again mandatory

Page 11: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Three development (and maintenance) challenges

Highly iterative development cycle

between business and IT

Disparate and unconnected

development tooling

Multiple, unrelated stores of

metadata

11 Copyright © 2015, 9sight Consulting

Page 12: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Data Warehouse Automation is an evolution from more manual, traditional approaches.

Reduce number of tools and environments

Create an integrated, agile requirements /

design / development / maintenance

environment

Provide a common, shared store for all context-

setting information (metadata) – business,

technical and more

Support full collaboration between

business and IT

12 Copyright © 2015, 9sight Consulting

Page 13: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Automation of the full development and maintenance process increases productivity of business and IT.

Traditional ETL approach

separates design scopes – Disparate tools for modeling,

design and ETL

– Additional servers and licenses

ELT (extract, load & transform)

integrates the entire design scope – Single design and development

environment

– Benefits from performance of

relational database

13 Copyright © 2015, 9sight Consulting

Operational systems

EDW

Data marts

Transform

ETL Server

ETL Server

Transform

Transform

Transform

Extract

“Extract” Extract Extract

Load

Load

Load Load

Load

Extract

“Load”

Data marts

ETL ELT

Inte

gra

ted

De

sig

n S

co

pe

Se

pa

rate

d D

esig

n S

co

pe

s

Page 14: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

The demise of metadata… and the rise of context

Metadata is two four-letter words! – Information (not data)

– Describes all “stuff” (not just data)

– Indistinguishable from “business

information” by non-IT people

(and some IT people)

– Many (or most) metadata projects fail

NSA popularizes/repurposes metadata – “It’s metadata, not personal info…

so, we can collect it”… How ironic!

Context-setting information (CSI): – From business meaning to information collected

– From data stored to context understood

14 Copyright © 2015, 9sight Consulting

Lo

cu

s

Structure

Loose Strict

Information

Knowledge

Meaning

Know-why

Know-how

Know-that Know-of

Data

structure

CSI

Content

Page 15: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

A common CSI (metadata) foundation for business and IT enables extensive collaboration.

15 Copyright © 2015, 9sight Consulting

• Requirements

• Data needs and

availability

• Models and

databases

• Queries and

reports

• Etc.

Collaborative

Team

Business Person

Interaction

Scope:

Business &

Technical

Innovate Context-

setting

information

Interact

Shared

Information IT Person

Page 16: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Balancing consistency with time to value engages business with IT in a joint pursuit.

Integrated development process

between business and IT – Requirements to data design and

query in one session

– Iterations over further joint sessions

– Collaborative working approach

– All context-setting information stored

in one place

All context is carried over to

maintenance process – Maintenance as an extension of dev.

– Business and IT involvement

– Data quality as a focus

16 Copyright © 2015, 9sight Consulting

Page 17: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Conclusions

17 Copyright © 2015, 9sight Consulting

1. Big Data value and use depends on having

core business information (Data Warehouse)

2. Data Warehouse Automation is key to timely

and standardized development and

maintenance

3. Shared context-setting information

enables ongoing business/IT

collaboration

4. Business/IT collaboration supports

uncertain and changing requirements

Page 18: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Copyright © 2015 9sight Consulting, All Rights Reserved

Dr Barry Devlin

Founder & Principal

9sight Consulting

Thank you Questions?

18

Page 19: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

19

Using Automation Techniques to Satisfy the Thirst for Data

Mark Budzinski, President [email protected]

Page 20: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

20

• Overcomplicated

• Too many tools

REQUIREMENTS

DW framework

Profile

Logical Model

Physical Model

DB Architecture

Storage Mgmt

Index Mgmt

OLAP Design

ETL Mapping

ETL Dev

Version Control

Workflow

Deployment

Maintenance

Word/Excel

Mainly in-house

solutions

Informatica

Microstrategy

IBM Clear Case

Trillium

AbInitio

DataStage

TOAD

PowerDesigner

Enterprise Architect

Cognos

JIRA

SSMS

SSIS

SSAS

SVN

TFS

IBM Clear Quest

Change tool

Change tool

Change tool

Change tool

Change tool

Change tool

Informatica

ErWIN

DB Management Tools

DataStage

Documentation

Traditional Approach to Development

Page 21: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

21

More with less - Automated Data Integration

Automated Data Integration software:

• that designs, builds and operates data warehouses

• automates repeatable best practice development standards

• with built in Data Governance

• using metadata to drive agile delivery

• with consistent quality and full documentation

• and delivers value, faster, to business & IT stakeholders

• that saves time and money for our customers

Page 22: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

22

Page 23: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

23

Data Warehouse Automation

WhereScape Approach

• Simplification

• Automation

• Data Driven!

DW Framework

Profile

Physical Model

DB Architecture

Storage Mgmt

Index Mgmt

OLAP Design

ETL Mapping

ETL Dev

Version Control

Workflow

Deployment

Maintenance

Logical Model

Documentation

REQUIREMENTS

WhereScape RED

WhereScape 3D

Page 24: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

24

Some of Our Customers

Page 25: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

25

“The six-month project has now been completed, smashing the previous three-year forecast by using agile development to overhaul each data warehouse business area in ‘sprints’.” Andy Ruckley, Head of Technology – Data Platforms, Tesco PLC

Page 26: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

26

“WhereScape is enabling us to get value from sensor data and shorten times to market; we are able to deliver our BI solutions faster than ever before.

Using WhereScape RED, what used to take 1 hour coding by hand now takes 6 minutes using Data Automation.”

Stijn Roelens, Enterprise BI Architect, Volvo Trucks

Page 27: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

27

“WhereScape has accelerated and amplified our output by orders of magnitude. A week of development in our prior environment can now be done in 30 minutes using WhereScape RED.” Dana Keith, manager of applications and data warehousing – WVNET

Page 28: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

28

“Our results using WhereScape have been extremely impressive. WhereScape enabled us to design, develop, document and deploy a production-ready solution in 8 weeks. Using traditional data warehouse development methods would have taken us 6-8 months.”

Ryan Fenner, VP, Data Solutions Architect – Union Bank

Page 29: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

29

Thank You

Automated Data Integration from WhereScape

us.wherescape.com

Page 30: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

30

Questions and Answers

Page 31: The Evolution of Data Warehouse Automation - 1105 Mediadownload.101com.com/pub/tdwi/Files/041615 WhereScape.pdf · The Evolution of Data Warehouse Automation ... TOAD PowerDesigner

Contacting Speakers

• If you have further questions or comments:

Barry Devlin

[email protected]

Mark Budzinski

[email protected]