39
Hans Willem Cortenraad, directeur 22 november 2012 Deploying Extreme Performance and Large-Scale BI 1 Emiel van Bockel / Berry van den Heuvel Oracle Open World 2014

Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Hans Willem Cortenraad, directeur

22 november 2012

Deploying Extreme Performance

and Large-Scale BI

1

Emiel van Bockel / Berry van den Heuvel

Oracle Open World 2014

Page 2: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Introduction

2

@bifacts

Emiel van Bockel

@obiee247

Berry van den Heuvel

Page 3: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

CB > Logistic Fulfillment > The Netherlands

3

600+

publishers

1800+

booksellers

#70.000.000 €1.000.000.000 (CAP) 3.000

Page 4: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Diversifaction strategy

4

BOOKS

Page 5: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Exalytics goals

5

Mobile usage

3000 Users 120 M Rows < 2 seconds

High performance

Great visuals

Page 6: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

This is no fairy tail

6

Page 7: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Tough road ….

7

Page 8: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

We made it!

8

Page 9: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Team

9

Information Services

Oracle Development

Customer Engineering

(CEAL team)

Page 10: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

2 years …

10

Order

Delivery

ACS

Multi Instance (CB)

Not supported

PHASE 1 – Bare metal

Install OVM

IB not working

Errors every install

Lots of discussion

Not working

PHASE 2 – OVM

Engineered

3000 users / 2s

Multi Instance

Mobile

Succeeded

PHASE 3 – POC

2 boxes

TimesTen

7 instances

Mobile

DONE!

PHASE 4 – Live

April – July 2012 Aug – Dec 2012 Aug – Nov 2013 April 2014

Page 11: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Proof of concept: Goal

• Engineered systems

– OBIEE / TimesTen / Infiniband integrated

• Multiple Instance

– Internal / External

– Flexible design (loosely coupled)

– Dev / Test / Production environments

• Speed of thought analyses

– 3000 concurrent users

– Dashboard click every 5-15 seconds Dashboard page contains ~ 5 reports

~ 2000 queries a second

– Based on a 120 million records fact table

– 90% queries < 2 seconds

– Mobile usage

11

Page 12: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Internal & External BI

12

3000 customers

EXTERNAL INTERNAL

100 business people

Page 14: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Exalytics Component Stack

14

Exalytics Oracle Linux

Weblogic

Oracle BI

TimesTen

RCU

Datamarts

• Host names

• IP addresses

• IB listener

• Host names • Port numbers

• Admin names

• Directory structure

• Instance names

• Port numbers

• Directory structure

• Sizing

• Schema names

• User names

• Database instances

• Users

• Database instances

• Port numbers

• BI Foundation names

Page 15: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Configuration Design - Template

15

Page 16: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

3 Best Practices to avoid mistakes

16

Knowledge Linux Network Databases Architecture Usage Tracking OBIEE

Architecture Naming Ports Users Database Design

Double Eye’s Principle Avoid typo’s Check every step

Page 17: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Installation ……..

17

• Do it right once

• Predefined configuration sheet

Page 18: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Result …

18

EXTERNAL INTERNAL

Hello World

Page 19: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

And another few hours ..…

19

EXTERNAL PRD INTERNAL PRD

PRODUCTION EXTERNAL TST INTERNAL TST

DEV / TEST

INTERNAL DEV EXTERNAL DEV

1 ready, 1 to go

Page 20: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

This is only the beginning ….

20

Page 21: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Extreme engineering

21

Page 22: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

22

Page 23: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Extreme Engineering – Golden Gate Bridge

23

Page 24: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Extreme Engineering - Exalytics

• Design before implement

• Dimensional data model

• Dashboard functionality

• Queries

• Joins

• Indexes

• Ask yourself

– How would I write the query?

– What aggregate would I use?

– What indexes are needed?

24

Page 25: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Data Driven Design Principle

25

TABLE AGG

INDEX

Page 26: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Tool: Aggregation Wizard

• Easy to create scripts

• Execute scripts using nqcmd

• When errors occur hard & time consuming

to solve

• Roll back scenario fragile

• Can’t handle Row Level Security

• Creates a complex data structure

• Be aware of open connections (rpd & tt)

26

nqcmd

Page 27: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Model yourself • Better understanding of your design

– Keep meaningful attributes

• Reuse of all dimensions

• You are in total control

• Loosely coupled design

– TimesTen & Exadata

• Only minutes of additional work

– Aggregate creation time is equal in both scenario’s

• Can save hours troubleshooting if an error occurs

• We saved additional costs on memory

– 1 TB upgrade costs 100K €

– By lean aggregate and index design we went from 400

Gb usage to 100 Gb usage

– You need 2 x 1TB (DEV/TEST + PROD)

– saved 2 x 30.000 = 60.000€ !

– Lack of intelligence in advisory tools

(summary advisor, index advisor)

27

Page 28: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Loading data / creating aggregates into TimesTen

• Options to load data into TT store:

– ttIsql .. (load within timesten, agg creation)

– ttBulkCp .. (load a file)

– ttLoadFromOracle ... (load from exadata, through Infiniband)

• Use “insert into table” instead of “create table as select …”

– Will lock a row in system tables

• Don’t use CTRL C

• No open sessions

– Risk of locks

• Count records before create

– Prevents expensive memory consumption

– Aggregation creation time awareness

– Hint: Use your Exadata machine to calculate

• Be aware of the data types (Oracle DB vs TimesTen)

• Do it right once (changes afterwards are difficult to make)

– (e.g.) rename table or column not possible (TimesTen)

28

Page 29: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Manual Aggregation Steps

29

Create Table (TT)

Create Index (TT)

Import or Copy (RPD)

Physical Joins (RPD)

Logical Sources (RPD)

Column Mapping (RPD)

Logical Level (RPD)

Row Level Security (RPD)

Page 30: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Test Multiple Architectures

30

Exadata Only

Exadata + MV’s

TimesTen Only

TimesTen + AGG’s

Indexes

Database

Rewrite

RPD

Rewrite

Page 31: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Use case

• 16 steps

• Thinktime random 5-15 seconds

31

login

filter page

filter slice filter slice

slice filter slice filter filter

logout

page

page

slice

Page 32: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Performance metrics multiple architectures

0

50

100

150

200

250

Exadata Only

TimesTen Only

Exadata + MV's

TimesTen + AGG's

AVG user story (seconds)

32

Page 33: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Metrics TimesTen with Aggregates

33

3000 Users 120 M Rows < 2 seconds

Page 34: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Demo

34

Page 35: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Publishers reaction

35

Looks great No Excel

needed

Performs

well Nice

overview

Page 36: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Lessons learned

• Support from Oracle (stovepipe organization)

– Early adopter: experience & knowledge

– No engineered organization

– Default answers to unknown problems

• Oracle CEAL Team

– Open / focus

– Good communication

– Right engineer, right manager -> SPOC

• Tons of documentation

• TimesTen is undiscovered

– We love the command: ttdestroy

• Flash disk

• Most important: data architecture & design

36

Page 37: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Key message

• Understand your data!

– Save money and time

• Only start engineering if the systems works as engineered

– OBIEE

– Infiniband

– TimesTen

• If and only if you understand:

– Technology (Dashboard, RPD, Query & TT)

– Dimensional Modelling

• Than you can engineer

the best BI system of the world!

37

Page 38: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie

Make customers happy!

38

Page 39: Deploying Extreme Performance and Large-Scale BI · Ondertitel van de presentatie Key message •Understand your data! –Save money and time •Only start engineering if the systems

Integrale logistieke oplossingen

Ondertitel van de presentatie