92
Ghislain Fourny Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo

Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Ghislain Fourny

Big Data13. Data Warehousing

fotoreactor / 123RF Stock Photo

Page 2: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

The road to analyticsAurelio Scetta / 123RF Stock Photo

2

Page 3: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Another history of data management(T. Hofmann)

Age of Transactions

Age of Business Intelligence

Age of Big Data

1970s – 2000s

1995 -

2000s -3

Page 4: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Paradigms

OLTP OLAP

vs.

4

Page 5: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OnLine Transaction Processing

ConsistentandReliableRecord-Keeping

5

Page 6: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OnLine Transaction Processing

Transactionsand resultson

small portionsof data 6

Page 7: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OnLine Transaction Processing

Lotsof transactionson

small portionsof data

7

Page 8: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OnLine Transaction Processing

NormalizedData

8

Page 9: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OnLine Analytical Processing

Data-basedDecisionSupport arturaliev / 123RF Stock Photo

9

Page 10: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAP is Big

Possiblymany joins

Large portionsof the data

Few longheavy queries

10

Page 11: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Web analytics

Sales analytics

Management support

Statistical analysis (census)

Scientific databases (e.g., bio-informatics)

OLAP Examples

11

Page 12: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLTP vs. OLAP

DetailedIndividualRecords

HistoricalSummarizedConsolidated

DataAurelio Scetta / 123RF Stock Photo

OLTP OLAP

vs.

12

Page 13: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLTP vs. OLAP

OLTP

Lots of writes

OLAP

Lots of reads

vs.

13

Page 14: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLTP vs. OLAP

OLTP

Small setsof records

OLAP

vs.Analysis

over big chunks

14

Page 15: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLTP vs. OLAP

OLTP OLAP

Slow interactive

vs.

fully interactive(< 1s)

15

Page 16: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLTP vs. OLAP

OLTP OLAP

Consistency

RedundancyRedundancyRedundancy

16

Page 17: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAPAurelio Scetta / 123RF Stock Photo

17

Page 18: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

A data warehouse... is a

subject-orientedintegrated

time-variantnonvolatilecollection of data

in support of management'sdecision-making process

18

Page 19: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Subject-oriented

customers

products

sales

events19

Page 20: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Integrated

20

Page 21: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Time-variant

Timein data warehouses is

paramount(not so in OLTP systems)

21

Page 22: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Time-variant

2016Y-1Y-2Y-3Y-4Y-5Y-6Y-7Y-8Y-9

Often past 5-10 years22

Page 23: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Non-volatile

Milosh Kojadinovich / 123RF Stock Photo

Load.Access.Period.

no updates

23

Page 24: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Architecture

ERP

CRM

OLTP

ETL

Files

Analyze

Report

Mine

24

Page 25: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAP: Redundancy

Materializedviews

(denormalized)25

Page 26: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

1st Normal Form (tabular) – The Key

26

Page 27: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

2nd Normal Form (not joined) – The Whole Key

27

Page 28: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

3rd Normal Form – Nothing But The Key

28

Page 29: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Why materialize?

Operational data sourcesare too heterogeneous 29

Page 30: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAP: Special-purpose indices

30

Page 31: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAP: Derived data

31

Page 32: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Querying OLAP

vs.Continuous

monitoring/tracking0

1

2

3

4

5

6

Category 1 Category 2 Category 3 Category 4

Slow interactive

Series 1 Series 2 Series 3

1 - 10s hours

32

Page 33: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Summary of differencesOLTP OLAP

Source Original (operational) Derived (consolidated)Purpose Business tasks Decision supportInterface Snapshot Multidimensional viewsWriting short and fast, by end user period refreshes, by batch jobsQueries Simple, small results Complex and aggregatingDesign Many normalized tables Few denormalized cubesPrecision ACID Sampling, confidence intervalsFreshness Serializability ReproducibilitySpeed Very fast Often slowOptimization Inter-query Intra-querySpace Small, archiving old data Large, less space efficientBackup Very important Re-ETL

33

Page 34: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Data Model

34

Page 35: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Data Cubes

Data is stored in

multidimensionalhypercubes

35

Page 36: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Data Cubes

Year

36

Page 37: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Data Cubes

Country

37

Page 38: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Data Cubes

Product38

Page 39: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Fact

2016CH

Server

39

Page 40: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Where

What?

Who? etc.

Dimensions Which currency?

When?

40

Page 41: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Fact tableWhere?

Germany 2016 Peter 1,000$

Germany 2015 Mary 15,000$

Switzerland 2016 Mary 1,500$

Switzerland 2015 Peter 3,000$

Australia 2015 Peter 6,000$

China 2015 Mary 1,000$

41

Page 42: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

AggregationWhere?

Germany 2016 Peter 1,000$

Germany 2015 Mary 15,000$

Switzerland 2016 Mary 1,500$

Switzerland 2015 Peter 3,000$

Australia 2015 Peter 6,000$

China 2015 Mary 1,000$

42

Page 43: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Aggregation

43

Page 44: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

AggregationWhere?

Germany 2016 Peter 1,000$

Germany 2015 Mary 15,000$

Switzerland 2016 Mary 1,500$

Switzerland 2015 Peter 3,000$

Australia 2015 Peter 6,000$

China 2015 Mary 1,000$

44

Page 45: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Aggregation

2016 Peter 1,000$

2015 Mary 16,000$

2016 Mary 1,500$

2015 Peter 9,000$

45

Page 46: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Slicing

46

Page 47: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Slicers and Dicers

Slicers Dicers47

Page 48: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Slicers and Dicers

Slicers Dicers

Usually between1 and 3 dicers,

often 2

48

Page 49: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Slicers and DicersServers

World

USD

Slicers

49

Page 50: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Slicers and Dicers

2014 2015 2016

Peter 1,000,000$ 1,500,000$ 1,400,000$

Mary 2,000,000$ 2,300,000$ 2,200,000$

Servers

World

USD

Dicers

Slicers

50

Page 51: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Products: the big three

Essbase

Cognos

Analysis Services

51

Page 52: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

ETLing

52

Page 53: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAP: Derived data

53

Page 54: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

OLAP: Derived data

ETL

54

Page 55: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

ETL

ExtractTransformLoad

55

Page 56: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Extract

Triggers Gateways

Incremental updates Log extraction56

Page 57: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Transform

Derivation Value transformation

Herr

Mister

CleaningFilter, split, merge, join

57

Page 58: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Load

Integrity constraints Sorting

Build indices Partition

58

Page 59: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Considerations

When?Granularity

Infrastructure59

Page 60: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Implementation

60

Page 61: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Two flavors of OLAP

ROLAP MOLAP61

Page 62: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Fact table (ROLAP)

Dim1 Dim2 Dim3 Dim4 Dim5 Value

62

Page 63: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Star Schema

Dim1 Dim2 Dim3 Dim4 Dim5 Value

63

Page 64: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Snow-flake schema

Dim1 Dim2 Dim3 Dim4 Dim5 Value

NormalizeMore

64

Page 65: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Querying

65

Page 66: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Querying cubes

Tables:SQL

Cubes:MDX

66

Page 67: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

MDX stands for...

Multi-DimensionaleXpressions

67

Page 68: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Measures

Amount of licenses

Revenues

Taxes paid

...

68

Page 69: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Dimensions

Quarter

Salesperson

Product

Country

69

Page 70: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

In short...

A cube is a list of

dimensionsindexing a list of

measures

70

Page 71: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Hierarchies

Dimension values are organized in hierarchies.

[Location]

[Geo] [Economy]i.e., slice and aggregateby geographic region, etc

i.e., slice and aggregateby economic partnership, etc

71

Page 72: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Members

Members correspond to levels in a hierarchy.

[Geo][Europe] [Asia] [America]

[Switzerland][ZH][BE]

[Germany]...

[China][India]...

[Canada][USA][Brazil]...

[Africa][Ocenia]

72

Page 73: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Identifying a member

[Location].[Geo].[Europe].[Switzerland].[ZH].[Zurich]

73

Page 74: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Tuples

([Location].[Geo].[Europe].[Switzerland].[ZH].[Zurich],[Salesmen].[People].[John],[Time].[Year].[2016].[Q4])

A list of members

Associated with a dimensionality(list of hierarchies)

([Location].[Geo],[Salesmen].[People][Time].[Year])

74

Page 75: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Sets

{([Location].[Geo].[Europe].[Switzerland].[ZH].[Zurich],[Salesmen].[People].[John],[Time].[Year].[2016].[Q4]),

([Location].[Geo].[Europe].[Switzerland].[BE].[Bärn],[Salesmen].[People].[Mary],[Time].[Year].[2016].[Q4]),

([Location].[Geo].[Europe].[Germany].[Berlin],[Salesmen].[People].[John],[Time].[Year].[2016].[Q3])}

A set of tuples with same dimensionality

75

Page 76: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

MDX statements: dicing

SELECT[Measures].Members ON COLUMNS,[Location].[Geo].Members ON ROWS

FROM [Sales]

76

Page 77: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

MDX statements: slicing

SELECT[Measures].Members ON COLUMNS,[Location].[Geo].Members ON ROWS

FROM [Sales]

WHERE [Products].[Line].[Laptops].[MBP]

77

Page 78: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Syntax

78

Page 79: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

XBRL Architecture

Instance (.xml)

Schema (.xsd)

Linkbase (.xml)

Discoverable Taxonomy Set79

Page 80: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Technologies

XML

XML SchemaXML Link

XML Names

80

Page 81: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Fact

<us-gaap:AssetscontextRef="FI2012Q4"decimals="-6"id="Fact-600212FD4D06E63B4F8F6874C6E5BE74"unitRef="usd">86174000000

</us-gaap:Assets>

Dimension ValueWhat? AssetsWho? Coca ColaWhen? Dec 31, 2011Of what? USD

81

Page 82: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Context

<xbrli:context id="FI2011Q4"><xbrli:entity><xbrli:identifier scheme="http://www.sec.gov/CIK">0000021344

</xbrli:identifier></xbrli:entity><xbrli:period><xbrli:instant>2011-12-31</xbrli:instant>

</xbrli:period></xbrli:context>

December 2011

1 2 3 4

5 6 7 8 9 10 11

12 13 14 15 16 17 18

19 20 21 22 23 24 25

26 27 28 29 30 31

82

Page 83: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Unit

<xbrli:unit id="usd"><xbrli:measure>iso4217:USD</xbrli:measure>

</xbrli:unit>

83

Page 84: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Concept (XML Schema)

<xs:elementid='us-gaap_Assets'name='Assets'nillable='true'substitutionGroup='xbrli:item'type='xbrli:monetaryItemType'xbrli:balance='debit'xbrli:periodType='instant' />

84

Page 85: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Graphs

85

Page 86: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

DAGs

86

Page 87: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Trees

87

Page 88: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Node: locator

<locxlink:href="http://xbrl.fasb.org/us-gaap/2013/elts/us-

gaap-2013-01-31.xsd#us-gaap_Assets"xlink:label="loc_us-

gaap_Assets_102D7A4D204ED45AC0DEDA6BBC78F386"xlink:type="locator" />

88

Page 89: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Node: resource

<link:labelid="lab_ko_NetChangeInOperatingAssetsAndLiabilitiesDisclosureAbstract_A6469A522E35CBF355816876394722EE_label_en-US"xlink:label="lab_ko_NetChangeInOperatingAssetsAndLiabilitiesDisclosureAbstract_A6469A522E35CBF355816876394722EE"xlink:role="http://www.xbrl.org/2003/role/label"xlink:type="resource"xml:lang="en-US">NET CHANGE IN OPERATING ASSETS AND LIABILITIES

DISCLOSURE [Abstract]</link:label>

89

Page 90: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Edge

<presentationArcorder="10"preferredLabel="http://www.xbrl.org/2003/role/totalLabel"xlink:arcrole="http://www.xbrl.org/2003/arcrole/parent-child"xlink:from="loc_us-

gaap_AssetsAbstract_2F55ECB2BF7C1A62009CDA6BBC757094"xlink:to="loc_us-

gaap_Assets_102D7A4D204ED45AC0DEDA6BBC78F386"xlink:type="arc" />

90

Page 91: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Summary

91

Page 92: Ghislain Fourny Big Data - Systems Group · 2016-12-12 · Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo. The road to analytics Aurelio Scetta / 123RF Stock Photo

Architecture

ERP

CRM

OLTP

ETL

Files

Analyze

Report

Mine

92