21
Analytics – Infrastructure, Platforms and Methods. Feyzi Bagirov 26 Jan 2015 1

Analytics infrastructure, platforms and methods

Embed Size (px)

Citation preview

Page 1: Analytics   infrastructure, platforms and methods

1

Analytics – Infrastructure, Platforms and Methods.

Feyzi Bagirov26 Jan 2015

Page 3: Analytics   infrastructure, platforms and methods

3

Data Mining ◦ Retail Use cases◦ Data Mining Process

Data Mining Methodologies

Data◦ Data Training◦ Types of Business Information Systems◦ Data Warehouses◦ Data Mining Tools◦ Data Visualization Tools◦ Big Data

Data Mining

A

B

CD

E

Page 4: Analytics   infrastructure, platforms and methods

4

Machine Learning is a scientific discipline that explores the construction and study of algorithms that can learn from data (Ron Kovahi; Foster Provost (1998). “Glossary of terms”.

Data Mining is the process of achieving Machine Learning.

What is Data Mining?

A

B

CD

E

Page 5: Analytics   infrastructure, platforms and methods

5

Response modeling for direct marketing Uplift modeling for direct marketing Customer retention with churn modeling Churn uplift modeling

Retail Use Cases

A

B

CD

E

Page 6: Analytics   infrastructure, platforms and methods

6

Use Case 1 – Response Modeling For Direct Marketing

Lifeline Screening: Response up 38%, cost down 20%, 62K more customers annually

PREMIER Bankcard: Direct mail response up 3-5%

Sun Microsystems: Doubled the number of leads per phone call

A

B

CD

E

Based on the past experience, who will respond tomorrow?

Page 7: Analytics   infrastructure, platforms and methods

7

Use Case 2- Uplift Modeling for Direct Marketing

A

B

CD

E

Page 8: Analytics   infrastructure, platforms and methods

8

Use Case 2- Uplift Modeling for Direct Marketing

A

B

CD

E

Leading financial institution: incremental conversion up 0.02% to 0.43%; Revenue per contact up by over 20 times

Page 9: Analytics   infrastructure, platforms and methods

9

Use Case 3 – Customer Retention With Churn Modeling

Reed Elsevier’s Caterer & Hotelkeeper: Reduced churn by 16%; Retention ROI up by 10%

PREMIER Bankcard: $8 million est. retained

Leading North American Telecom: Identified customers with a 600% increased risk of churn with social network analysis.

Optus (Australian telecom): Doubled churn model performance with social data

A

B

CD

E

Page 10: Analytics   infrastructure, platforms and methods

10

Use Case 4 – Churn Uplift Modeling

A

B

CD

E

Telenor: Reduced churn 36%; Cost-of-contact down 40%; Campaign ROI up 11-fold

US Bank: Costs down 40%, lift up 2 times, and cross-sell ROI up 5 times

Page 11: Analytics   infrastructure, platforms and methods

11

Data Mining ProcessCRISP-DM

(Cross Industry Standard Process for Data Mining)

SEMMA(Sample, Explore, Modify, Model, Assess)

A

B

CD

E

Page 12: Analytics   infrastructure, platforms and methods

12

Business Task

Data Set

Data Preparation

Data cleaning

ModelingEvaluation

and validation

Use of DM results/deplo

yment

Results of action based

on DM results

Development

Data Mining Process

Strategic Objectives

Operational

Objectives

Marketing Objectives

Other Objectives

A

B

CD

E

Page 13: Analytics   infrastructure, platforms and methods

13

Supervised (data training) and unsupervised methods

Age: 25-35Gender: MaleMarital Status: MarriedEducation: Graduate

Historically

Historically

Training Data

Unknown DataPrediction

Superv

ised

Unknown Data

Historically

Unsu

perv

ised

Page 14: Analytics   infrastructure, platforms and methods

14

Transactional vs. Analysis-Based Systems

Transactional Information Systems

Analysis-Based Information Systems

Page 15: Analytics   infrastructure, platforms and methods

15

Data Warehouses

Data Warehouse 4 main features:• Topical Orientation (customer, product, etc.)• Logical integration and homogenization (relational integration)• Presence of a reference period (vs operational)• Low volatility (should not change often)

3 components of Data Warehouses:• DBMS (Database Management System)• DB (Database)• DBCS (Database Communication System)

Snowflake Star

Page 16: Analytics   infrastructure, platforms and methods

16

Data Marts

Page 17: Analytics   infrastructure, platforms and methods

17

Data Mining Tools

Page 18: Analytics   infrastructure, platforms and methods

18

Data Visualization Tools

Page 19: Analytics   infrastructure, platforms and methods

19

What is Big Data?

1. Velocity2. Variety3. Volume

Page 20: Analytics   infrastructure, platforms and methods

20

Q&A?

Page 21: Analytics   infrastructure, platforms and methods

21