16
An Introduction to Advanced Analytics and Data Mining Transforming Data Dr Barry Leventhal April 2014 © Copyright BarryAnalytics Limited All Rights Reserved

An Introduction to Advanced analytics and data mining

Embed Size (px)

DESCRIPTION

This presentation introduces the basic ideas of advanced analytics, data mining and the data mining process

Citation preview

Page 1: An Introduction to Advanced analytics and data mining

An Introduction to Advanced Analytics and Data Mining

Transforming Data

Dr Barry Leventhal

April 2014

© Copyright BarryAnalytics Limited All Rights Reserved

Page 2: An Introduction to Advanced analytics and data mining

Agenda

• What are Advanced Analytics and Data Mining?

• The toolkit of data mining techniques

• Some issues to keep in mind

Transforming Data

• Which technique should you use?

Page 3: An Introduction to Advanced analytics and data mining

A process of discovering and interpreting patterns in (often large) data setsin order to solve business problems

What is Data Mining?

Transforming Data

Converts Data into Information

Pattern Information ActionData

Page 4: An Introduction to Advanced analytics and data mining

What is Advanced Analytics?

“Any solution that supports the identification of meaningful

patterns and correlations among variables in complex, structured

and unstructured, historical, and potential future data sets for the

purposes of predicting future events and assessing the

Data

Visualisation

Web

Analytics

Text

Analysis

Transforming Data

purposes of predicting future events and assessing the

attractiveness of various courses of action. Advanced Analytics

typically incorporate such functionality as data mining, descriptive

modelling, econometrics, forecasting, operations research

optimisation, predictive modelling, simulations, statistics and text

analytics.” (Source: Forrester Research)

Data

MiningContact

OptimisationSimulation

Social

Network

Analysis

Page 5: An Introduction to Advanced analytics and data mining

How can Advanced Analytics help?

• By helping companies to increase revenues or reduce costs

increase

revenues

reduce

costs

Transforming Data

improve

profit

Tom Davenport:

“Companies have long used business

intelligence for specific applications, but these

initiatives were too narrow to affect corporate

performance. Now, leading firms are basing their

competitive strategies on the sophisticated

analysis of business data.”

Page 6: An Introduction to Advanced analytics and data mining

Where can Advanced Analytics add Value?

Store location Product

management

Customer

Transportation/Fleet

management

Resource planning

Transforming Data

Customer management Web site

management

Anywhere else where I have

large numbers to manage

planning

Page 7: An Introduction to Advanced analytics and data mining

The Toolkit of Data Mining Techniques

Transforming Data

Traditional Statistics

• Regression Models

• Survival Analysis

• Factor Analysis

• Cluster Analysis

• CHAID

Machine Learning• Rule Induction

• Neural Networks

• Genetic Algorithms

Page 8: An Introduction to Advanced analytics and data mining

Two main types of Analytical Model

Type 1: Models driven by a Target Variable

e.g. Which customers to cross sell?

- Implies building a Predictive Model

- ‘Directed’ Data Mining Techniques

Transforming Data

- ‘Directed’ Data Mining Techniques

Type 2: Models with no Target Variable

e.g. What are our most important customer segments?

- Implies a Descriptive Model

- ‘Undirected’ Data Mining Techniques

Page 9: An Introduction to Advanced analytics and data mining

The Data Mining Process

Transforming Data

Data

Cross Industry Standard Process for

Data Mining

Reference:

Step-by-step data mining guide

CRISP-DM 1.0

Page 10: An Introduction to Advanced analytics and data mining

Some issues to keep in mind:Issue 1: Use an appropriate technique

• Some years ago, the DMA Targeting & Statistics Group held a seminar to explain and compare four analytical techniques:

– Cluster Analysis

– Decision Tree

– Neural Network (supervised)

Transforming Data

– Neural Network (supervised)

– Regression Model

• The four techniques were applied to a sample of lifestyle data in order to predict private healthcare cover

Page 11: An Introduction to Advanced analytics and data mining

Comparison of private healthcare targeting via

four analytical techniques

300

350

400

Some issues to keep in mind:Issue 1: Use an appropriate technique

Transforming Data

100

150

200

250

10% 20% 30% 40% 50%

Cluster Analysis Decision Tree Regression Model Neural Net

Source:

CMT/ DMA Targeting & Statistics Interest Group

Page 12: An Introduction to Advanced analytics and data mining

Issue 2: Modelling and Deploying are separate stages in the data mining process

Historical

Data+ Known

Outcomes

Modelling

Model

Deploying

Transforming Data

• “Modelling” is ‘one-off’ – until model requires rebuild

• “Deploying” takes place repeatedly

Recent

Data+

Deploying

Model

Predictions

Page 13: An Introduction to Advanced analytics and data mining

Issue 3: Do not forget your data!

Your data is the key to gaining value from analytics and modelling – essentials to consider:

• Data quality

Transforming Data

• Data predictivity

• Data integration

• Data governance

Page 14: An Introduction to Advanced analytics and data mining

The Importance of Data Integration

Customer Transactions

Customer

Attributes

Integration

• Business value increased by integrating complementary datasets• New insights may be created by data integration, e.g.

Transforming Data

Market Research Online Behaviour

Integration

Many applications of data integration...

• Predictive models to target behaviours identified by research

• Integration of web, email and traditional offline channels

• Tracking across channels, e.g. Attribution of media effects

Page 15: An Introduction to Advanced analytics and data mining

Which analytical technique should you use ?

The choice generally depends on…

• business problem

• whether problem is predictive or descriptive

• underlying data environment

• variables to be predicted or described

Transforming Data

• variables to be predicted or described

• ability to implement solution

• whether key statistical assumptions hold

• Obtain help from a Statistician or Data Mining Consultant

• All about the problem, not the technique

• Combination of approaches works best

Page 16: An Introduction to Advanced analytics and data mining

Thank you!

Barry Leventhal

Transforming Data

+44 (0)7803 231870

[email protected]