SAS Modernization architectures - Big Data Analytics

Copyright © 2013, SAS Institute Inc. All rights reserved.

IT STRATEGY FOR SCALABLE ANALYTICS, MODERN DATA ARCHITECTURES


MODERN ARCHITECTURES


STUNNING FACT

Making the Modern World: Materials and Dematerialization - Vaclav Smil


Scarcity

• Technology constrained

• Process-centric

• Focus on cost control

Everything is forbidden unless it is permitted

Abundance

• Focus on value

• Discovery-centric

• Technology empowered

Everything is permitted unless it is forbidden

Shift in Mindset


Trends Big Data, Storage, Hadoop & In-memory Technology

Cost of Storage, Memory, Computing

• In 2000 a GB of disk cost $17; today < $0.07

• In 2000 a GB of RAM cost $1,800; today < $1

• In 2009 a TB of RDBMS cost $70K; today < $20K

[Chart: Cost per Terabyte, 2009 vs. today, for Vertica, Teradata, Greenplum, Oracle, Microsoft PDW and Hadoop; axis from $0 to $100,000]
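
The cost figures quoted on this slide can be turned into decline factors and approximate yearly decline rates. The following is a minimal sketch, not part of the original deck. It assumes that "today" means 2013 (taken from the copyright year), and the names PRICES, TODAY, decline_factor, annual_decline_rate and summarize are illustrative. Because the "today" prices are upper bounds, the results are lower bounds.

```python
# Price points quoted on the slide:
# item -> (old price in USD, upper bound on today's price in USD, base year)
PRICES = {
    "GB of disk": (17.0, 0.07, 2000),
    "GB of RAM": (1800.0, 1.0, 2000),
    "TB of RDBMS": (70_000.0, 20_000.0, 2009),
}

# Assumption: "today" on the slide is the copyright year of the deck.
TODAY = 2013


def decline_factor(old_price, new_price):
    """How many times cheaper the new price is compared with the old one."""
    return old_price / new_price


def annual_decline_rate(old_price, new_price, years):
    """Constant yearly price drop that takes old_price to new_price in `years`."""
    return 1.0 - (new_price / old_price) ** (1.0 / years)


def summarize(prices, today):
    """Return (item, decline factor, annual decline rate) for each price point."""
    rows = []
    for item, (old, new, year) in prices.items():
        rows.append(
            (item, decline_factor(old, new), annual_decline_rate(old, new, today - year))
        )
    return rows


for item, factor, rate in summarize(PRICES, TODAY):
    print(f"{item}: at least {factor:,.0f}x cheaper, at least {rate:.0%} per year")
```

The yearly rate is a compound rate, so it is comparable across the 13-year and 4-year windows even though the total decline factors are not.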

THE PERFECT STORM: STORAGE TECHNOLOGY COSTS AND CPU SPEED


MODERN REALITY

Infrastructure

• Commoditization

• Architectures

• Scale

Data

• New Complex Streams

• Perishable Considerations

• Cost

Analytics

• New Category of Business Problems

• Analytical Algorithms

• Operationalization

Copyright © 2011, SAS Institute Inc. All rights reserved.

Finding treasures in unstructured data like social media or survey tools that could uncover insights about consumer sentiment

Mine transaction databases for spending patterns that indicate a stolen card

Leveraging historical data to drive better insight into decision-making for the future

Analyze massive amounts of data in order to accurately identify areas likely to produce the most profitable results

FORECASTING

DATA MINING

TEXT ANALYTICS

OPTIMIZATION

STATISTICS

ADVANCED ANALYTICS

INFORMATION MANAGEMENT


CURRENT TRENDS IN ANALYTICS

Complex Business Problems Are Driving Analytics Innovation

Speed Will Be of the Essence

Leverage Analytics To Unlock The Information Contained In Unstructured Data

Operationalizing Analytics


CURRENT AND FUTURE ARCHITECTURES


WHERE ARE WE TODAY?

SETTING THE SCENE

Operational Data Sources

EDW

Data Mart

Data Mart

Analytic Mart

Analytic Mart

BI and Analytics

Unstructured, semi-structured and streaming data (e.g. sensor data) is often handled outside the warehouse flow


WHERE DOES HADOOP FIT?

HADOOP AS A “NEW DATA” STORE

Operational Data Sources

EDW

Data Mart

Data Mart

Analytic Mart

Analytic Mart

BI and Analytics


WHERE DOES HADOOP FIT?

HADOOP AS AN ADDITIONAL INPUT TO THE EDW

Operational Data Sources

EDW

Data Mart

Data Mart

Analytic Mart

Analytic Mart

Analytic Mart

Data Mart

BI and Analytics


WHERE DOES HADOOP FIT?

HADOOP DATA PLATFORM AS A “STAGING LAYER” AS PART OF A “DATA LAKE” – Downstream stores could be Hadoop, data appliances or an RDBMS

Data Mart

Operational Data Sources

EDW

Data Mart

Analytic Mart

Analytic Mart

BI and Analytics
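
The staging-layer arrangement named on this slide can be sketched as a small directed graph. The slide's diagram did not survive extraction, so the wiring below is an assumption built only from the slide title and its box labels. The node name "Hadoop staging layer" and the names FLOW and downstream are hypothetical and not taken from the deck.

```python
from collections import deque

# Hypothetical edges: the original diagram is not available, so this wiring is
# an assumption. It places Hadoop between the operational sources and the
# downstream stores, as the slide title suggests.
FLOW = {
    "Operational Data Sources": ["Hadoop staging layer"],
    "Hadoop staging layer": ["EDW", "Data Mart", "Analytic Mart"],
    "EDW": ["Data Mart", "Analytic Mart"],
    "Data Mart": ["BI and Analytics"],
    "Analytic Mart": ["BI and Analytics"],
    "BI and Analytics": [],
}


def downstream(flow, start):
    """Return every node reachable from `start`, in breadth-first order."""
    seen = {start}
    order = []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in flow.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                order.append(nxt)
                queue.append(nxt)
    return order


# Everything that receives data, directly or indirectly, from the sources.
print(downstream(FLOW, "Operational Data Sources"))
```

Under this assumed wiring every store sits downstream of the staging layer, which is the point of the "data lake" variant: raw data lands in Hadoop first and is distributed from there.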


SAS BIG DATA STRATEGY – SAS AREAS


SAS & HADOOP: SAS® WITHIN THE HADOOP ECOSYSTEM

Diagram labels:

Layers: User Interface, Metadata, Data Access, Data Processing, File System

Users: SAS® User, Next-Gen SAS® User

SAS components: SAS® Visual Analytics, SAS® Enterprise Miner™, SAS® Data Integration, SAS® Enterprise Guide®, SAS® In-Memory Statistics for Hadoop, SAS Metadata, Base SAS & SAS/ACCESS® to Hadoop™, In-Memory Data Access, SAS Embedded Process Accelerators, MPI Based, SAS® LASR™ Analytic Server, SAS® High-Performance Analytic Procedures

Hadoop components: Impala, Hive, Pig, Map Reduce, HDFS


IDENTIFY / FORMULATE PROBLEM

DATA PREPARATION

DATA EXPLORATION

TRANSFORM & SELECT

BUILD MODEL

VALIDATE MODEL

DEPLOY MODEL

EVALUATE / MONITOR RESULTS

IN SUMMARY SAS ENABLES THE ENTIRE LIFECYCLE AROUND HADOOP

SAS Visual Analytics, SAS Visual Statistics, SAS In-Memory Statistics for Hadoop

Done using either the Data Preparation, Data Exploration or Build Model Tools

SAS High Performance Analytics Offerings supported by relevant clients like SAS Enterprise Miner, SAS/STAT etc.

Decision Manager

SAS Scoring Accelerator for Hadoop, SAS Code Accelerator for Hadoop

SAS Visual Analytics, Decision Manager

Done using either the Data Preparation, Data Exploration or Build Model Tools


SAS® VISUAL ANALYTICS: A SINGLE SOLUTION FOR DATA DISCOVERY, VISUALIZATION, ANALYTICS AND REPORTING


SAS® VISUAL ANALYTICS

EXAMPLE: TEXT ANALYSIS GIVES YOU INSIGHT INTO CUSTOMER EXPERIENCE AND OPINION

VISUALIZATION POWERED BY SAS ANALYTICS

Analytics applied to text provides real MEANING


VISUALIZATION EXAMPLES


SAS® VISUAL STATISTICS


DATA TO DECISION LIFECYCLE

SAS® Visual Statistics

TEXT

COMPETITIVE ADVANTAGE

MANAGE DATA

EXPLORE DATA

DEVELOP MODELS

DEPLOY & MONITOR


APPLICATION AREAS

Segmentation

Classification

Prediction

Ad-hoc Discovery

Data Preparation


SAS IN-MEMORY STATISTICS FOR HADOOP


SAS® IN-MEMORY STATISTICS FOR HADOOP

WHY IS IT IMPORTANT?

SPEED: Memory and data efficient, significantly reducing data latency so that large and complex data in Hadoop can be analyzed rapidly

PRECISION: Proven state-of-the-art statistical algorithms and machine learning techniques

INTERACTIVE: Multi-user interactive analytics environment for increased productivity

SCALABLE: Highly scalable, in-memory environment that grows easily as needed

sas.com
