SQL Track: SQL Server unleashed meet SQL Server's extreme sides

Preview:

DESCRIPTION

This session is a special one, and yes because of the subject matter but also because of the set-up of the session. It is split in two mini-sessions, one about New Technologies and an introduction to Parallel Datawarehouse: New Technolgies: This part of the session is all about the discovering the extremes of SQL Server. First we will talk about the new SQL Servers In-memory technologies the updatable ColumnStore and Heckaton technology, both pushing the boundaries of SMB machines far beyond what we taught possible 3 years ago. SQL Server PDW: With the new SQL Servers In-memory technologies SQL Server pushes the SMB machines far, for some of us these boundaries are still to close for comfort. So meet the scalable version of SQL Server, obliterating the limits of SMB machines. This is an introduction to SQL Server PDW, the next step in the (r)evolution of SQL Server, capable of running high performance data warehouse queries on big data even offering seamless integration with Hadoop using PolyBase.

Citation preview

SQL Server unleashed: Meet SQL Server's Extreme sidesKarel Coenye

The world of data is changing

New Questions, More data

Do more with less

Previous Limitations

Urban Myths

But… Do you know the challenges

Data is the Key

PDW vs. SMB

SMB on Steroids

OLTP

Hekaton

SMB

DWH’s

Next Gen DWH Performance

Updateble ColumnStore

Enter Big Data

Hadoop

ScaleStandard Enterprise Fasttrack PDWReliable SMB Reliable Business

Critical SMBReference Architecture

High End MPP DWH

Needs Maintenance hours

Online Maintenance24/7/365

Based upon Enterprise edition

High end Data marts and EDWs

Software Only Sofware Only Architecture (hard and software)

Appliance

Scale Up Scale Up Scale Up DWH Scale outOLTP OLTP / /Small DWH DWH up to 10’s of

TBData Marts and small to midsize DWH

Up to PB’s

MPP - PDW

MPP - APS

Hardware

Virtualization

Yes… But does it work with Excel…

Polybase

Control Node Compute Node

Name Node Data Node

PDW Appliance

Compute Node

Data Node

Data Node

Data Node

Data Node

Data Node

Data Node

Data Node

Hadoop Cluster

Ok sounds cool, but what does it do

Control Node

Compute Node

Compute Node

Compute Node

Compute Node

Name Node

Data Node

Data Node

Data Node

Data Node

PDW

Ok so you said it’s fast… but now show me

Load performance

PDW

DEV SQL -> SQL

PROD TeraData -> SQL

0 10 20 30 40 50 60 70

Loading 100 milion rows in minutes (shorter is better)

Data Pumps• Reading 132 MB/s from disk = 8 GB

per minute• Reading 2 DVDs per minute

Scaling• Scales Lineair– Demo PDW has only 2 units

• There is a pdw development edition– but it is a developer appliance! For an

msdn ultimate subscription, there is 1 pdw developer license.

Future Proof• DWLoader is the fastest load mechanism• Transformations can be done using CTAS

statements• Loading from remote server:

– Any remote server connected with infiniband switch– Multiple servers allowed

• Pollybase– Ready for big data

• Can use existing SSIS

DEMO

Follow Technet Belgium@technetbelux

Subscribe to the TechNet newsletteraka.ms/benews

Be the first to know

Belgium’s biggest IT PRO Conference