12
Deploying Microsoft SQL Server Big Data Clusters with HPE Synergy and Nimble storage Maurice De Vidts SQL Engineer November 5, 2019

Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

  • Upload
    others

  • View
    21

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

Deploying Microsoft SQL Server Big Data Clusters with HPE Synergy and Nimble storage

Maurice De VidtsSQL EngineerNovember 5, 2019

Page 2: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

Session agenda

– What are SQL Server Big Data Clusters?

– Key functionality and use cases

– HPE Design

– HPE Synergy

– HPE Nimble

– Deployment steps

– Scaling the solution

– Resources and call to action

2

Page 3: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

What are SQL Server Big Data clusters?

3

New feature of SQL Server 2019

- Native, scalable HDFS storage and spark processing

- Virtual data hub for external data source integration (Polybase)

- Ability to manage and integrate both relational and big data in one platform

- AI and ML tools

Page 4: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

SQL Server Big Data cluster scenarios

4

Managed SQL Server, Spark, and data lake

SQL Server

Data virtualization

T-SQLAnalytics Apps

Open database

connectivity

NoSQL Relational databases

HDFS

Complete AI platform

SQL Server External Tables

Compute pools and data pools

Spark

Scalable, shared storage (HDFS)External

data sources

Admin portal and management servicesIntegrated AD-based security

SQL ServerML Services

Spark &Spark ML

HDFS

REST API containers for models

Page 5: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

SQL Server Big Data Cluster Architecture – at a glance

Compute pool

SQL Compute Node

SQL Compute Node

SQL Compute Node…

Compute pool

SQL Compute Node

IoT data

Directly read from

HDFS

Persistent storage

Storage pool

SQL ServerSpark

HDFS Data Node

SQL ServerSpark

HDFS Data Node

SQL ServerSpark

HDFS Data Node

Kubernetes pod

AnalyticsCustomapps BI

SQL Server master instance

Node Node Node Node Node Node Node

SQL

Data mart

SQL Data Node

SQL Data Node

Compute pool

SQL Compute Node

Storage Storage

Control Plane

Controller Svc

Configuration Store (SQL Server)

Grafana

Elastic Search

Azure FSM Engine

InfluxDB

Kibana

Page 6: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

HPE Design - Synergy

6

• Software Defined Infrastucture

• Flexible

• Easy to manage

• Scalable

• Ideal for any workload

• Fast deployment

• OneView management

• Automates infrastructure operations

Page 7: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

HPE Design – Nimble storage

7

• Reliable

• Efficient storage

• Predictive

• Scalable

• Six Nines

• Store more TB

• Elminates App Gap

Page 8: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

Node sizing and mapping

– Rough node sizing

8

Standard Medium

Instances Cores Memory (GB) Data Instances Cores Memory Data

SQL Server master 3 16 192-1024 1TB 3 64 1024 10TB

Compute pool 4 4 96 N/A 8 16 192 N/A

Data pool 4 4 96 1TB 8 16 192 10TB

Storage pool w/ Spark

8 8 192 10TB 24 32 1024 100TB

Spark pool 4 4 96 N/A 8 16 1024 N/A

Storage pool w/o Spark

8 4 96 10TB 24 16 192 100TB

Control/Management services

1 4 192 100GB 1 16 192 500GB

Page 9: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

High level deployment steps

9

HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform

Nimble CSI dynamic driver

Key deployment steps

1) Image Servers2) Setup prerequisites (DNS, NTP etc)3) Install docker4) Install container management <Kubernetes, open shift>5) Create the storage classes for each kid of storage device (SAN, Local, etc)6) Edit BDC configuration files to reflect node count, storage sizes7) Annotate and label nodes 8) Run AZDATA utility to install SQL Server Big Data Cluster

Page 10: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

HPE Design – Scaling the solution

10

Page 11: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

Resources and call to action!

– Tested HPE reference architectures and reference configurations

– Come visit our booth and talk to us about HPE, Big Data clusters or SQL performance in general

11

Page 12: Deploying Microsoft SQL Server Big Data Clusters with HPE ... · HPE -> OpenShift Container Platform -> MS SQL BDC = Resilient, scalable and manageable data platform Nimble

HPE Data and Analytics Reference Architectures

12

– HPE Reference Configuration for Red Hat OpenShift Container Platform on HPE Synergy Composable Infrastructure

– HPE Reference Architecture for Microsoft SQL Server 2017 on Linux with HPE 3PAR StoreServ Storage using compression

– HPE Reference Architecture for Microsoft SQL Server 2014 on HPE SimpliVity 380

– HPE Reference Architecture for Microsoft SQL Server 2016 Operational Analytics on the HPE ProLiant DL580 Gen9

– HPE Reference Architecture for Microsoft SQL Server 2016 on HPE Synergy

– HPE Reference Configuration for Microsoft SQL Server 2016 on HPE ProLiant for Microsoft Azure Stack

– Microsoft SQL Server 2017 OLTP Workloads on HPE MSA 2052 Storage

Check out all Data and Analytics Reference Architectures: www.hpe.com/info/dm-RA