Data Platform Airlift, 21 October, Microsoft Lisbon Experience

What’s new in the Azure Data Platform

Ricardo Peres, Luis Calado



Azure DocumentDB

Azure Search

Azure Machine Learning Marketplace

Azure SQL Database

Azure Data Lake

Azure Data Factory


Agenda

Headlines

Core Concepts

Resources

Indexes

Querying

Paging

Updating

Transactions

Partition Resolvers

User Defined Functions

Stored Procedures

Triggers

Security

Limits

Search

Best Practices


Headlines

NoSQL database as a service for JSON documents

Schemaless

RESTful

Part of Azure – only available online

Highly scalable

Several bindings (.NET, JavaScript, Python, ...)


Core Concepts


Resources (1 of 3)

Documents that live in DocumentDB

All have a unique addressable URL (_rid or id):

https://{account}.documents.azure.com/dbs/{_rid-db}/colls/{_rid-col}/docs/{_rid-doc}

All live inside a collection

A collection lives inside a database

A database belongs to an account

A collection can take different kinds of documents
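The addressing scheme on this slide can be sketched as a small helper. This is only an illustration: the function name `documentUrl`, its parameter names and the sample identifiers are made up; only the URL pattern comes from the slide.

```javascript
// Builds a document URL following the pattern on the slide:
// https://{account}.documents.azure.com/dbs/{_rid-db}/colls/{_rid-col}/docs/{_rid-doc}
// `documentUrl` and its parameters are hypothetical names used for illustration.
function documentUrl(account, dbRid, collRid, docRid) {
  const path = ["dbs", dbRid, "colls", collRid, "docs", docRid].join("/");
  return "https://" + account + ".documents.azure.com/" + path;
}

// Example with made-up identifiers
const url = documentUrl("myaccount", "db1", "coll1", "doc1");
console.log(url);
```

The nesting of the path segments mirrors the hierarchy: account, then database, then collection, then document.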


Resources (2 of 3)

Either POCOs or inherit from Resource

Some built-in properties:

Property | User Settable | Purpose
_rid | No | System generated, unique and hierarchical identifier
_etag | No | HTTP etag required for optimistic concurrency control
_ts | No | Last updated timestamp
_self | No | Unique addressable URL
id | Yes | User defined unique name

If an id property is not specified, one will be provided (Guid)

Case matters!


Resources (3 of 3)

Can have attachments:

https://{account}.documents.azure.com/dbs/{_rid-db}/colls/{_rid-col}/docs/{_rid-doc}/attachments/{_id-attch}

Additional properties:

Property | User Settable | Purpose
contentType | Yes | The content type of the attachment
media | Yes | The URL link or file path where the attachment resides


Indexes (1 of 2)

Consistency can be configured per collection:

Consistent: indexes are updated synchronously

Lazy: indexes are updated asynchronously

None


Indexes (2 of 2)

By default, all paths are indexed; this can be overridden

Three kinds of property indexes:

Hashed: for exact matches

Range: for range comparisons, ordering

Spatial: for geospatial queries

Three kinds of property value indexes (from JSON):

String (precision: 1-100 or -1)

Number (precision: 1-8 or -1)

Point

A collection can have several indexes at once

If a collection does not have an index, it cannot be queried except by id or self link!
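As a rough illustration of the index settings above, here is a sketch of an indexing policy object with a small check for the precision ranges listed on the slide. The property names (`indexingMode`, `includedPaths`, `kind`, `dataType`, `precision`) follow the indexing policy JSON as I understand it and should be treated as assumptions to verify against the official reference. The excluded path and the helper `isValidPrecision` are invented for this example.

```javascript
// Sketch of an indexing policy. Property names are assumptions to be checked
// against the service documentation; the "/notes/*" path is made up.
const indexingPolicy = {
  indexingMode: "lazy", // one of: consistent, lazy, none
  automatic: true,
  includedPaths: [
    {
      path: "/*",
      indexes: [
        { kind: "Range", dataType: "Number", precision: -1 },
        { kind: "Hash", dataType: "String", precision: 3 }
      ]
    }
  ],
  excludedPaths: [{ path: "/notes/*" }]
};

// Precision ranges taken from the slide: String 1-100, Number 1-8, or -1.
const PRECISION_LIMITS = { String: 100, Number: 8 };

// Hypothetical helper: true when the precision is allowed for the data type.
function isValidPrecision(dataType, precision) {
  if (precision === -1) return true; // -1 is accepted for any type
  const max = PRECISION_LIMITS[dataType];
  return max !== undefined && Number.isInteger(precision) &&
    precision >= 1 && precision <= max;
}

// Check every index declared in the policy above.
const allValid = indexingPolicy.includedPaths.every(p =>
  p.indexes.every(i => isValidPrecision(i.dataType, i.precision)));
console.log(allValid);
```

Lower precision values make the index smaller at the cost of more work at query time, which is why the best-practice advice to "watch out for ranges" matters.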


Querying – SQL (1 of 3)

Returns JSON

Joins only inside document (collections)

No comparison of different data types (undefined)

Math: +, -, *, /, %

Bitwise: |, &, ^, <<, >>, >>>

Logical: AND, OR, NOT

Comparison: =, !=, <, >, <=, >=, <>

String: ||

Ternary and coalesce: ?, ??

IN, BETWEEN, ORDER BY

Parameterized – no SQL injection
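A minimal sketch of what "parameterized" means in practice: the query text stays constant and user input travels separately as a named parameter. The object shape (`query` plus a `parameters` array of `name`/`value` pairs) is the query specification shape the SDKs accept as far as I know; the function name `familyByLastName` and the `Families` data are illustrative.

```javascript
// Builds a parameterized query specification. The user-supplied value is
// never concatenated into the SQL text, so it cannot change the query.
// `familyByLastName` is a hypothetical helper for illustration.
function familyByLastName(lastName) {
  return {
    query: "SELECT * FROM Families f WHERE f.lastName = @lastName",
    parameters: [{ name: "@lastName", value: lastName }]
  };
}

// Even a hostile input only ends up as a parameter value.
const spec = familyByLastName('Andersen" OR true');
console.log(JSON.stringify(spec, null, 2));
```

The resulting object would be passed to the query call in place of a plain SQL string.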


Querying – SQL (2 of 3)

SQL functions:

Math: ABS, CEILING, EXP, FLOOR, LOG, LOG10, POWER, ROUND, SIGN, SQRT, SQUARE, TRUNC, ACOS, ASIN, ATAN, ATN2, COS, COT, DEGREES, PI, RADIANS, SIN, TAN

Type checking: IS_ARRAY, IS_BOOL, IS_NULL, IS_NUMBER, IS_OBJECT, IS_STRING, IS_DEFINED, IS_PRIMITIVE

String: CONCAT, CONTAINS, ENDSWITH, INDEX_OF, LEFT, LENGTH, LOWER, LTRIM, REPLACE, REPLICATE, REVERSE, RIGHT, RTRIM, STARTSWITH, SUBSTRING, UPPER

Array: ARRAY_CONCAT, ARRAY_CONTAINS, ARRAY_LENGTH, ARRAY_SLICE

Spatial: ST_DISTANCE, ST_WITHIN, ST_ISVALID, ST_ISVALIDDETAILED


Querying – SQL (3 of 3)

SQL ternary and coalesce: ?, ??

SELECT (c.grade < 5) ? "elementary" : "other" AS gradeLevel
FROM Families.children[0] c

SELECT f.lastName ?? f.surname AS familyName
FROM Families f

Projecting into new JSON objects:

SELECT { "state": f.address.state, "city": f.address.city }, { "name": f.id }
FROM Families f
WHERE f.id = "AndersenFamily"

Result: [ { "$1": { "state": "WA", "city": "seattle" }, "$2": { "name": "AndersenFamily" } } ]

Creating arrays:

SELECT [f.address.city, f.address.state] AS CityState
FROM Families f

Result: [ { "CityState": [ "seattle", "WA" ] }, { "CityState": [ "NY", "NY" ] } ]

Returning single values:

SELECT VALUE "Hello World"

Result: [ "Hello World" ]


Querying - LINQ

LINQ functions:

Math: Abs, Acos, Asin, Atan, Ceiling, Cos, Exp, Floor, Log, Log10, Pow, Round, Sign, Sin, Sqrt, Tan, Truncate

String: Concat, Contains, EndsWith, IndexOf, Count, ToLower, TrimStart, Replace, Reverse, TrimEnd, StartsWith, Substring, ToUpper

Array: Concat, Contains, and Count

Spatial: Distance, Within, IsValid, and IsValidDetailed


Paging

Can specify maximum number of items to retrieve

Has more results / get next results

Ordering
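The "maximum items / has more results / get next results" pattern can be sketched with an in-memory stand-in. Nothing here talks to the service: `createPagedQuery`, `hasMoreResults` and `executeNext` are illustrative names for this sketch, and the position counter plays the role a continuation token would play on a real query.

```javascript
// In-memory stand-in for a paged query (hypothetical, not an SDK type).
function createPagedQuery(items, maxItemCount) {
  let position = 0; // stands in for a continuation token
  return {
    // True while there are items that have not been returned yet
    hasMoreResults: () => position < items.length,
    // Returns the next page of at most maxItemCount items
    executeNext: () => {
      const page = items.slice(position, position + maxItemCount);
      position += page.length;
      return page;
    }
  };
}

// Order first, then page through the results two at a time.
const sessions = ["d", "a", "c", "e", "b"].sort();
const query = createPagedQuery(sessions, 2);
const pages = [];
while (query.hasMoreResults()) {
  pages.push(query.executeNext());
}
console.log(pages);
```

Ordering is applied before paging so that consecutive pages do not overlap or skip items.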


Updating

Inserts

  From POCO

  From Stream

  Batching: Document Explorer, Data Migration Tool, Stored Procedures

Replaces

  Concurrency control from ETags

Deletes

  By self link or id
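Concurrency control from ETags can be illustrated with a tiny in-memory store: a replace succeeds only if the caller presents the ETag it last read. This is a simulation under stated assumptions, not the service API; `createStore`, `upsert` and `replace` are hypothetical names, and the 412 code mirrors the HTTP "Precondition Failed" status used for a failed If-Match check.

```javascript
// In-memory sketch of optimistic concurrency with ETags (hypothetical names).
function createStore() {
  const docs = new Map();
  let version = 0;
  return {
    // Saves the document and stamps it with a fresh _etag
    upsert(doc) {
      const stored = Object.assign({}, doc, { _etag: String(++version) });
      docs.set(doc.id, stored);
      return stored;
    },
    // Replaces only if the caller's ETag still matches the stored one
    replace(doc, ifMatchEtag) {
      const current = docs.get(doc.id);
      if (!current) throw new Error("not found");
      if (current._etag !== ifMatchEtag) {
        const err = new Error("precondition failed");
        err.code = 412; // HTTP 412 Precondition Failed
        throw err;
      }
      return this.upsert(doc);
    }
  };
}

const store = createStore();
const v1 = store.upsert({ id: "s1", title: "DocumentDB" });
const v2 = store.replace({ id: "s1", title: "DocumentDB intro" }, v1._etag);

let conflict = null;
try {
  // v1._etag is stale now, so this write is rejected
  store.replace({ id: "s1", title: "stale write" }, v1._etag);
} catch (err) {
  conflict = err.code;
}
console.log(v2._etag, conflict);
```

On a conflict the usual reaction is to re-read the document, reapply the change and try again with the new ETag.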


Transactions

No explicit transactions

Implicit inside triggers and stored procedures – only at collection level


Partition Resolvers

Specified per database

Possibly several

Can decide on which collection a document is to be saved or retrieved from

Included:

HashPartitionResolver: distribute data evenly across collections

RangePartitionResolver<T>: when there is a “natural” ordering, such as with date and time
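The idea behind a hash-based resolver can be sketched as follows. This is a deliberately simplified stand-in (a plain string hash modulo the number of collections), not the algorithm used by the SDK's HashPartitionResolver; `hashString`, `createHashResolver`, `resolveForCreate` as used here, and the collection links are all invented for the example.

```javascript
// Simple 32-bit string hash (illustrative only).
function hashString(key) {
  let hash = 0;
  for (let i = 0; i < key.length; i++) {
    hash = (hash * 31 + key.charCodeAt(i)) >>> 0; // keep as unsigned 32-bit
  }
  return hash;
}

// Hypothetical resolver: picks a collection from the document's partition key.
function createHashResolver(collectionLinks, getPartitionKey) {
  return {
    resolveForCreate(doc) {
      const key = String(getPartitionKey(doc));
      return collectionLinks[hashString(key) % collectionLinks.length];
    }
  };
}

const links = ["colls/sessions0", "colls/sessions1", "colls/sessions2"];
const resolver = createHashResolver(links, doc => doc.speaker);

// Documents with the same partition key always land in the same collection.
const first = resolver.resolveForCreate({ id: "1", speaker: "Ricardo" });
const second = resolver.resolveForCreate({ id: "2", speaker: "Ricardo" });
console.log(first, second);
```

A modulo scheme like this reshuffles most keys when a collection is added, which is one reason real resolvers tend to use more elaborate schemes.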


User Defined Functions

JavaScript-based

Exist in collections

No side effects

var regexMatchUdf = new UserDefinedFunction
{
    Id = "REGEX_MATCH",
    // Verbatim string (@"...") so the JavaScript body can span several lines
    Body = @"function (input, pattern) {
        return input.match(pattern) !== null;
    }"
};

SELECT udf.REGEX_MATCH("ardo", s.Id) FROM Session s
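Because a UDF body is plain JavaScript with no side effects, it can be tried locally before it is registered. The sketch below copies the function body from the slide; the name `regexMatch` and the sample inputs are made up.

```javascript
// The UDF body from the slide, as a standalone function for local testing.
function regexMatch(input, pattern) {
  // String.prototype.match returns null when the pattern does not match
  return input.match(pattern) !== null;
}

const hit = regexMatch("Ricardo", "ardo");
const miss = regexMatch("Luis", "ardo");
console.log(hit, miss);
```

Note that the first argument is the text being searched and the second is the pattern, so argument order matters when calling the UDF from a query.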


Stored Procedures

JavaScript-based

Exist in collections

Can do batching

Implicit transactions

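function (gender) {
    var response = getContext().getResponse();
    var collection = getContext().getCollection();
    // Parameterized query instead of string concatenation
    var query = {
        query: 'SELECT * FROM c WHERE c.Gender = @gender',
        parameters: [{ name: '@gender', value: gender }]
    };
    var accepted = collection.queryDocuments(collection.getSelfLink(), query, {},
        function (err, documents, options) {
            // Throwing aborts the procedure and rolls back the implicit transaction
            if (err) throw err;
            response.setBody(JSON.stringify(documents));
        }
    );
    if (!accepted) throw new Error('The query was not accepted by the server');
}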


Triggers

JavaScript-based

Exist in collections

Two types:

Pre trigger

Post trigger

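function updateTrigger() {
    var request = getContext().getRequest();
    // The body of the incoming request is the document being written
    var doc = request.getBody();
    doc['message'] = 'Added by trigger';
    // Replace the request body so the modified document is what gets stored
    request.setBody(doc);
}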
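To see what a pre trigger does to a document, the trigger can be run locally against a stub of the server-side context. The stub below is a hypothetical stand-in written for this example (`runPreTrigger` is not part of any SDK); it only implements the `getRequest`, `getBody` and `setBody` calls that the trigger uses.

```javascript
// The trigger from the slide.
function updateTrigger() {
  var request = getContext().getRequest();
  var doc = request.getBody();
  doc['message'] = 'Added by trigger';
  request.setBody(doc);
}

// Hypothetical local harness: fakes getContext() for the duration of the call.
function runPreTrigger(trigger, incomingDoc) {
  let body = incomingDoc;
  const request = {
    getBody: () => body,
    setBody: (d) => { body = d; }
  };
  globalThis.getContext = () => ({ getRequest: () => request });
  try {
    trigger();
  } finally {
    delete globalThis.getContext; // do not leak the stub
  }
  return body; // the document as it would be stored
}

const saved = runPreTrigger(updateTrigger, { id: "s1" });
console.log(saved);
```

A pre trigger sees the document before it is written, which is why it can add or change properties; a post trigger runs after the write.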


Security

Access keys:

Master (single)

Read only (multiple)

Database users – specify use at DocumentClient level

Permissions for users over resources (resource tokens: default expiration is 1h, up to 5h):

All

Read

Resources:

Collections

Documents

Attachments

Stored procedures

Triggers

User defined functions


Limits

Feature | Limit
Maximum Request Units / second / collection | 2500
Maximum execution time for stored procedure and trigger | 5 s
Provisioned document storage / collection | 50 GB
Maximum collections per database account* | 100
Maximum document storage per database (100 collections)* | 1 TB
Maximum length of the id property | 255 chars
Maximum request size of document and attachment | 512 KB
Maximum number of JOINs per query* | 5
Number of stored procedures, triggers and UDFs per collection* | 25
Number of users per database account | 500,000


Search

Based on Elasticsearch and Lucene

.NET + REST APIs

Can retrieve data from DocumentDB


Best Practices

Cache the DocumentClient instance

Choose right collection index update policy

Index only properties that will be searchable and with appropriate values – watch out for ranges

Store small documents

Measure and tune request costs

Retrieve only what you need – paging, projections

Cache self links – they never change

Use partition resolvers for distributing burden

Beware throttling!
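One common way to cope with throttling is to retry after the delay the service suggests. The sketch below assumes that a throttled call fails with an error carrying `code` 429 and a `retryAfterInMilliseconds` hint; those property names, the helper `withRetry` and the injected `wait` function are assumptions for illustration. It is kept synchronous so it can run as-is; real client code would wait asynchronously.

```javascript
// Retries an operation while it fails with a throttling error (assumed shape:
// err.code === 429, optional err.retryAfterInMilliseconds).
// `wait` is injected so the caller decides how to pause.
function withRetry(operation, maxAttempts, wait) {
  for (let attempt = 1; ; attempt++) {
    try {
      return operation();
    } catch (err) {
      // Give up on other errors or when attempts are exhausted
      if (err.code !== 429 || attempt >= maxAttempts) throw err;
      wait(err.retryAfterInMilliseconds || 1000);
    }
  }
}

// Simulated operation: throttled twice, then succeeds.
let calls = 0;
function flakyRead() {
  calls++;
  if (calls <= 2) {
    const err = new Error("Request rate is large");
    err.code = 429;
    err.retryAfterInMilliseconds = 100;
    throw err;
  }
  return "ok";
}

const waits = [];
const result = withRetry(flakyRead, 5, ms => waits.push(ms));
console.log(result, calls, waits);
```

Measuring request costs and spreading load with partition resolvers reduces how often such retries are needed in the first place.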


Meet the Competition

MongoDB

Open source + support model

No joins

Aggregations

Time to live

Offline deployment

Replication

Eventual consistency

ACID transactions

Map/Reduce

Several programming languages supported

RavenDB

Open source + support model

Joins across documents

Aggregations

Expiry

Offline deployment

Replication

Eventual consistency

ACID transactions

Map/Reduce

.NET, REST


References

Query Playground: https://www.documentdb.com/sql/demo

.NET Azure DocumentDB Samples: https://github.com/Azure/azure-documentdb-net

DocumentDB Studio: https://studiodocumentdb.codeplex.com/

Azure DocumentDB Data Migration Tool: http://www.microsoft.com/en-us/download/details.aspx?id=46436

Pricing: https://azure.microsoft.com/en-us/pricing/details/documentdb/

Connecting DocumentDB with Azure Search using indexers: https://azure.microsoft.com/en-us/documentation/articles/documentdb-search-indexer/


A search-as-a-service solution allowing developers to incorporate great search experiences into applications without managing infrastructure or needing to become search experts.


Type Ahead


Facets


Hit Highlighting


Spelling Mistakes


Geo-Spatial Search


Paging


Sorting & Scoring


New indexers (SQL Database and DocumentDB)

New language support (35 languages including pt-PT)

Index creation in the new Management Portal

New Regions

New APIs for index creation


• Distance

• Intersection


Full Text Search

Secure data with authentication, authorization and encryption


Extended Events


[Diagram labels: Azure Portal, Azure Ops Team, ML Studio, Data Scientist, HDInsight, Azure Storage, Training Set from on-prem, Azure Portal & ML API service, PowerBI/Dashboards, Mobile Apps, Web Apps, ML API service, Developer]

ML Studio and the Data Scientist

• Access and prepare data

• Create, test and train models

• Collaborate

• One click to stage for production via the API service

Azure Portal & ML API service and the Azure Ops Team

• Create ML Studio workspace

• Assign storage account(s)

• Monitor ML consumption

• See alerts when model is ready

• Deploy models to web service

ML API service and the Developer

• Tested models available as a URL that can be called from any endpoint

Business users easily access results: from anywhere, on any device


Cloud

Event Hubs

ML Studio ML API Service

Microsoft

Azure Portal

Blob Storage


ML Apps

Marketplace

ML Operationalization

ML Studio

ML Algorithms


Observation

Pattern

Theory

Hypothesis

Descriptive Analytics: What happened?

Diagnostic Analytics: Why did it happen?

Predictive Analytics: What will happen?

Prescriptive Analytics: How can we make it happen?

Top-Down

Confirmation

Theory

Hypothesis

Observation


Implement Data Warehouse

Physical Design

ETL Development

Reporting & Analytics Development

Install and Tune

Reporting & Analytics Design

Dimension Modelling

ETL Design

Setup Infrastructure

Understand Corporate Strategy

Data sources

ETL

BI and analytic

Data warehouse

Gather Requirements

Business Requirements

Technical Requirements


Ingest: regardless of requirements

Store: in native format without schema definition

Analyze: using analytic engines like Hadoop

Interactive queries

Batch queries

Machine Learning

Data warehouse

Real-time analytics

Devices


Store and analyse data of any kind and size

Develop faster, debug and optimise smarter

Interactively explore patterns in your data

No learning curve—use U-SQL, Spark, Hive, HBase and Storm

Managed and supported with an enterprise-grade SLA

Dynamically scales to match your business priorities

Enterprise-grade security with Azure Active Directory

Built on YARN, designed for the cloud


AZURE DATA LAKE

DEV TOOLS

Visual Studio

PowerShell

MS

Azure Data Factory

Azure Stream Analytics*

MS

HDInsight

Kona

Azure SQL DW*

AzureML*

3rd Party

Informatica*

3rd Party

Cloudera*

Hortonworks*

MapR*

Open Source

Sqoop

Flume

MS

RevolutionR*

PowerBI*

3rd Party

TBA

PLATFORMS

APPLICATIONS

DATA INTEGRATION TOOLS


Last Name | First Name | Country | Age | …
Flasko | Mike | Canada | 32 |
Anand | Subbaraj | USA | 30 |
Gaurav | Malhotra | USA | 72 |
… | … | … | … |

Last Name | First Name | At risk of churning | …
Flasko | Mike | Yes |
Anand | Subbaraj | No |
Gaurav | Malhotra | Yes |
… | … | |


Call Log Files

Customer Table

Call Log Files

Customer Table

Customer Churn Table

Data Sources Ingest Transform & Analyze Publish

Customer Call Details

Customers Likely to Churn
