Lightbend Fast Data Platform A Technical Overview

Page 1: Lightbend Fast Data Platform

Lightbend Fast Data Platform

A Technical Overview

Page 2: Lightbend Fast Data Platform

A Bit of History: IT Services vs. Big Data

Page 3: Lightbend Fast Data Platform

The Recent Past…

[Diagram: Services and Big Data, with some overlap in concerns and architecture.]

Page 4: Lightbend Fast Data Platform

The Future?

[Diagram: Microservices and Fast Data, with much more overlap.]

Page 5: Lightbend Fast Data Platform

Drilling Down: Monoliths to Microservices

Page 6: Lightbend Fast Data Platform

lightbend.com/reactive-microservices-architecture

Page 7: Lightbend Fast Data Platform

Monoliths

• Tangled responsibilities lead to infrequent, “big-bang” deployments
• App lifetimes: months to forever!

Page 8: Lightbend Fast Data Platform
Page 9: Lightbend Fast Data Platform

Microservices

• Each does one thing, so it must be message driven & asynchronous
• Updates easier, deployments frequent
• App lifetimes: minutes! to forever

Page 10: Lightbend Fast Data Platform

Microservices

• But be careful:
  • Message overhead is much higher than the overhead of function calls!
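A rough way to see this overhead: the sketch below times a plain function call against the same call routed through an in-process `queue.Queue`, which stands in for a message channel. The names `add_one`, `call_directly`, and `call_via_messages` are hypothetical, and a real network hop between services would typically cost far more than this in-memory queue does.

```python
import queue
import timeit


def add_one(x: int) -> int:
    """The 'service logic': trivially cheap on purpose."""
    return x + 1


def call_directly(n: int) -> int:
    """Invoke the logic n times as ordinary function calls."""
    total = 0
    for i in range(n):
        total += add_one(i)
    return total


def call_via_messages(n: int) -> int:
    """Invoke the same logic n times, passing each argument through a queue."""
    inbox: queue.Queue = queue.Queue()
    total = 0
    for i in range(n):
        inbox.put(i)                    # "send" the request as a message
        total += add_one(inbox.get())   # "receive" the message and handle it
    return total


N = 10_000
direct_seconds = timeit.timeit(lambda: call_directly(N), number=5)
message_seconds = timeit.timeit(lambda: call_via_messages(N), number=5)
print(f"direct calls: {direct_seconds:.4f}s, via queue: {message_seconds:.4f}s")
```

Both paths compute the same result; only the delivery mechanism differs, so any gap in the timings is messaging overhead. Exact numbers depend on the machine.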

Page 11: Lightbend Fast Data Platform

Reactive Systems

reactivemanifesto.org

Page 12: Lightbend Fast Data Platform

Lightbend Reactive Platform

Open Source Core for Development in Scala/Java

Production Suite for Operations/DevOps

Page 13: Lightbend Fast Data Platform

Drilling Down: Big Data

Page 14: Lightbend Fast Data Platform

Hadoop

[Diagram: a Hadoop cluster. Labels: YARN; HDFS; MR job #1; MR job #2; Flume; Sqoop; DBs; Master (Resource Manager, Name Node); Slave Node (Node Mgr, Data Node, disks).]

Page 15: Lightbend Fast Data Platform

Hadoop 2013: Embrace Spark

[Diagram: the Hadoop cluster from Page 14, with Spark job #1 and Spark job #2 added alongside MR job #1 and MR job #2.]

Page 16: Lightbend Fast Data Platform

Go beyond batch?

[Diagram: the Hadoop cluster from Page 14, with Spark job #1 and Spark job #2 alongside MR job #1 and MR job #2.]

Page 17: Lightbend Fast Data Platform

The Emergence of Fast Data: (Time Is Money)

Page 18: Lightbend Fast Data Platform

Respond to change

Page 19: Lightbend Fast Data Platform

IoT

Page 20: Lightbend Fast Data Platform

[Diagram: a fast data architecture. Labels: Logs; Sockets; Storage (HDFS, S3, …; SQL/NoSQL; ES); Kafka Streams; Streaming; Streams; SQL; Batch; Flink.]

Page 21: Lightbend Fast Data Platform

Fast Data and Microservices; Are they Converging?

Page 22: Lightbend Fast Data Platform

Synergies

• Each [stream app or μservice]:
  • has one responsibility
  • ingests unending [data or messages]

Page 23: Lightbend Fast Data Platform

Synergies

• Each [stream app or μservice] must:
  • operate asynchronously
  • offer never-ending service

Page 24: Lightbend Fast Data Platform

Thesis

• These architectures are converging:
  1. Similar design problems
  2. Data becomes the dominant problem

Page 25: Lightbend Fast Data Platform

Lightbend Fast Data Platform

Page 26: Lightbend Fast Data Platform

Value Three Ways

1. An accelerated on-ramp for building streaming data systems, data applications, and other microservices.

Page 27: Lightbend Fast Data Platform

Value Three Ways

2. Best-practices guidance for solving specific design problems:
   - Sample apps
   - Documentation
   - Enablement services

Page 28: Lightbend Fast Data Platform

Value Three Ways

3. Machine learning-based monitoring and management:
   - Keep your systems resilient, scalable, and responsive with minimal user intervention.

Page 29: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram. Labels: Internet; Logs; Sockets; DC/OS: On Premise or Cloud; FDP; Kafka Connect; Storage (HDFS, S3, …; SQL/NoSQL; ES); Kafka Streams; Cloud Hosted; FDP.ai; Cluster Analysis; Machine Learning; Microservices; Production Suite; ML; Streaming; Streams; SQL; Batch; Flink; Intelligent Management … Consoles.]

Page 30: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 31: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 32: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 33: Lightbend Fast Data Platform

[Diagram: producers and consumers (Service 1, Service 2, Service 3, other services, the Internet, log & other files) connected point to point: N * M links.]

Page 34: Lightbend Fast Data Platform

[Diagram: the producers and consumers from Page 33 (Service 1, Service 2, Service 3, other services, the Internet, log & other files), each now connected once to a shared intermediary: N + M links.]
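The arithmetic behind the two link counts can be sketched as follows. It assumes N producers and M consumers, and that the N + M case routes everything through one shared intermediary such as a message broker (an assumption; the slide text does not name it). The function names are hypothetical.

```python
def point_to_point_links(producers: int, consumers: int) -> int:
    """Every producer connects directly to every consumer."""
    return producers * consumers


def brokered_links(producers: int, consumers: int) -> int:
    """Every producer and every consumer connects once to a shared intermediary."""
    return producers + consumers


for n, m in [(3, 3), (10, 20), (100, 100)]:
    direct = point_to_point_links(n, m)
    brokered = brokered_links(n, m)
    print(f"N={n}, M={m}: point-to-point={direct}, via intermediary={brokered}")
```

The gap widens quickly: with 100 producers and 100 consumers, direct wiring needs 10,000 links while the intermediary needs 200.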

Page 35: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 36: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 37: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 38: Lightbend Fast Data Platform

Streaming Tradeoffs (1/3)

• Low latency? How low?
• High volume? How high?

Page 39: Lightbend Fast Data Platform

Streaming Tradeoffs (2/3)

• Which kinds of data processing & analytics are required?
• How will this processing be done?
  • Individual processing of events?
  • Bulk processing of records?
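The difference between processing events individually and processing records in bulk can be sketched in a few lines. This is a toy illustration under assumed names (`process_per_event`, `process_in_batches`, and the sample `readings` are all hypothetical), not the API of any streaming engine.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List


def process_per_event(events: Iterable[int],
                      handle: Callable[[int], int]) -> Iterator[int]:
    """Handle each event as soon as it arrives: one result per event."""
    for event in events:
        yield handle(event)


def process_in_batches(events: Iterable[int],
                       batch_size: int,
                       handle_batch: Callable[[List[int]], int]) -> Iterator[int]:
    """Collect events into fixed-size batches: one result per batch."""
    iterator = iter(events)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield handle_batch(batch)


readings = [3, 5, 2, 8, 6, 1, 7]

# Per event: each reading is transformed on its own.
doubled = list(process_per_event(readings, lambda r: r * 2))

# In bulk: readings are grouped three at a time and summed per group.
batch_sums = list(process_in_batches(readings, 3, sum))

print(doubled)
print(batch_sums)
```

Per-event handling produces a result as soon as each event arrives, while batching waits for a group to fill and amortizes per-call cost across it, which is the latency-versus-throughput tradeoff these slides ask about.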

Page 40: Lightbend Fast Data Platform

Streaming Tradeoffs (3/3)

• Which tools and data sources/sinks must interoperate with your streaming tool?

Page 41: Lightbend Fast Data Platform

Page 42: Lightbend Fast Data Platform

• Low latency
• Low volume
• Complex flows
• Complex Event Processing

Page 43: Lightbend Fast Data Platform

• Med. latency
• High volume
• Data flows, SQL
• En masse processing

Page 44: Lightbend Fast Data Platform

• Low latency
• High volume
• Data flows, correctness
• En masse processing

Page 45: Lightbend Fast Data Platform

• Low latency
• Med. volume
• ETL, “tables”
• Data flow or per event

Page 46: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 47: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 48: Lightbend Fast Data Platform

Kafka & Spark Metrics

Page 49: Lightbend Fast Data Platform

Correlated Troubleshooting

Page 50: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 51: Lightbend Fast Data Platform

Why Only Hosted Services?

• Only metrics are uploaded, not sensitive domain data.
• One less service for you to manage.
• …

Page 52: Lightbend Fast Data Platform

Why Only Hosted Services?

• …
• We can rapidly evolve this service without impacting your environment.
• You benefit from aggregated knowledge from all FDP clusters.

Page 53: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram repeated from Page 29.]

Page 54: Lightbend Fast Data Platform

Timeline

• Alpha: Jan 9
• Beta 1: April
• Beta 2: July
• V1.0: September

Page 55: Lightbend Fast Data Platform

Lightbend Fast Data Platform V1.0

[Architecture diagram as on Page 29, with two label differences: the cluster layer reads “DC/OS, Marathon: On Premise or Cloud”, and “Graviton” appears with Cluster Analysis where Page 29 shows “FDP.ai”.]

For More Information:

lightbend.com/fast-data-platform

Page 56: Lightbend Fast Data Platform

For More Information

lightbend.com/reactive-microservices-architecture

Page 57: Lightbend Fast Data Platform

For More Information

bit.ly/lightbend-fast-data

Page 58: Lightbend Fast Data Platform

• FDP: lightbend.com/fast-data-platform
• Book: bit.ly/lightbend-fast-data