13
INTERACTIVE VISUALIZATION OF STREAMING DATA POWERED BY SPARK

Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Embed Size (px)

Citation preview

Page 1: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

INTERACTIVE VISUALIZATION OF STREAMING DATA POWERED BY SPARK

Page 2: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Streaming @Zoomdata

Visualizations react to new data delivered

Users start, stop, pause the stream

Users select a rolling window or pin a start time to capture cumulative metrics

Page 3: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Drivers for Streaming Data

Data Freshness Time to Analytic Business Context

Page 4: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Challenges• Time

• Frequency

• Retention

• Synchronization

• Order

• Updates

Page 5: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Addressing streaming @Zoomdata

Historical Revised

Receive Data JMS Kafka

Manipulate Stream Single JVM in Memory Spark Streaming

Hold Data in Buffer MongoDB Pluggable

Interact with Data Custom Code Pluggable

Page 6: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Technology Cast• The Stream - Kafka, Kinesis, JMS• Processing Fabric - Spark Streaming• Landing Area - MemSQL, Solr, Kudu,

Others

Page 7: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

How it looks

Page 8: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

With the rest of the app

Page 9: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Scale Out

Page 10: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Demo• Twitter Producer• Spark Streaming• MemSQL & Solr

Sinks

Page 11: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Benefits• Contextual Expressiveness with Streaming

Data• Independent scalability (scale-up, scale-

around)• Expressiveness powered by Spark -- using

Windowing (dataframe API with stream)• DR COOP, other Data management concerns

Page 12: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Future Work• Cross stream synchronization & fusion

• On-demand scale out and resource management via Mesos

• Schema evolution

• More extensible landing strategies

Page 13: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Thanks