CS 744: Big Data...

CS 744: Big Data Systems

Shivaram Venkataraman Fall 2018

ADMINISTRIVIA

-  Assignment 2, Midterm grades this week -  Course Projects: round 2 meetings next Friday -  Next Tuesday: Guest speaker for first part

WHAT WE KNOW SO FAR

CONTINUOUS OPERATOR MODEL

Long-lived operators

Distributed Checkpoints High overhead for Fault

Recover

Naiad Task

Control Message Driver

Network Transfer

Mutable State

Stragglers ?

1.  Scalability to hundreds of nodes

2. Minimal cost beyond base processing (no replication) 3. Second-scale latency 4. Second-scale recovery from faults and stragglers

DISCRETIZED STREAMS

DISCRETIZED STREAMS (DSTREAMS)

Approach - Use short, stateless, deterministic tasks - Store state across tasks as in-memory RDDs - Fine-grained tasks à Parallel recovery / speculation

- Chunk inputs into a number of micro-batches - Processed via parallel operations (i.e., map, reduce, groupBy etc.)

- Save intermediate state as RDD / write output to external systems

COMPUTATION MODEL: MICRO-BATCHES

Control Message Driver

SHUFFLE

Network Transfer

Micro-Batch

EXAMPLE pageViews=readStream(http://...,"1s")ones=pageViews.map(event=>(event.url,1))counts=ones.runningReduce((a,b)=>a+b)

ARCHITECHTURE

DSTREAM API

Output operations save output to external database / filesystem

Transformations

Stateless: map, reduce, groupBy, join Stateful: window(“5s”) à RDDs with data in [0,5), [1,6), [2,7) reduceByWindow(“5s”, (a, b) => a + b) à incremental aggregation

ASSOCIATIVE, INVERTIBLE

Add previous 5 each time

Subtract previous and add current

OTHER ASPECTS

Tracking State: streams of (Key, Event) à (Key, State) - Initialize: Create a State from the first event - Update: Return new State given, old state and event - Timeout for dropping old states.

Unifying batch and stream

- Join DStream with static RDD - Attach console and query existing RDDs - Shared codebase, functions etc.

events.track((key,ev)=>1,(key,st,ev)=>ev==Exit?null:1,"30s”)

SYSTEM IMPLEMENTATION

OPTIMIZATIONS

Network Communication Rewrote Spark’s data plane to use asynchronous I/O

Timestep Pipelining

No barrier across timesteps unless needed Tasks from the next timestep scheduled before current finishes

Checkpointing

Async I/O, as RDDs are immutable Forget lineage after checkpoint

FAULT TOLERANCE: PARALLEL RECOVERY

Worker failure - Need to recompute state RDDs stored on worker - Re-execute tasks running on the worker

Strategy - Run all independent recovery tasks in parallel - Parallelism from partitions in timestep and across timesteps

EXAMPLE pageViews=readStream(http://...,"1s")ones=pageViews.map(event=>(event.url,1))counts=ones.runningReduce((a,b)=>a+b)

FAULT TOLERANCE

Straggler Mitigation Use speculative execution Task runs more than 1.4x longer than median task à straggler

Master Recovery

- At each timestep, write out graph of DStreams and Scala function objects - Workers connect to a new master and report their RDD partitions - Note: No problem if a given RDD is computed twice (determinism).

DISCUSSION/SHORTCOMINGS

Expressiveness - Current API requires users to “think” in micro-batches

Setting batch interval - Manual tuning. Higher batch à better throughput but worse latency

Memory usage - LRU cache stores state RDDs in memory

SUMMARY

Micro-batches: New approach to stream processing Higher latency for fault tolerance, straggler mitigation Unifying batch, streaming analytics

CS 744: Big Data...

Documents

Jayashankar - University of Wisconsin–Madisonpages.cs.wisc.edu/.../cs744-jayashankar-mesos.pdf · Mesos Architecture . Resource Offer ... Master is designed to be soft state, new

ASAP: Fast, Approximate Graph Pattern Mining at Scalepages.cs.wisc.edu/~shivaram/cs744-slides/cs744-yunang-asap.pdfosdi18_slides_iyer.pdf Iyer, Anand Padmanabha, et al. "ASAP: fast,

CS 744: MAPREDUCEpages.cs.wisc.edu/~shivaram/cs744-fa19-slides/cs744-mapred-notes.pdfCS 744: MAPREDUCE Shivaram Venkataraman Fall 2019 . ANNOUNCEMENTS • Assignment 1 out • CloudLab

Continuous Authentication of Mobile User: Fusion …vipl.ict.ac.cn/uploadfile/upload/2017020711340782.pdfContinuous Authentication of Mobile User: Fusion of Face Image and Inertial

Continuous Random Variables › question_assets › idcollabstat1 › Chapter5.pdfContinuous Random Variables 5.1 Continuous Random Variables1 5.1.1 Student Learning Objectives By

Making Sense of Performance in Data Analytics Frameworkspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-making-zi.pdf · Making Sense of Performance in Data Analytics Frameworks Authors:

Zero-Interaction Authentication April 15, 2003 Mark D.Corner, Brian D. Noble Presented by Seong Oun Hwang CS744 Special Topics in System Architecture:

Managing Server Load in Global Memory Systemspages.cs.wisc.edu/~vernon/papers/gms.97sigm.pdfManaging Server Load in Global Memory Systems Geoffrey M. Voelker, Herve´ A. Jamrozik,

CONTINUOUS DEPLOYMENTlichota.pl/continuous_deployment_072017.pdfCONTINUOUS DEPLOYMENT na przykładzie aplikacji w Django Wojciech Lichota - STX Next Lipiec 2017

Modeling and Optimization within Interacting Systemspages.cs.wisc.edu/~ferris/talks/beijing-mar.pdf · 2015-05-11 · Modeling and Optimization within Interacting Systems Michael

GraphX : Graph Processing in a Distributed Dataflow Frameworkpages.cs.wisc.edu/~shivaram/cs744-slides/cs744-graphx-bidyut.pdf · Google Knowledge graph :570MVertices, 18B Edges (

CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-trill.pdf · - Real-time streaming, temporal queries on logs, progressive queries etc. Language Integration

Protocol- and Situation-Aware Distributed Storage Systemspages.cs.wisc.edu › ~ra › ram-thesis.pdf · Bharath Ramakrishnan reminded me to have fun. I am grateful to my brothers

IG DATA SYSTEMSpages.cs.wisc.edu/~paris/cs564-f16/lectures/lecture-18.pdf– OLAP (decision support queries) • MapReduce – first developed by Google, published in 2004 – only

COZ : Finding Code that Counts with Causal Profilingpages.cs.wisc.edu/~shivaram/cs744-slides/cs744-coz-anuja.pdf•Fluidanimate–replaced custom barrier by default (37 %) •Memcached–removed

Clipper: A Low-Latency Online Prediction Serving Systempages.cs.wisc.edu/~shivaram/cs744-readings/clipper.pdfClipper: A Low-Latency Online Prediction Serving System Daniel Crankshaw*,

Dremel: Interactive Analysis of Web-Scale Datasetspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-philip-dremel.pdf · Dremel is a system that supports interactive analysis of very

Integrated Modeling for Optimization of Energy Systemspages.cs.wisc.edu/~ferris/talks/orsnz-dec.pdfIntegrated Modeling for Optimization of Energy Systems Michael C. Ferris University

Introduction to Operating Systemspages.cs.wisc.edu/~remzi/Classes/537/Spring2012/Book/intro.pdf2.2 Virtualizing Memory Now let’s consider memory. The model of physical memory pre-sented

APACHE FLINK STREAM AND BATCH PROCESSING IN A SINGLE …pages.cs.wisc.edu/~shivaram/cs744-slides/cs744-akshaya-flink.pdf · • Apache Flink is designed to perform both stream and