27
Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu Yale University

Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Libra and the Art of Task Sizing in Big-Data Analytic Systems

Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Yale University

Page 2: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Background

Page 3: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Stage 0

Stage 1 Stage 2

Stage 3 Stage 4

Stage 5

Stage 6

Page 4: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Stage 0

Stage 1 Stage 2

Stage 3 Stage 4

Stage 5

Stage 6

Stage 4

Page 5: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Stage 0

Stage 1 Stage 2

Stage 3 Stage 4

Stage 5

Stage 6

Stage 4

stage input data

stage output data

Page 6: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Stage 0

Stage 1 Stage 2

Stage 3 Stage 4

Stage 5

Stage 6

Stage 4

stage input data

stage output data

Page 7: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Stage 0

Stage 1 Stage 2

Stage 3 Stage 4

Stage 5

Stage 6

Stage 4

stage input data

stage output data

How to set task size?

Page 8: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Stage 0

Stage 1 Stage 2

Stage 3 Stage 4

Stage 5

Stage 6

Stage 4

stage input data

stage output data

How to set task size?-- User experience-- System default value

Page 9: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

The importance of task sizing

Page 10: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Observation 1: diff jobs have diff optimal task sizes

Normalized stage completion time vs task size

Page 11: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

PageRank stage completion time vs task size

Observation 2: diff stages havediff optimal task sizes

Page 12: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

1. Proper task sizing is important

Page 13: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

1. Proper task sizing is important2. U-curve pattern

Page 14: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Analysis of U-curve pattern

Page 15: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Per-task overhead for PageRank stage 1

Observation 3: tasks have similar scheduling delay and system overhead regardless of task sizes

Page 16: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

# of IO ops for different stages of PageRank

Observation 4: small size => fail to do batch processinglarge size => memory swapping

Page 17: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Small task size => high aggregated overhead, no batch processingLarge task size => memory swapping

Page 18: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

System design

• Strawman solution

Page 19: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Refinement 1: ADAM optimization

Page 20: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Refinement 2: noise filtering

Task processing rate fluctuation for stage 1 of PageRank

Page 21: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Refinement 2: noise filtering

Task processing rate fluctuation for stage 1 of PageRank

Page 22: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Refinement 3: contention avoidance

PageRank over two machines

Page 23: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Refinement 3: contention avoidance

PageRank over two machines

Page 24: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Evaluation

• 8 m4.xlarge VMs from EC2

• Workloads generated from HiBench

Page 25: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Initial task size effect

PageRank completion time over diff. initial task size

Page 26: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Libra performance

PageRank completion time with diff. input data size

Page 27: Libra and the Art of Task Sizing in Big-Data Analytic Systems · 2020-03-06 · Libra and the Art of Task Sizing in Big-Data Analytic Systems Rui Li, Peizhen Guo, Bo Hu, Wenjun Hu

Q&A