28
Rethinking Storage

Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

Rethinking Storage

Page 2: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 3: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 4: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 5: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

Library model

Temporal Locality Search engine maintains per-user stats

over last week

Billions of users “alive” at given instant

But, only millions actively surfing

Everything stored in main memory

Storage is the main reason to scale out

Apps, Services, StreamingPipelines,Analytics, …

Devices, Clients, Dashboards, …

Page 6: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

FASTER Log

FASTER KV

CPR

CRA / Ambrosia

FishStore

Mison

end-user apps cloud services

storage accelerator point of truth

Page 7: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 8: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

SimpleStore for Compute Workloads FASTER Log

FASTER KV + cache

Concurrent Prefix Recovery

Library for Serverless

Page 9: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

Apps, Services, StreamingPipelines,Analytics, …

Devices, Clients, Dashboards, …

Page 10: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

integrated cache

Page 11: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

head

tail …

…Hash Index Hybrid Record LogThreads

..., r2 ,r1

...

...

...

Disk

Memory (cache)

Page 12: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

HeadOffset

In Memory

Read-Copy Update

Stable

LA = 0

Read Only

LA = ∞

Mutable

Disk

Increasing Logical

Address

In-Place Update

ReadOnlyOffset

Page 13: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

-

50

100

150

0 10 20 30 40 50 60

Thro

ugh

pu

t (M

ops/

sec)

Number of Threads

FASTER (2cpu)

Intel TBB (2cpu)

Masstree (2cpu)

RocksDB (2cpu)

-

20

40

60

80

100

0 10 20 30 40 50 60

Thro

ugh

pu

t (M

ops/

sec)

Number of Threads

FASTER (2cpu)

Intel TBB (2cpu)

Masstree (2cpu)

RocksDB (2cpu)

Page 14: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 15: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

group commit

Prefix Recovery (PR)

Page 16: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

local operation timeline

System, not user, chooses exact CPR point per

thread → key to non-blocking

Page 17: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 18: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 19: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 20: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

State store

Edge cache++

persistent queue

Page 21: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

Wrapper Library

Local KVS(FASTER)

Cloud Storage

AppLogic

Compute (Container)

Remote KVS (FASTER)

Cloud MessagingService

Metadata Store

SSD

SSD…

Page 22: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

Reusable storage artifacts help build such systems, make it

easier to manage app state

Page 23: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

SimpleStore for Big Data Analytics FishStore for flexible schema data

Learned data layouts for storage & caching

Page 24: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 25: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”
Page 26: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

reinforcement learning

Page 27: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”

https://github.com/badrishc

https://badrish.net/

Page 28: Rethinking Storage for the cloud, edge, serverless, and ... · Library model Temporal Locality Search engine maintains per-user stats over last week Billions of users “alive”