26
Dr Allon Cohen, VP Products and Business Development, Elastifile Cross-Cloud Distributed File Systems: Bursting File System Dependent Workflows to the Public Cloud

Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Dr Allon Cohen, VP Products and Business Development, Elastifile

Cross-Cloud Distributed File Systems: Bursting File System DependentWorkflows to the Public Cloud

Page 2: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

• But, Why??

Everybody seems to be bursting their workloads to public clouds

Page 3: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 3

What the Cloud Offers

• Elasticity and Scalability> Resources scale and contract to match bursty workload requirements

• Ease of Use> Simple, intuitive management and universal application compatibility

• Freedom of Choice> Data and apps are mobilized to run where you want them

• Comparative Advantages for Analytics and AI Training> Cloud offers comparative advantage for certain workloads. Economies of scale, specialized HW, etc.

Page 4: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 4

But, Cloud is a (Wonderfully) Strange Environment

• Everything including Infrastructure is dynamic> Clusters can change sizes in minutes (not months)

• Scale seams infinite (from single enterprise perspective)> As many resources as you need available immediately

• Underlying hardware is a mystery> Assume nothing

• Different failure mechanisms> Zones, Live Migrations,

Unpredictable resets by unhuman (AI) admins

What did that AI just do to our consensus

leader?

Page 5: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

So How do You Adapt a Distributed File System to Public Cloud?

You Don’t!

You Design and Build a New One

Page 6: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 6

Optimization Point: Be the Best Data Platform forthe Cloud Era Enterprise

CloudConnect

Delivers Data Mobility Between FileSystems and Object Storage

Elastifile Cloud File System

Software-defined, Scalable File System

Page 7: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 7

Architectural Base• Scale Out– Support real-time & dynamic reconfiguration of the system– Consistent predictable performance (even under failures and noisy

environments)

• Software only– Avoid unnecessary/uncommon hardware requirements like NVRAM, RDMA

networks, etc

• File system core– Data level services & the best performance– Superset of block/object interfaces– Enables data sharing

Page 8: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 8

Examples of Unique Cloud Native Optimizations

Scalable Metadata• Scaling metadata is hard, due to the consistency requirements of

file-systems• Usually constancy comes at cost of performance• We’re using “consistency layering” to get a good combination of

consistency and performance at scale

Noisy Environment Adaptations• Write anywhere• Hiccup support

Page 9: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 9

Unique Consistency Layering Model

In-memory tracking of who’s owner of what. Scale-out by

sharding.

Actual MD

handling

Keeps (persistent) state of the configuration of the cluster.

ECDB:Configuration

DB

ORC: Ownership Repository Coordinator

Ownership

Page 10: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 10

Unique Consistency Layering Model

Open/close file

IO

Configuration change (add/remove node, etc.)

Higher rateORC: Ownership

Repository Coordinator

Ownership

ECDB:Configuration

DB

Page 11: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 11

Unique Consistency Layering Model

In-memory distributed consensus

Local function

call

Persistent distributed consensus

Lower costORC: Ownership

Repository Coordinator

Ownership

ECDB:Configuration

DB

Page 12: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 12

Consistency Strictly Maintained at Scale

Resilient to f node failures(2f+1 consensus)

Failure requires recovery

Resilient to f node failures & total power failure(2f+1 consensus)

Consistency is derived from depending on lower layerORC: Ownership

Repository Coordinator

Ownership

ECDB:Configuration

DB

Page 13: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 13

Bizur: A Key-value Consensus AlgorithmSpecialized consensus algorithm (instead of Paxos / Raft) Optimized for high concurrency and low latency, especially during failures

Page 14: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 14

Delivering Scalable, High-Performance In-Cloud File ServicesEnvironment Setup Results

Page 15: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Burst Use Case Examples

Page 16: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 16

Page 17: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 17

Elastically ScalableCompute Resources

Page 18: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 18

Elastically ScalableCompute Resources

Object

ElastifileCloudConnect

Page 19: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 19

Elastically ScalableCompute Resources

Object

ElastifileCloudConnect

Page 20: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 20

Elastically ScalableCompute Resources

Object

ElastifileCloudConnect

Page 21: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

21

A Real-World, Cloud-Integrated Drug Discovery SolutionMolecular Analysis Jobs

Object Storage

Distributed File Storage

GPU and CPU Compute

SLU

RM S

ched

uler Cloud

Instance

ELASTIFILE

CloudInstance

ELASTIFILE

CloudInstance

ELASTIFILE

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudInstance

CloudConnect

Page 22: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 22https://cloudplatform.googleblog.com/2018/02/why-we-used-Elastifile-Cloud-File-System-on-GCP-to-power-drug-discovery.html

Page 23: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

23

Genomics-as-a-ServiceA Non-Profit Research Institute Needed to leverage hybrid IT infrastructure

ChallengeData needs to be durable and accessible across environments

Applications should be capable of running the location that makes sense

Need to align costs with workload requirements

Merging the Best Aspects of On-Premises and Public Cloud Infrastructure

SolutionElastifile’s POSIX-compliant file system for a consistent interface on-premises and in-cloud

Elastifile CloudConnect for data transport between on-premises and cloud

Elastifile CloudConnect for data tiering between file storage and low-cost object

Hybrid Architecture to Deliver Data Mobility Across Environments

Page 24: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

GxaaS Architecture StrategyMitigate the bad in each cloud … maximize the good … make it look a really big cloud … (seems easy right?)

Ploid™ Object Storage

CPU, GPU,

& FPGA

CPU, GPU, &

TPU

CPU, GPU, &

TPU

FILE SYSTEMFILE SYSTEM

CPU, GPU, & FPGA

CPU, GPU, & FPGA

FILE SYSTEM

10101010 10101010

Scheduler Scheduler Scheduler

Page 25: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

Confidential | 25

Elastifile: Enabling Cloud-Integrated File WorkloadsBurst to cloud to expand resources or offload on-premises infrastructure

Delivering cloudapplication compatibility

Delivering in-cloud data management

Delivering hybrid cloud data mobility

Page 26: Cross-Cloud Distributed File Systems: Bursting File System …storageconference.us/2018/Presentations/Cohen.pdf · 2018-05-15 · Confidential | 4 But, Cloud is a (Wonderfully) Strange

THANKYOU!

Free Your Data,Free Your Business