Towards Elastic Operating Systems

Amit Gupta, Ehab Ababneh, Richard Han, Eric Keller

University of Colorado, Boulder

OS/Process

Resources Limited
• Thrashing
• CPUs limited
• I/O bottlenecks
  • Network
  • Storage

Present Workarounds
• Additional Scripting/Code changes
• Extra Modules/Frameworks
  • Coordination
  • Synch/Aggregating State

OS + Cloud Today

ELB/CloudMgr

Advantages
• Expands available Memory
• Extends the scope of Multithreaded Parallelism (More CPUs available)
• Mitigates I/O bottlenecks
  • Network
  • Storage

Stretch Process

OS/Process

ElasticOS: Our Vision

ElasticOS: Our Goals
• “Elasticity” as an OS Service
• Elasticize all resources: Memory, CPU, Network, …
• Single machine abstraction
  • Apps unaware whether they’re running on 1 machine or 1000 machines
• Simpler Parallelism
• Compatible with an existing OS (e.g., Linux, …)

“Stretched” Process: Unified Address Space

OS/Process

Elastic Page Table

Location

Movable Execution Context

OS/Process

• OS handles elasticity; apps don’t change
• Partition locality across multiple nodes
• Useful for single (and multiple) threads
• For multiple threads, seamlessly exploit network I/O and CPU parallelism

Replicate Code, Partition Data

Data 1

Data 2

CODE CODE

• Unique copy of data (unlike DSM)
• Execution context follows data (unlike Process Migration, SSI)
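The replicate-code, partition-data layout can be illustrated with a minimal sketch. The function name `place_pages`, the page and node names, and the round-robin rule are all assumptions made for illustration; the slides do not specify a placement rule and in fact call for an adaptive clustering algorithm instead.

```python
from typing import Dict, List, Set, Tuple


def place_pages(
    code_pages: List[str], data_pages: List[str], nodes: List[str]
) -> Tuple[Dict[str, Set[str]], Dict[str, str]]:
    """Return (pages resident on each node, owner node of each data page)."""
    # Code pages are read-only, so every node can hold its own replica.
    resident: Dict[str, Set[str]] = {node: set(code_pages) for node in nodes}
    owner: Dict[str, str] = {}
    # Hypothetical rule: round-robin. Each data page gets exactly one owner,
    # so there is a unique copy of the data (no replicas to keep consistent).
    for i, page in enumerate(data_pages):
        node = nodes[i % len(nodes)]
        resident[node].add(page)
        owner[page] = node
    return resident, owner


resident, owner = place_pages(["code"], ["data1", "data2"], ["node1", "node2"])
```

Here both nodes hold the code page, while `data1` and `data2` each live on exactly one node; a real system would presumably choose owners from observed access locality rather than round-robin.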

Exploiting Elastic Locality
• We need an adaptive page clustering algorithm
• LRU, NSWAP, i.e., “always pull”
• Execution follows data, i.e., “always jump”
• Hybrid (Initial): Pull pages, then Jump
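A minimal sketch of the hybrid "pull pages, then jump" idea, assuming a fixed pull threshold. The class name `PullThenJump`, the single pull counter that resets on each jump, and the node and page names are hypothetical; the slides state only that pages are pulled first and that execution jumps once a pulls threshold is reached.

```python
from typing import Dict


class PullThenJump:
    """Toy policy: pull remote pages until a threshold, then jump to the data."""

    def __init__(self, threshold: int, start_node: str, placement: Dict[str, str]):
        self.threshold = threshold          # assumed tunable, not from the slides
        self.current = start_node           # node where execution currently runs
        self.location = dict(placement)     # page -> node holding the unique copy
        self.pulls = 0                      # remote pulls since the last jump

    def access(self, page: str) -> str:
        owner = self.location[page]
        if owner == self.current:
            return "local"
        if self.pulls < self.threshold:
            # Pull first: move the page to the node where execution is.
            self.location[page] = self.current
            self.pulls += 1
            return "pull"
        # Threshold reached: move the execution context to the data instead.
        self.current = owner
        self.pulls = 0
        return "jump"


policy = PullThenJump(
    threshold=2,
    start_node="A",
    placement={"p1": "B", "p2": "B", "p3": "B", "p4": "B"},
)
trace = [policy.access(p) for p in ["p1", "p2", "p3", "p4"]]
```

With a threshold of 2, the first two remote accesses pull pages, the third triggers a jump to node B, and the fourth is then local. Setting the threshold very high approximates "always pull", and setting it to 0 approximates "always jump".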

Status and Future Work
• Complete our initial prototype
• Improve our page placement algorithm
• Improve context jump efficiency
• Investigate Fault Tolerance issues

Thank You

Questions

Contact: amit.gupta@colorado.edu

Algorithm Performance (1)

Algorithm Performance (2)

Page Placement: Multinode Adaptive LRU

[Figure labels: Swap, CPUs, Swap; “Pull First”; “Pulls Threshold Reached!”; “Jump Execution Context”]

Locality in a Single Thread

Swap CPUs Swap

Temporal Locality

Locality across Multiple Threads

Swap CPUs Swap

CPUs Swap

Unlike DSM…

Exploiting Elastic Locality
• Assumptions
  • Replicate Code Pages, Place Data Pages (vs DSM)
• We need an adaptive page clustering algorithm
• LRU, NSWAP
• Us (Initial): Pull pages, then Jump

Replicate Code, Distribute Data

Data 1

Data 2

CODE CODE

• Unique copy of data (vs DSM)
• Execution context follows data (vs Process Migration)

Accessing Data 1

Accessing Data 2

Accessing Data 1

Benefits
• OS handles elasticity; apps don’t change
• Partition locality across multiple nodes
• Useful for single (and multiple) threads
• For multiple threads, seamlessly exploit network I/O and CPU parallelism

Benefits (delete)
• OS handles elasticity
  • Application ideally runs unmodified
• Application is naturally partitioned …
  • By Page Access locality
  • By seamlessly exploiting multithreaded parallelism
  • By intelligent page placement

How should we place pages?

Execution Context Jumping: A single thread example

Address Space

Node 1

Address Space

Node 2

Process

Address Space

Node 1

Address Space

Node 2

Process

V R

Page Table

IP Addr

“Stretch” a Process: Unified Address Space

Operating Systems Today
• Resource Limit = 1 Node

Disks

Process

Cloud Applications at Scale

Cloud Manager

Load Balancer

Process

More Resources?

Process

Process

Framework (e.g., MapReduce)

Partitioned Data

More Queries ?

Our findings
• Important Tradeoff
  • Data Page Pulls vs. Execution Context Jumps
• Latency cost is realistic
• Our Algorithm: Worst case scenario
  • “always pull” == NSWAP
  • marginal improvements

Advantages
• Natural Groupings: Threads & Pages
• Align resources with inherent parallelism
• Leverage existing mechanisms for synchronization

“Stretch” a Process: Unified Address Space

Page Table

A “Stretched” Process = Collection of Pages + Other Resources {Across Several Machines}

IP Addr

(delete) Exec. context follows Data
• Replicate Code Pages
  • Read-Only => No Consistency burden
• Smartly distribute Data Pages
• Execution context can jump
  • Moves towards data
  • *Converse also allowed*

Elasticity in Cloud Apps Today

[Figure labels: Input Data, Output Data, Load Balancer, Input Queries, Output Data]

(delete) Goals: Elasticity dimensions
• Extend Elasticity to
  • Memory
  • CPU
  • I/O
    • Network
    • Storage

Thank You

Bang Head Here !

Stretching a Thread

Overlapping Elastic Processes

*Code Follows Data*

Application Locality

Possible Animation?

Multinode Adaptive LRU

Possible Animation?

Open Topics
• Fault tolerance
• Stack handling
• Dynamic Linked Libraries
• Locking

Elastic Page Table

Virtual Addr   Phy. Addr   Valid   Node (IP addr)
A              B           1       Localhost
C              D           0       Localhost
E              F           1       128.138.60.1
G              H           0       128.138.60.

Labels (in slide order): Local Mem, Swap space, Remote Mem, Remote Swap
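The elastic page table can be read as an ordinary page table entry extended with a node field. The sketch below is one way to model that under stated assumptions: the names `ElasticPTE` and `classify` are hypothetical, `"remote-node"` is a placeholder rather than an address from the slide, and the mapping of (Valid, Node) pairs onto the four location labels is an interpretation of the slide, not something it states outright.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ElasticPTE:
    phys_addr: str  # physical address (symbolic, as on the slide)
    valid: bool     # True: resident in memory; False: swapped out
    node: str       # "localhost" or an identifier for the remote node


def classify(entry: ElasticPTE) -> str:
    """Map an entry to one of the four locations named on the slide."""
    if entry.node == "localhost":
        return "local mem" if entry.valid else "swap space"
    return "remote mem" if entry.valid else "remote swap"


# Rows mirror the slide's table; "remote-node" stands in for the remote IP.
page_table: Dict[str, ElasticPTE] = {
    "A": ElasticPTE("B", True, "localhost"),
    "C": ElasticPTE("D", False, "localhost"),
    "E": ElasticPTE("F", True, "remote-node"),
    "G": ElasticPTE("H", False, "remote-node"),
}

locations = {vaddr: classify(pte) for vaddr, pte in page_table.items()}
```

Under this reading, a lookup that lands on a remote entry is the point where the OS would decide between pulling the page and jumping the execution context.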

“Stretch” a Process
• Move beyond resource boundaries of ONE machine
  • CPU
  • Memory
  • Network, I/O

[Figure labels: Input Data, Output Data]

Reinventing Elasticity Wheel
