DESCRIPTION
Invited talk by Lamia Youseff at SRCS 2012 - sponsored by the Awareness Initiative: www.aware-project.eu
Self-awareness and Adaptive Technologies: the Future of Operating Systems?
Lamia Youseff*
fos team: Nathan Beckmann, Harshad Kasture, Charles Gruenwald III, Adam Belay, David Wentzlaff**, Lamia Youseff, Jason Miller, Anant Agarwal, and the fos team in collaboration with the Angstrom team.
Live fos web server: http://fos.csail.mit.edu
* Lamia worked on fos while she was a postdoctoral associate with the Carbon group at CSAIL, MIT.
** David worked on fos while he was a Ph.D. candidate with the Carbon group at CSAIL, MIT.
Emerging Computer Architectures Provide Opportunities for the OS
- Clouds
  - Huge computing power
  - Scalability with a "click"
- Multicore
  - Large number of cores
  - Very fast on-chip core-to-core communication
All logos and trademarks appearing in this presentation are the property of their respective owners.
What is Wrong with Current OSs?
- Unclear how to scale contemporary OSs on multicore and manycore
- Programming on an Infrastructure-as-a-Service (IaaS), Amazon EC2 style, cloud is a challenge
Linux Does Not Scale: Page Allocation Example
- Physical page allocation (toy example)
  - Allocate physical pages as quickly as possible
  - Examine the scalability of where the time goes
- Precision timers & OProfile
- 16 cores: a quad quad-core Intel server

On each core:

    #include <stdlib.h>

    /* Sizes are illustrative; the slide leaves them as symbolic constants. */
    #define MB_IN_BYTES        (1024 * 1024)
    #define MEMORY_SIZE_IN_MB  1024
    #define PAGE_SIZE_IN_BYTES 4096

    int main(void) {
        char *big_array;
        long long count;
        big_array = malloc((size_t)MEMORY_SIZE_IN_MB * MB_IN_BYTES);
        /* Touch one byte per page so every page is actually faulted in. */
        for (count = 0;
             count < ((long long)MEMORY_SIZE_IN_MB * MB_IN_BYTES) / PAGE_SIZE_IN_BYTES;
             count++) {
            big_array[count * PAGE_SIZE_IN_BYTES] = count % 256;
        }
        return 0;
    }
Lock Contention Dominates Runtime

[Figure: cycles (in billions, 0-12) vs. number of cores (1-16), broken down by where time is spent: get_page_from_freelist, __pagevec_lru_add_active, __rmqueue_smallest, free_pages_bulk, rmqueue_bulk, main, page_fault, handle_mm_fault, clear_page_c, and other; categorized as lock contention, architectural overhead, and useful work.]
Problem with Current-Day Clouds

Independent Linux VM instances cobbled together in an ad-hoc manner at the application layer.

[Figure: several multicore blades, each running a hypervisor hosting Linux VMs (including an SMP Linux VM) with applications on top, connected through a network switch.]
OSes Need to Be Rethought
- Shared memory and locks lead to poor scalability
- Non-composable due to locks; hard to modify or add services
- OS and application fight for in-core cache
- OS relies on shared memory, which does not work across a cloud
- No resilience to faults; a single failure causes the OS to crash
Manycore is moving from a few cores to hundreds of cores.
Outline
- fos Introduction
  - Vision, Structure and Architecture
  - Anatomy of a malloc()
- Scalability in fos
  - Fleets of servers
  - Example: File System Stack
- Why adaptable and self-aware OS services?
- Adaptability in fos
  - Hybrid Messaging System
  - Elastic Fleets
- Enabling Automatic Self-Awareness in fos
What is fos?
- Scalability and reliability for future multicores with hundreds to thousands of cores
- Provides a single-system image across cloud machines
- Dynamically grows and shrinks services to meet application demands

[Figure: fos running applications across the cores of multiple multicore blades, on top of hypervisors.]

A novel elastic operating system for multicores and clouds.

Provides typical OS services:
- Process management, scheduling of multiple applications, virtual memory management, protection, file systems, networking
Also provides special multicore services:
- Process migration, fault resilience, dynamic goal optimization, elastic resource allocation/deallocation

An Operating System as a Service (OSaaS)
fos Structure

[Figure: applications and OS service fleets (page servers PS, file servers FS, I/O) spread across cores; a user app's "file read" triggers a message to an FS server, which in turn messages a PS server ("need new page").]

- Internet-inspired structure, based on messaging
- OS is a collection of services (e.g., name service, page service PS, file service FS)
- Each service is implemented by a fleet of distributed servers
- Each server is bound to a core
- Application cores message a particular server core to utilize a service
- Server cores collaborate and communicate to implement the needed OS service
fos Architecture
- Microkernel
  - Executes on all cores
  - Provides protection mechanism but not policy
  - Provides memory management mechanism, not policy
- Naming
  - Translates symbolic names to physical destinations
  - Standard high-level API hides location
  - Provides one-to-many maps
  - Provides redirection
  - Provides resilience when a server crashes

[Figure: applications (app1-app5) and fleet members (page servers PS, file system servers FS, name servers NS, a block device driver blk, and a network interface nif) bound to cores, each core running the microkernel, with libfos linking applications to the services.]
fos Architecture
- System Servers
  - Internet-inspired servers
  - Fleets: cooperating processes that provide a service
  - Communicate only via messaging
  - Facilitates transparent communication, inter-machine or intra-machine
  - Easily move server processes anywhere
- Messaging
  - Microkernel-level messaging for core-to-core communication
  - Lightweight user-space messaging
  - Inter-machine vs. intra-machine communication

[Figure: the same fos architecture diagram as on the previous slide.]
Anatomy of a malloc Call in fos

[Figure: a malloc call served locally. The application calls malloc through libc and libfos; a message travels via the microkernel and its name cache to the paging system server on another core of the same blade, and the reply returns the same way (steps 1-6).]
Anatomy of a malloc Call in fos when the Server is on a Remote Machine in the Cloud

[Figure: the same malloc call when the paging system server lives on another blade. The request now passes through the local proxy-network server, across the network to the remote blade's proxy-network server, and on to the paging system server; the reply retraces the route (steps 1-11).]
Scalability in fos
- The three key design guidelines:
  - Inspired by the Internet, the OS is built as a collection of services (e.g., naming and resource discovery, scheduler, file system)
  - Each service is fully distributed with no global shared state or locks, implemented as a collection of cooperating servers
  - OS servers are bound to cores, which eliminates interference between OS and application
fos Scalable Fleet Example: File System Fleet

[Figure: a file system fleet of FS servers spread across cores and machines, with a file system coordinator; applications (app1-app6) reach the fleet through libfos, alongside page server (PS) fleet members.]
Outline
- fos Introduction
  - Vision, Structure and Architecture
  - Anatomy of a malloc()
- Scalability in fos
  - Fleets of servers
  - Example: File System Stack
- New Challenges for Today's OS
- Adaptability in fos
  - Hybrid Messaging System
  - Elastic Fleets
- Enabling Automatic Self-Awareness in fos
New Challenges for Today's OS
- Unprecedented amounts of resources and variables
  - An increased number of resources means an increased number of failures
- Unprecedented variability in demand
  - For the same application, its performance characteristics and requirements usually change during its runtime (e.g., the allocated number of cores)
  - Different simultaneously-executing applications have different performance requirements (e.g., a memory-intensive vs. a compute-intensive app)
[Figure: a conventional OS stack (applications; scheduler, memory manager, file system, device drivers; cores, caches, disk, I/O devices) annotated with the signals and knobs a self-aware system could use: miss rate, cache size and associativity, power, speed, algorithm choice, heartbeats, and goals.]
Building Adaptability and Self-Awareness into fos

[Figure: "fos: A Self-Aware Operating System." An Analysis & Optimization Engine with a learner runs an Observe-Decide-Act loop over the whole stack: it observes applications (heartbeats, goals), OS subsystems (memory manager, file system, device drivers, scheduler), and hardware (cores, caches, DRAM, disk, devices; activity, power, temperature, miss rate), and acts through knobs such as voltage/frequency/precision, cache size and associativity, speed, and algorithm choice.]
- The fos Analysis and Optimization Engine is a closed feedback loop of Observe, Decide, Act (ODA)
- Each component provides a new OS service:
  - The Observer provides vital-signs services
  - The Decision engine provides performance models and an AI learning engine
  - The Actuators change system status
- Each component is either implemented as a fos fleet or integrated into another fleet
[Figure: the Analysis & Optimization Engine as a loop: Vital Signs (e.g., heartbeats) feed Observe; the Decision Engine (e.g., performance models, control system, AI learner) performs Decide; Actuators perform Act.]
Building Automatic Self-Awareness into fos

The Analysis & Optimization Engine acts on:
- Applications (e.g., algorithms)
- OS sub-systems (e.g., growing fleets)
- Hardware components (e.g., frequency scaling)

Vital Signs Fleet
- Observes the status of software and hardware components
- Provides a global knowledge-base of the system
- Implemented as a fleet, storing the status information in a distributed data object (a key-value store)
Sefos: Building Self-Awareness into fos

The Vital Signs service collects measurements from:
1. Applications, e.g., app heartbeats and application-specific measurements (fps in a video encoder or flops in a scientific app)
2. OS subsystems, e.g., the utilization ratio of the file system fleet
3. Hardware components, e.g., temperature, core frequency, power, cache miss rate
Decision Engine
- A new OS service in fos
- Implemented as a fleet of servers for scalability
- Processes input from the Vital Signs service
- Provides three approaches to runtime decision making:
  - Machine learning
  - Classical control theory
  - Performance models
- A uniform API across the mechanisms allows them to be switched at runtime and composed
Actuators
- Actions that allow the system to adapt at runtime
- Implemented as extensions to fos fleets; hardware actions go through OS tools, plus application-specific actions
- Three sets of actions, based on their impact:
  - Application actions
    - Allocating or de-allocating cores to an application
    - Switching between algorithms
    - Migrating processes for better data locality and cache usage
  - OS service actions
    - Growing and shrinking fleets
    - Migrating servers
  - Hardware actions
    - Frequency scaling to save power
Adaptability in fos
- fos adapts to the varying demand in multicores and clouds:
  - Hybrid Messaging
  - Elastic Fleets
I. Messaging in fos
- Messaging in fos provides IPC through a message-passing abstraction
- Mailbox-based; each mailbox is associated with a name and a capability
- Can be implemented via a variety of mechanisms, multiplexed via the libfos library layer
- Transparent to the application
- fos currently supports three mechanisms
Messaging Mechanisms in fos
- Kernel messaging
  - Implemented in the microkernel via shared memory
  - Messages are sent by trapping into the microkernel
- User-level messaging
  - A channel-based mechanism via URPC
  - Runs entirely in user space
  - Lower messaging latency, at the cost of an initial overhead for channel creation
- Inter-machine messaging
  - The message goes through the proxy server
  - The proxy server routes the message to the correct machine
Hybrid Messaging
- fos adapts automatically among the various messaging mechanisms

[Figure: cumulative cycles (5×10^4 to 1×10^5) vs. number of messages for the different messaging mechanisms.]
II. Elastic Fleets
- fos fleets can grow and shrink to meet varying demand
- A fleet can grow
  - New servers are spawned on new cores
  - The naming service provides load balancing
  - Requests are directed to the new fleet member
- A fleet can shrink
  - Load is distributed to the other fleet members
fos Name Service Enables Elasticity

[Figure: a user app, page server fleet, and file servers FS1 and FS2 registered with a name server NS.]

File System Service elasticity example:
1. The name server points to File Server 1
2. The application request rate rises; the File Service spawns a new File Server 2
3. The Name Server directs file system accesses to the new server, transparently to the application

The Name Service enables service fleets to grow elastically with demand.
fos File System Service Elasticity
- fos services can grow and shrink to match demand
- A synthetic benchmark app makes requests at a varying rate
  - 2 clients, each repeatedly opens a file, reads 1 KB, and closes the file
  - Clients increase the request rate until the midpoint, then decrease it
- The fos file system grows from 1 to 2 servers to match demand, then shrinks as demand lessens
[Figure (built up over three slides): the fleet serves requests with 1 server, grows to 2 servers as the request rate rises, then shrinks back to 1 server as it falls.]
An Early Prototype of a Self-Aware FS Service
- Observer: leverage fs fleet utilization to indicate the load on the OS subsystem
- Decision Engine: use simplified high and low watermarks
- Actions: grow and shrink the fleet
[Figure: system throughput (transactions per million cycles, 0-50) and fleet size (1-4) vs. number of transactions (0-10,000).]
Conclusions
- New manycore and cloud architectures present new challenges for the OS community:
  - Scalability
  - Varying demand
- fos is a new factored operating system that:
  - Addresses scalability through the fleet design
  - Addresses varying demand through the analysis and optimization engine
- We designed the analysis and optimization engine as a new OS service in the form of a closed ODA loop
- We demonstrated how adaptability and self-awareness can be achieved through the analysis and optimization engine in two toy prototypes:
  - Hybrid messaging
  - Elastic fs fleet
Questions

Live fos web server at http://fos.csail.mit.edu
Hidden Slides
Elastic Fleets Example: fos File System
- 2 clients, each repeatedly opens a file, reads 1 KB, and closes the file
- Clients increase the request rate until the midpoint, then decrease it

[Figure: throughput (transactions per 10^6 cycles, 0-4) vs. cycles elapsed (0 to 6×10^9); the fleet grows from 1 to 2 servers, then shrinks back to 1.]
[Repeats of the two "Anatomy of a malloc Call in fos" diagrams shown earlier.]
Overview of the Sefos ODA Loop
O: Heartbeats service (sensors or observers), i.e., the knowledge-base of the system
- Two sets of APIs:
  - Collecting status updates from apps and system services
  - Responding to queries about the apps' and system services' status
D: Decision engine service (deciders or controllers)
- A generalized form of the scheduling services
- Deploys control-theoretic and AI models to make decisions
A: Variables (actions or actuators), e.g., leveraging:
- Elasticity: growing and shrinking fleets
- Spatial locality: by migrating processes
- Improving performance: by controlling the number of cores allocated to a service/app
- Changing core frequency: to save overall power
O: Heartbeats Service
- A new OS service that keeps the current status of the system
  - App process statistics
  - Other system fleets' statistics (utilization level, incoming queue length, etc.)
  - Hardware component status (temperature, core frequencies, etc.)
- Implemented as a fleet of servers
- Functionality:
  1. Collects heartbeats from different applications and other system services, via the heartbeats API
  2. Responds to inquiries about overall system status or a specific application's heartbeats, via the heartbeats_monitor API
D: Decision Engine
- Another OS service; a generalized form of the scheduling service
- Can maximize overall performance, minimize power, or pursue other goals
- Queries the heartbeats service for system status
- Makes decisions using:
  - A closed feedback loop that decides on the best course of action to achieve a certain goal
    - Works best for changing one variable at a time
    - The model can become very difficult to understand as the number of goals and variables increases
  - Alternatively, an AI model
    - Works best for changing several variables at a time
A: Actions
- Each sub-system provides a set of actions (variables)
- Three types of actions, according to their direct impact:
  - App actions, e.g.:
    - Allocating or de-allocating cores to an application
    - Migrating app processes for better spatial allocation
      - To move a process to its data: improve cache efficiency
      - To decrease communication overhead (in asymmetric machines)
  - System service actions, e.g.:
    - Leverage fleet elasticity: grow and shrink the fleet
    - Migrate servers to improve spatial locality
  - Hardware actions, e.g.:
    - Frequency scaling to save power