19

Hardware Management: Hyperscale Datacenter Systems

  • Upload
    others

  • View
    11

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Hardware Management: Hyperscale Datacenter Systems
Page 2: Hardware Management: Hyperscale Datacenter Systems

H a rd wa r e M a n a g e m e n t : S o f t wa r e A g e n t s i n H y p e r s c a l e D a t a c e n te r S y s te m s

James Malachowski, Datacenter Product & Strategy, Ericsson

Page 3: Hardware Management: Hyperscale Datacenter Systems

• Path to Hyperscale

• Use Cases

• Challenges

• Current Approaches

• Requirements

• Planned OCP Contributions

• Solution Architecture

• Future

Content

Page 4: Hardware Management: Hyperscale Datacenter Systems

Hyperscale

Datacenter

Op

era

tio

na

l E

ffic

iency

Asset Efficiency

Operator

Enterprise

Google

AmazonFacebook

Operators (and Ericsson today)

Cloud giants are playing a different game

Page 5: Hardware Management: Hyperscale Datacenter Systems

Everything is a Datacenter

5G / Edge

ComputingCloud Internet of Things

Page 6: Hardware Management: Hyperscale Datacenter Systems

• Data is high velocity, high volume, low variety

• Things are distributed, time is relative, consensus improbable

• Continuously accelerating technology refresh cycles

• Diverse protocols, complex technologies and multiple vendors

• Massive surface area vulnerable to attack

Unique Challenges

Page 7: Hardware Management: Hyperscale Datacenter Systems

What should I buy? How much capacity?

Improve Performance? Increase

Efficiency?

What do I have? What is it doing?

What did I order? Does it work?

What settings for my

workload?

What OS, packages, drivers and apps?

Informed decision making is impossible Fragmented, inefficiently managed datacenters

How do I run this across my datacenters?

Data center

Life cycle

Page 8: Hardware Management: Hyperscale Datacenter Systems

Current ApproachesCapability Components

Search Solr, Elastic Search

Analysis MapReduce (batch), Spark (real time)

Time Series Data OpenTSDB + HBase

Unstructured Data HBase + HDFS

Structured Data Maria-DB, MySQL

Logs Flume

Service

ManagementZooKeeper/etcd

Clients/Collectors Nagios, Statsd/Collectd, Syslog, Sysdig

Page 9: Hardware Management: Hyperscale Datacenter Systems

Current Approaches

Data Collection

Complete Visibility

Capability Components

Search Solr, Elastic Search

Analysis MapReduce (batch), Spark (real time)

Time Series Data OpenTSDB + HBase

Unstructured Data HBase + HDFS

Structured Data Maria-DB, MySQL

Logs Flume

Service

ManagementZooKeeper/etcd

Clients/Collectors Nagios, Statsd/Collectd, Syslog, Sysdig

Small footprint

Highly scalable

Installs in minutes

Generic data model

Guaranteed performance

Purpose built for machines

Page 10: Hardware Management: Hyperscale Datacenter Systems

• Opensource – owned by the community

• Modern – lightweight, extensible, agnostic

• Standardized – works with existing common tools and interfaces

• Secure – industrialized for the world’s mobile communication networks

• Supported – developed in commercial products, used in production environments

In Need of a Solution…

Page 11: Hardware Management: Hyperscale Datacenter Systems

Ericsson Proposed ContributionClient software for data collection and configuration of x86

machines

Component Description

AgentStandard Linux package for data collection and configuration

of x86 machines

Mini AgentCollector, Filter, Forwarder components for composition of

unique and lightweight data collection use cases

Micro KernelMinimalist bootable Linux kernel + agents and process

automation tool chain for building stateless deployment

CollectorsStandard collectors for x86 Hardware and Linux Operating

Systems

Page 12: Hardware Management: Hyperscale Datacenter Systems

Anatomy of an Agent

COLLECTORS

• /proc

• ipmi

• lsusb

• …

FORMATTERS

• JSON

• XML

• TSV

• …

FORWARDERS

• TCP/UDP

• Graphite, Spark

• Socket

• …

Ericsson Security, Audit & Performance Enhancements

Any Standard Linux Environment (bin, container, microkernel)

Page 13: Hardware Management: Hyperscale Datacenter Systems

• Build once, run anywhere (lowest common denominator)

• No external dependencies

• Use standard tools when available

• Fail hard, restart fast

• Self-configuring, self-updating

• Completely tunable and extensible

Agent Requirements

Page 14: Hardware Management: Hyperscale Datacenter Systems

Architecture

HOST NETWORK

DATA

CO

NTR

OL

OS & Hardware Inventory,

Metrics & LogsSNMP, IPMI, LLDP, DCMI, Redfish, TCP/IP

DNS, DHCP, TFTP, PXE, IPMI, RESTConfiguration of Firmware,

BIOS, RAID & OS

Resource

Management

Agent AgentDatastore

API | GUI | CLI

Page 15: Hardware Management: Hyperscale Datacenter Systems

Co

lle

cti

on

Pro

vis

ion

DC Facility:

Power, Cooling, Building

IT Infrastructure:

Compute, Network, Storage

Hardware Resource Pool

Software Resource Pool

DC Operation & Business support

Data Center

Infrastructure

Management

NMS /

SDN Control

POD/vPOD

Virtual Infra

Management

OSS / BSS

Marketing Finance IT

Other

Business

Partners

Complete

Visibility

Real-Time

Analytics

Process

Automation

Data Center

Simulator

Resource

Management

Software Defined Infrastructure

Page 16: Hardware Management: Hyperscale Datacenter Systems

End Users as of February 2017

Americas

› Undisclosed (NFVi)

› Undisclosed (IT)

› Undisclosed (NFVi)

› Telefónica, Peru (NFVi)

› Telefónica, Colombia (NFVi)

Europe & Middle East

› Swisscom, Switzerland (NFVi)

› Telefonica, Germany (NFVi)

› Undisclosed (NFVi)

› Undisclosed (NFVi)

› Undisclosed (NFVi)

› Undisclosed (NFVi)

› Undisclosed (NFVi)

› RIKS, Estonia (IT)

› Undisclosed (IT)

› Undisclosed (NFVi)

› Undisclosed (NFVi)

› Undisclosed (NFVi)

› Undisclosed (IT)

Asia Pacific

› SKT, Korea (IT)

› Far Eastone, Taiwan (IT)

› Undisclosed (NFVi)

› Telstra, Australia (NFVi)

› Undisclosed (IT)

› VHA, Australia (NFVi)

› Undisclosed (IT)

Africa

› Undisclosed, NFVi, IT

Page 17: Hardware Management: Hyperscale Datacenter Systems

FUTURE: Automated FactoryIntelligent data center

Continuously

improving throughput,

while optimizing cost structure

The Autonomous

Data Center

Self-OrganizingContinuous

Integration

Continuous

DeploymentSelf-Learning

Page 18: Hardware Management: Hyperscale Datacenter Systems
Page 19: Hardware Management: Hyperscale Datacenter Systems