24
Network Engineering Group Update Joe Metzger, Network Engineer ESnet Network Engineering Group ESCC January 17 2013

Network Engineering Group Update

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Network Engineering Group Update

Joe Metzger, Network Engineer

ESnet Network Engineering Group

ESCC

January 17 2013

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Outline

Current Network Status

Short-term next steps (6 months)

What comes after that?

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Current Status ESnet5 Transport Network

Shared Optical System •  All common components deployed & commissioned •  Dec 4th Optical System Inventory report has 6495 components!

−  341 nodes total, 60+ are add/drop/regen −  80% are common, the rest are dedicated to ESnet or Internet2 (over 500 are XFPs.) −  Sunnyvale has 4 32-slot shelves −  Sacramento is a 7-direction node

•  Still refining processes for sharing spectrum, chassis, configuration, management, etc.

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

ESnet5 Optical System January 2012

Geography is only representational

SUNN

SEAT

LOSA

BOIS

EUGE

PORT

STLO

INDI

EQCH

CHIC

STAR

HOUS

DALL

TULS

KANS

SANA

GOOD

ELPA

ALBQ

DENV

PHOE

LASV

ECHOSALT

EURERENO

BOSTALBA

BUFF

PITT

CLEV

SOUT

JACK

ATLA

CHAT

BATOHOUL

JKMS

MEMPNASH

LOUI

CINC

CHAR

RALE

WASH

PHIL

AOFA

NEWY

ASHB

ANL

FNAL

ORNL

SNLL

NERSC

LBNL

JGI

SLAC

ESnet5 Optical Network

10:8 1/3/2013

44 Lambdas

61 Lambdas

100G+ Lit

Add Drop Node (Ciena)

ORNL Express Node (Ciena)

BNL

88 Lambdas

LIMAN Node (Infinera)

SAND

PEBL

SACR

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

1/11/13 5

ESnet5 Optical System January 2012 Lit Waves

SUNN

SEAT

LOSA

BOIS

EUGE

PORT

STLO

INDI

EQCH

CHIC

STAR

HOUS

DALL

TULS

KANS

SANA

GOOD

ELPA

ALBQ

DENV

PHOE

LASV

ECHOSALT

EURERENO

BOSTALBA

BUFF

PITT

CLEV

SOUT

JACK

ATLA

CHAT

BATOHOUL

JKMS

MEMPNASH

LOUI

CINC

CHAR

RALE

WASH

PHIL

AOFA

NEWY

ASHB

ANL

FNAL

ORNL

SNLL

NERSC

LBNL

JGI

SLAC

ESnet5 Optical Network

10:54 1/10/2013

44 Lambdas

61 Lambdas

100G+ Lit

Add Drop Node (Ciena)

100G Wave < 1000km

100G Wave > 1000km

10G Wave with optical protection

4x10G mux’d Wave

X2

X2

ORNL Express Node (Ciena)

BNL

88 Lambdas

LIMAN Node (Infinera)

SAND

PEBL

SACR

X3X2

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

ESnet5 Routed Network

Routers •  16 new Alcatel Lucent (ALU) 7750-SR12 −  10-slot router with up to 200G per slot −  56 100G interfaces & 200+ 10G interfaces

•  35 existing Juniper MX’s −  Used in 10G hubs, commercial exchange points, sites

•  12 existing Juniper M7i & M10i −  For terminating links slower than GE

•  5 very old Cisco 7206 −  For terminating links slower than GE

Services •  Standard routed IP (including full Internet services) •  Point to Point Dynamic Virtual Circuits using OSCARS •  Various overlay networks (Private VPN’s, LHCONE VRF)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

SNLL

PNNL

LANL

SNLA

JLAB

PPPL

BNL

AMES

LLNL

JGI

LBNL ANL

Salt Lake

GFDL PU Physics

SEAT

STA

R

commercial peering points R&E network peering locations

ALBQ

LASV

SDSC

LOSA

Routed IP 100 Gb/s Routed IP 4 X 10 Gb/s 3rd party 10Gb/s Express / metro 100 Gb/s Express / metro 10G Express multi path 10G Lab supplied links Other links Tail circuits

Major Office of Science (SC) sites LBNL Major non-SC DOE sites LLNL

CLE

V

ESnet optical transport nodes (only some are shown)

ESnet managed 10G router 10 ESnet managed 100G routers 100  

Site managed routers 100   10

10 10

10

10

100  10

10 10

1

100 

100 

100 

100 

10

10

100 

100 

10

10

10

10

10

100  

100  10

10

100  10

100 

SUNN ESnet PoP/hub locations

LOSA ESnet optical node locations (only some are shown)

10

100  

100  

100 

100 

100  

100  E

QC

H

Geography is only representational

ESnet5 January 2013

1

100  

10

SUNN

100 

100  100  

10

100 

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

1/17/13 8

Example: Optical System in Sunnyvale Esnet Colo at SUNN Level3

DWDMSuite 119 # 02-01, 02 & 04!"#$%&'()%*+%"&(,,'(-.,/

Page DWDM

SUNN-CienaShelf#2

Fiber Routing Channel and Air Plenum

3x60A PWR 3x60A PWR

Access Panel

24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934

4 51 2 3 6 7 8 14 1511 1213

16 17 1918

SUNN-CienaShelf#1

Fiber Routing Channel and Air Plenum

3x60A PWR 3x60A PWR

Access Panel

24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934

4 51 2 3 6 7 8 14 1511 12 13 16 17 1918OCLD

OCI

Rack #01.01 Rack #01.02

!"#$%&'()%*+%"&(,,'(-.,/ 012#3#4*

DIA

SUNN-CienaShelf#3

Fiber Routing Channel and Air Plenum

3x60A PWR 3x60A PWR

Access Panel

24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934

4 51 2 3 6 7 8 1415

11 12 13 16 17 1918

OCLDOCI

Rack #01.03

OCLD

OCI

OCLDOCI

SUNN-CR5-10/1/3 NERSC-MR2 10G

SUNN-CR5-10/1/4 SLAC-MR2 10G

SUNN-CR5-10/1/5 SNLL-MR2 10G

SUNN-CR5-9/1/1 JGI-MR2 10G

SUNN-CR5-9/1/2 LBNL-MR2 10G

SUNN-CR5-9/1/3 LLNL-MR2 10G

10G

10G

10G

10G

10G

10G

OCI OCLD

OCLDOCI

11-2 NERSC – 10/1/3

Cha

n 2

NE

RS

CC

han

4 JG

I

7-2 JGI – 9/1/1

8-2 LBNL– 9/1/1

Cha

n 6

LBN

L

12-2 SLAC– 10/1/4

Cha

n 8

SLA

C

13-2 SNLL– 10/1/5

Cha

n 20

S

NLL

24-2 LLNL – 9/1/3

Chan 18 LLNL

Chan 84

Chan 71Chan 72

SUNN-CR5-1/1/1 DENV 100G

SUNN-CR5-2/1/1 NERSC 100G

SUNN-CR5-3/1/1 ELPA 100G

SHELF#1

SHELF#1

SHELF#1 Slot 21 & 22/23

OCI OCLDOCI OCLD

Sun

n-cr

5

Sun

n-cr

5

SUNN-CienaShelf#4

Fiber Routing Channel and Air Plenum

3x60A PWR 3x60A PWR

Access Panel

24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934

4 51 2 3 6 7 8 14 1511 1213

16 17 1918

Rack #02.04

OCI OCLD OCI OCLD

10G

10G

Future

10G

Future

10G

Future

10G

Future

SUNN-CR5-4/1/1 LBL 100GSHELF#4 Slot 4 & 5/6

Chan 70

SUNN-CR5-5/1/1 NERSC 100GSHELF#4 Slot 1 & 2/3

Chan 69

SUNN-ASW1-xe-0/1/0 SCAR future10G

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Next 6 Months

•  100G production connections to: −  ANL, BNL, FNAL, LBNL, LLNL, NERSC & ORNL

•  100G production connections to exchange points & R&E peers: −  MANLAN, Starlight, PACWAVE, WIX, Internet2

•  40G into Equinix Ashburn & re-arranging our Washington DC ring to provide diverse backbone connections for JLAB & other sites in the area

•  Diverse fiber laterals & diverse optical nodes at ANL & FNAL •  Deploying diverse routers at ORNL •  Lots of cleanup & consolidation at the hubs, moving connections from

the MX’s to the ALUs •  Normalize 100G Testbed infrastructure •  Swap out our un-supported ‘third party’ 10x10 MSA CFPs in our

Ciena interfaces with Ciena supported ones covered by 4-hour on-site maintenance contract

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Disclaimer for rest of this slide deck

•  The rest of this slide deck makes all sorts of assumptions −  Some have been carefully thought out −  Some haven’t had much thought at all −  Some we don’t even realize we are making −  Some of them are sure to be violated

•  It is intended to spur thought and discussion. −  Not to come up with the right answers today, but to make sure

we are not missing any important questions.

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Backbone & Traffic Metrics

Backbone Bisection Bandwidth: Sum of inbound traffic crossing red line from Sites & Peers, converted to Gb/s

ESnet

Site Peers Site

Site Site

Peers

Peers

Peers

Average Accepted Traffic:

Sum of Links crossing the red line

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

1/17/13 13

Projections for next 7 years

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

1/17/13 14

Projections for next 7 years

ESnet4

ESnet5

Next Wave Fall 2013

(Testbed?)

10x100G on all routes by 2017

Optical System full in 2020 at

88x100G assuming no 400G Waves

ANI Prototype

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

1/17/13 15

Cost Inflection Points next 7 years

ESnet4

ESnet5

ANI Prototype

Routed network exceeds ESnet4 complexity 2016

Hit optical system capacity at current spectral efficiency

in 2020 Need to start

deploying next generation network

2017 or 2018

Add routers & optical chassis incrementally

with wave adds from 2015 on

Contract on Shared Optical System

expires 10/31/19

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Things to consider

•  How much ‘head-room’ is enough −  Need less ‘bisection bandwidth’ if traffic is local vs cross-country −  Need more for unpredictable users −  Need less for predictable steady-state users

•  Could automated science work-flow systems be more predictable, and sustain utilization >75%?

•  Some changes that could drive growth faster than projected −  Science DMZ adoption reducing impacts of poorly performing

firewalls & security infrastructure −  PerfSONAR adoption & deployment allowing people to discover &

fix broken infrastructure −  Data mobility tools enhancement & adoption −  Commercial Adoption rate of 100GE will affect the price curve of

100G components driving up lab adoption rates −  Changes in science data access models could affect growth rates

in either direction

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Evolution of ESnet5

•  Simplifying Hubs where possible −  Removing ‘extra’ points of failure

•  Optical Express circuits −  Some circuits could bypass all routers

•  Expanding International reach, capacity & control

•  For now, we have collapsed Nx10GE of IP & SDN traffic onto single 100G circuits using QOS for logical separation.

•  We anticipate moving back to physical separation of traffic onto different links as additional circuits are added.

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Evolution of Site Interconnections

Goal: provide the best possible service within the constraints −  Dimensions of ‘best’ that are important:

•  Bandwidth •  Reliability •  Predictability •  Risk tolerance •  Debug-ability •  Flexibility •  Cost

−  Discussion Question: What are the comparative weights of the different dimensions?

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

SiteHub

Sub GE Connected Site (Pantex or SRS)

ALU 7750

Other ESnet5Backbone Nodes 100G

Other ESnet5Backbone Nodes

100G Site Router1

10G OC3/DS3

ex4200 m7i / m10I

GE

m7i / m10I

GE/FE

Dem

arc

SiteESnetDiversity Examples

Site Bldg 2

Hub

Site Bldg 1

Hub

100 GE + 10GE Connected Site

ALU 7750

Other ESnet5Backbone Nodes 100G

Other ESnet5Backbone Nodes 100G

Site Router1

10G

10G

Dem

arc

SiteESnet

ALU 7750

10G

Juniper MX

Site Router2

10G

100G

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Which is better?

Site

Hub

Site

Hub

ALU 7750

Other ESnet5Backbone Nodes 100G

Other ESnet5Backbone Nodes 100G

Site Router1

10G

10G

Dem

arc

SiteESnet

ALU 7750

10G

Juniper MX

Site Router2

10G

100G

Hub

Hub

ALU 7750

Other ESnet5Backbone Nodes 100G

Other ESnet5Backbone Nodes 100G

Site Router1

Dem

arc

SiteESnet

ALU 7750

Site Router2

100G

SiteESnet

100G

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Site Connections beyond 200G

1/17/13 21

Hub

SiteHub

ALU 7750

Other ESnet5Backbone Nodes Nx100G

Other ESnet5Backbone Nodes Nx100G

Site Router1

Dem

arc

ALU 7750

Site Router2

Nx100GE

SiteESnet

Nx100GE

Is the architecture this simple?

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Possible Additions to Network Services Portfolio

•  Looking at additional types of overlay networks −  Layer2 VPNs

•  Point to Point •  Multipoint

−  Layer3 VPN’s •  IPV4 and IPV6

−  We haven’t figured out which subset we need to support

−  Please let us know if you anticipate future requirements for overlay networks so we can take your needs into consideration

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Additional NEG Updates

DNS DOS Rate Limiting

Improving site interactions

•  Trying to figure out how to move from ad-hoc and event driven interactions to something that is more organized and intentional

•  NEG Ambassador?

1/17/13 23

Questions?

Thanks!

Joe Metzger – [email protected]

http://www.es.net/

http://fasterdata.es.net/