Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Network Engineering Group Update
Joe Metzger, Network Engineer
ESnet Network Engineering Group
ESCC
January 17 2013
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Outline
Current Network Status
Short-term next steps (6 months)
What comes after that?
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Current Status ESnet5 Transport Network
Shared Optical System • All common components deployed & commissioned • Dec 4th Optical System Inventory report has 6495 components!
− 341 nodes total, 60+ are add/drop/regen − 80% are common, the rest are dedicated to ESnet or Internet2 (over 500 are XFPs.) − Sunnyvale has 4 32-slot shelves − Sacramento is a 7-direction node
• Still refining processes for sharing spectrum, chassis, configuration, management, etc.
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
ESnet5 Optical System January 2012
Geography is only representational
SUNN
SEAT
LOSA
BOIS
EUGE
PORT
STLO
INDI
EQCH
CHIC
STAR
HOUS
DALL
TULS
KANS
SANA
GOOD
ELPA
ALBQ
DENV
PHOE
LASV
ECHOSALT
EURERENO
BOSTALBA
BUFF
PITT
CLEV
SOUT
JACK
ATLA
CHAT
BATOHOUL
JKMS
MEMPNASH
LOUI
CINC
CHAR
RALE
WASH
PHIL
AOFA
NEWY
ASHB
ANL
FNAL
ORNL
SNLL
NERSC
LBNL
JGI
SLAC
ESnet5 Optical Network
10:8 1/3/2013
44 Lambdas
61 Lambdas
100G+ Lit
Add Drop Node (Ciena)
ORNL Express Node (Ciena)
BNL
88 Lambdas
LIMAN Node (Infinera)
SAND
PEBL
SACR
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
1/11/13 5
ESnet5 Optical System January 2012 Lit Waves
SUNN
SEAT
LOSA
BOIS
EUGE
PORT
STLO
INDI
EQCH
CHIC
STAR
HOUS
DALL
TULS
KANS
SANA
GOOD
ELPA
ALBQ
DENV
PHOE
LASV
ECHOSALT
EURERENO
BOSTALBA
BUFF
PITT
CLEV
SOUT
JACK
ATLA
CHAT
BATOHOUL
JKMS
MEMPNASH
LOUI
CINC
CHAR
RALE
WASH
PHIL
AOFA
NEWY
ASHB
ANL
FNAL
ORNL
SNLL
NERSC
LBNL
JGI
SLAC
ESnet5 Optical Network
10:54 1/10/2013
44 Lambdas
61 Lambdas
100G+ Lit
Add Drop Node (Ciena)
100G Wave < 1000km
100G Wave > 1000km
10G Wave with optical protection
4x10G mux’d Wave
X2
X2
ORNL Express Node (Ciena)
BNL
88 Lambdas
LIMAN Node (Infinera)
SAND
PEBL
SACR
X3X2
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
ESnet5 Routed Network
Routers • 16 new Alcatel Lucent (ALU) 7750-SR12 − 10-slot router with up to 200G per slot − 56 100G interfaces & 200+ 10G interfaces
• 35 existing Juniper MX’s − Used in 10G hubs, commercial exchange points, sites
• 12 existing Juniper M7i & M10i − For terminating links slower than GE
• 5 very old Cisco 7206 − For terminating links slower than GE
Services • Standard routed IP (including full Internet services) • Point to Point Dynamic Virtual Circuits using OSCARS • Various overlay networks (Private VPN’s, LHCONE VRF)
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
SNLL
PNNL
LANL
SNLA
JLAB
PPPL
BNL
AMES
LLNL
JGI
LBNL ANL
Salt Lake
GFDL PU Physics
SEAT
STA
R
commercial peering points R&E network peering locations
ALBQ
LASV
SDSC
LOSA
Routed IP 100 Gb/s Routed IP 4 X 10 Gb/s 3rd party 10Gb/s Express / metro 100 Gb/s Express / metro 10G Express multi path 10G Lab supplied links Other links Tail circuits
Major Office of Science (SC) sites LBNL Major non-SC DOE sites LLNL
CLE
V
ESnet optical transport nodes (only some are shown)
ESnet managed 10G router 10 ESnet managed 100G routers 100
Site managed routers 100 10
10 10
10
10
100 10
10 10
1
100
100
100
100
10
10
100
100
10
10
10
10
10
100
100 10
10
100 10
100
SUNN ESnet PoP/hub locations
LOSA ESnet optical node locations (only some are shown)
10
100
100
100
100
100
100 E
QC
H
Geography is only representational
ESnet5 January 2013
1
100
10
SUNN
100
100 100
10
100
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
1/17/13 8
Example: Optical System in Sunnyvale Esnet Colo at SUNN Level3
DWDMSuite 119 # 02-01, 02 & 04!"#$%&'()%*+%"&(,,'(-.,/
Page DWDM
SUNN-CienaShelf#2
Fiber Routing Channel and Air Plenum
3x60A PWR 3x60A PWR
Access Panel
24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934
4 51 2 3 6 7 8 14 1511 1213
16 17 1918
SUNN-CienaShelf#1
Fiber Routing Channel and Air Plenum
3x60A PWR 3x60A PWR
Access Panel
24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934
4 51 2 3 6 7 8 14 1511 12 13 16 17 1918OCLD
OCI
Rack #01.01 Rack #01.02
!"#$%&'()%*+%"&(,,'(-.,/ 012#3#4*
DIA
SUNN-CienaShelf#3
Fiber Routing Channel and Air Plenum
3x60A PWR 3x60A PWR
Access Panel
24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934
4 51 2 3 6 7 8 1415
11 12 13 16 17 1918
OCLDOCI
Rack #01.03
OCLD
OCI
OCLDOCI
SUNN-CR5-10/1/3 NERSC-MR2 10G
SUNN-CR5-10/1/4 SLAC-MR2 10G
SUNN-CR5-10/1/5 SNLL-MR2 10G
SUNN-CR5-9/1/1 JGI-MR2 10G
SUNN-CR5-9/1/2 LBNL-MR2 10G
SUNN-CR5-9/1/3 LLNL-MR2 10G
10G
10G
10G
10G
10G
10G
OCI OCLD
OCLDOCI
11-2 NERSC – 10/1/3
Cha
n 2
NE
RS
CC
han
4 JG
I
7-2 JGI – 9/1/1
8-2 LBNL– 9/1/1
Cha
n 6
LBN
L
12-2 SLAC– 10/1/4
Cha
n 8
SLA
C
13-2 SNLL– 10/1/5
Cha
n 20
S
NLL
24-2 LLNL – 9/1/3
Chan 18 LLNL
Chan 84
Chan 71Chan 72
SUNN-CR5-1/1/1 DENV 100G
SUNN-CR5-2/1/1 NERSC 100G
SUNN-CR5-3/1/1 ELPA 100G
SHELF#1
SHELF#1
SHELF#1 Slot 21 & 22/23
OCI OCLDOCI OCLD
Sun
n-cr
5
Sun
n-cr
5
SUNN-CienaShelf#4
Fiber Routing Channel and Air Plenum
3x60A PWR 3x60A PWR
Access Panel
24 2521 22 23 26 27 28 3531 32 33 36 37 38 3934
4 51 2 3 6 7 8 14 1511 1213
16 17 1918
Rack #02.04
OCI OCLD OCI OCLD
10G
10G
Future
10G
Future
10G
Future
10G
Future
SUNN-CR5-4/1/1 LBL 100GSHELF#4 Slot 4 & 5/6
Chan 70
SUNN-CR5-5/1/1 NERSC 100GSHELF#4 Slot 1 & 2/3
Chan 69
SUNN-ASW1-xe-0/1/0 SCAR future10G
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Next 6 Months
• 100G production connections to: − ANL, BNL, FNAL, LBNL, LLNL, NERSC & ORNL
• 100G production connections to exchange points & R&E peers: − MANLAN, Starlight, PACWAVE, WIX, Internet2
• 40G into Equinix Ashburn & re-arranging our Washington DC ring to provide diverse backbone connections for JLAB & other sites in the area
• Diverse fiber laterals & diverse optical nodes at ANL & FNAL • Deploying diverse routers at ORNL • Lots of cleanup & consolidation at the hubs, moving connections from
the MX’s to the ALUs • Normalize 100G Testbed infrastructure • Swap out our un-supported ‘third party’ 10x10 MSA CFPs in our
Ciena interfaces with Ciena supported ones covered by 4-hour on-site maintenance contract
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Disclaimer for rest of this slide deck
• The rest of this slide deck makes all sorts of assumptions − Some have been carefully thought out − Some haven’t had much thought at all − Some we don’t even realize we are making − Some of them are sure to be violated
• It is intended to spur thought and discussion. − Not to come up with the right answers today, but to make sure
we are not missing any important questions.
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Backbone & Traffic Metrics
Backbone Bisection Bandwidth: Sum of inbound traffic crossing red line from Sites & Peers, converted to Gb/s
ESnet
Site Peers Site
Site Site
Peers
Peers
Peers
Average Accepted Traffic:
Sum of Links crossing the red line
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
1/17/13 13
Projections for next 7 years
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
1/17/13 14
Projections for next 7 years
ESnet4
ESnet5
Next Wave Fall 2013
(Testbed?)
10x100G on all routes by 2017
Optical System full in 2020 at
88x100G assuming no 400G Waves
ANI Prototype
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
1/17/13 15
Cost Inflection Points next 7 years
ESnet4
ESnet5
ANI Prototype
Routed network exceeds ESnet4 complexity 2016
Hit optical system capacity at current spectral efficiency
in 2020 Need to start
deploying next generation network
2017 or 2018
Add routers & optical chassis incrementally
with wave adds from 2015 on
Contract on Shared Optical System
expires 10/31/19
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Things to consider
• How much ‘head-room’ is enough − Need less ‘bisection bandwidth’ if traffic is local vs cross-country − Need more for unpredictable users − Need less for predictable steady-state users
• Could automated science work-flow systems be more predictable, and sustain utilization >75%?
• Some changes that could drive growth faster than projected − Science DMZ adoption reducing impacts of poorly performing
firewalls & security infrastructure − PerfSONAR adoption & deployment allowing people to discover &
fix broken infrastructure − Data mobility tools enhancement & adoption − Commercial Adoption rate of 100GE will affect the price curve of
100G components driving up lab adoption rates − Changes in science data access models could affect growth rates
in either direction
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Evolution of ESnet5
• Simplifying Hubs where possible − Removing ‘extra’ points of failure
• Optical Express circuits − Some circuits could bypass all routers
• Expanding International reach, capacity & control
• For now, we have collapsed Nx10GE of IP & SDN traffic onto single 100G circuits using QOS for logical separation.
• We anticipate moving back to physical separation of traffic onto different links as additional circuits are added.
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Evolution of Site Interconnections
Goal: provide the best possible service within the constraints − Dimensions of ‘best’ that are important:
• Bandwidth • Reliability • Predictability • Risk tolerance • Debug-ability • Flexibility • Cost
− Discussion Question: What are the comparative weights of the different dimensions?
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
SiteHub
Sub GE Connected Site (Pantex or SRS)
ALU 7750
Other ESnet5Backbone Nodes 100G
Other ESnet5Backbone Nodes
100G Site Router1
10G OC3/DS3
ex4200 m7i / m10I
GE
m7i / m10I
GE/FE
Dem
arc
SiteESnetDiversity Examples
Site Bldg 2
Hub
Site Bldg 1
Hub
100 GE + 10GE Connected Site
ALU 7750
Other ESnet5Backbone Nodes 100G
Other ESnet5Backbone Nodes 100G
Site Router1
10G
10G
Dem
arc
SiteESnet
ALU 7750
10G
Juniper MX
Site Router2
10G
100G
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Which is better?
Site
Hub
Site
Hub
ALU 7750
Other ESnet5Backbone Nodes 100G
Other ESnet5Backbone Nodes 100G
Site Router1
10G
10G
Dem
arc
SiteESnet
ALU 7750
10G
Juniper MX
Site Router2
10G
100G
Hub
Hub
ALU 7750
Other ESnet5Backbone Nodes 100G
Other ESnet5Backbone Nodes 100G
Site Router1
Dem
arc
SiteESnet
ALU 7750
Site Router2
100G
SiteESnet
100G
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Site Connections beyond 200G
1/17/13 21
Hub
SiteHub
ALU 7750
Other ESnet5Backbone Nodes Nx100G
Other ESnet5Backbone Nodes Nx100G
Site Router1
Dem
arc
ALU 7750
Site Router2
Nx100GE
SiteESnet
Nx100GE
Is the architecture this simple?
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Possible Additions to Network Services Portfolio
• Looking at additional types of overlay networks − Layer2 VPNs
• Point to Point • Multipoint
− Layer3 VPN’s • IPV4 and IPV6
− We haven’t figured out which subset we need to support
− Please let us know if you anticipate future requirements for overlay networks so we can take your needs into consideration
Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science
Additional NEG Updates
DNS DOS Rate Limiting
Improving site interactions
• Trying to figure out how to move from ad-hoc and event driven interactions to something that is more organized and intentional
• NEG Ambassador?
1/17/13 23