42
Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University www.dsn.jhu.edu Yair Amir Don P. Giddens Lecture

Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University Yair Amir

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

DistributedSystemsandNetworkslabDepartmentofComputerScience,JohnsHopkinsUniversity

www.dsn.jhu.edu

YairAmirDonP.GiddensLecture

Page 2: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Theoneandonly–MichalMiskin‐Amir•  Theadvisor–DannyDolev•  Theprofessors–

–  M.Melliar‐Smith,L.Moser,K.Birman,A.Brodsky,Y.Yemini

•  Thestudent‐colleagues‐–  R.Borgstrom,J.Stanton,D.Shaw,J.Green,J.Schultz,T.Schlossnagle,A.Peterson–  C.Nita‐Rotaru,C.Danilov,C.Tutu,R.Caudy,A.Munjal,M.Hilsdale–  N.Rivera,J.Lane,R.Musaloiu‐EleNeri,J.Kirsch,M.Kaplan–  D.Obenshain,T.TanQllo

•  Thego‐toexperts‐–  B.Awerbuch,A.Barak,G.Tsudik,S.Goose,A.Terzis,B.Coan,R.Ostrovsky

•  Theentrepreneurs‐–  M.Khan,Y.Javadi,S.Goose

•  TheHopkinsprofessors‐–  B.Awerbuch,G.Mason,R.Kosaraju,S.Smith,M.Goodrich,R.Westgate

•  Theprogrammanagers––  D.Maughan,T.Gibson,C.Landwehr,H.Shrobe

2February16,2012 YairAmir

Page 3: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Asingle,mulQ‐purpose,IP‐basednetwork– EachaddiQonalnodeincreasesitsreachandusefulness(similartoanynetwork)

– EachaddiQonalapplicaQondomainincreasesitseconomicadvantage

– Willthereforeswallowmostothernetworks•  Happened:mailtoe‐mail,PhonetoVoIP,FaxtoPDFs•  Startedtheprocess:TV,variouscontrolsystems•  SQlltocome:Cellphonenetworks

3February16,2012 YairAmir

Page 4: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Asingle,mulQ‐purpose,IP‐basednetwork•  Theartofdesign–asuccessfulparadigm

–  Keepitsimpleinthemiddle•  Best‐effortpacketswitching,rouQng(intranet,Internet)

–  Smartattheedge•  End‐to‐endreliability,naming

•  Couldthereforeadaptandscale–  Survivedfor4decadesandcounQng–  Sustainedatleast7ordersofmagnitudegrowth

•  Standardizedandalotridesonit–  Thebasicservicesarenotlikelytochange

4February16,2012 YairAmir

Page 5: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  CommunicaQonpaberns–  FromPoint‐to‐point–topoint‐to‐mulQpoint–tomany‐to‐many

•  Highperformancereliability–  “Fasterthanreal‐Qme”filetransfers

•  LowlatencyinteracQvity–  150mskeystrokemirroring–  100msforVoIP–  80‐100msforinteracQvegames(remotesurgery?)

•  End‐to‐enddependability–  From“Internet”dependability–to“phoneservice”dependability–to

“TVservice”dependability–to“remotesurgery”dependability•  Systemresiliency

–  FromE‐mailfaulttolerance–tofinancialtransacQonsecurity–tocriQcalinfrastructure(SCADA)intrusiontolerance

5YairAmirFebruary16,2012

Page 6: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Buildspecializednetworks–  WasdonedecadesbeforetheInternet–  ThinkCable/TVdistribuQon(Satellite+lastmile)–  Extremelyexpensive

•  BuildprivateIPnetworks–  AvoidstheresourcesharingaspectsoftheInternet,solvessomeofthe

scaleissues–  Expensive–  SQllconfinedtobasicIPnetworkcapabiliQes

•  BuildabeberInternet–  ImprovementsandenhancementstoIP(orTCP/IPstack)–  “Cleanslatedesign”

•  Buildoverlaynetworks

6YairAmirFebruary16,2012

Page 7: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Overlay paradigm: –  In contrast to “keep it simple in the middle and smart at the edge” –  Move intelligence and resources to the middle

•  Software-based overlay routers working on top of the internet •  Overlay links translated to Internet paths

•  Smaller overlay scale (# nodes) smarter algorithms, better performance, and new services.

February16,2012 YairAmir 7

Page 8: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  FlexibleRouQng–  RON–resilientrouQngusingalternatepaths[Andersenetal,01]–  XBone–flexiblerouQngusingIPinIPtunneling[Touch,Hotz,98]

•  ContentDistribuQon–  Yoid–host‐basedcontentdistribuQon[Francis00]–  Overcast–reliablemulQcastforhighbandwidthcontentdistribuQon[Jano:

etal,00]–  Bullet–mulQ‐pathdatadisseminaQon[Kos<cetal03]

•  MulQcast–  ESM–providesapplicaQon‐levelmulQcast[Chuetal,00]–  HTMP–interconnectsislandsofIPMulQcast[Zhangetal,02]

•  PeertoPeer–  Chord–logarithmiclookupservice[Stoicaetal,01]–  Kelips–O(1)lookupwithmoreinformaQonstored[Guptaetal,03]

•  GroupCommunicaQon–  TheSpreadtoolkit–scalablewideareagroupcommunicaQonusinganoverlay

approach[Amir,Danilov,Stanton,00]

February16,2012 YairAmir 8

Page 9: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  TheOverlayNetworkParadigm•  TheDARPANetworkingChallenge(99‐03)

– OverlayArchitecture–  Low‐latencyreliabletransport

•  TheSiemensVoIPChallenge(03‐06)– Almost‐reliable,real‐Qmetransport

•  TheLiveTimeNetTVChallenge(08‐…)–  FromOverlaystoClouds– UlQmateresiliency,automatedmonitoringandcontrol

•  ThechallengesaheadFebruary16,2012 YairAmir 9

Page 10: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  ThetradiQonalparadigm(keepitsimpleinthemiddleandsmartattheedge)workswellfortradiQonalapplicaQonsintypicalconnecQvitycondiQons

•  ButnotsowellfortradiQonalapplicaQonsinbadconnecQvitycondiQons

•  AndnotsowellforemergingapplicaQonsintypicalconnecQvitycondiQons

February16,2012 YairAmir 10

Page 11: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  50millisecondnetwork–  E.g.LosAngelestoBalQmore–  50millisecondstotellthesenderabouttheloss–  50millisecondstoresendthepacket

•  Atleast100millisecondstorecoveralostpacket

February16,2012 YairAmir 11

565

5

LAX BWI

Page 12: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  50millisecondnetwork–  E.g.LosAngelestoBalQmore–  50millisecondstotellthesenderabouttheloss–  50millisecondstoresendthepacket

•  Atleast100millisecondstorecoveralostpacket–  Canwedobeber?

February16,2012 YairAmir 12

LAX BWI

Page 13: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  50millisecondnetwork,fivehops–  10millisecondstotellnodeDALabouttheloss–  10millisecondstogetthepacketbackfromDAL

•  Only20millisecondstorecoveralostpacket–  LostpacketsenttwiceonlyonlinkDAL–ATL

February16,2012 YairAmir 13

56 5

5

LAX PHX DAL ATL DCA BWI

Page 14: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

SimulaQon

Latency

February16,2012 14YairAmir

Page 15: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

SimulaQon

Latency

Ji@er

February16,2012 15YairAmir

Page 16: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

SimulaQon SpinesonEmulab

Latency

Ji@er

February16,2012 16YairAmir

Page 17: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  50msnetworkdividedevenlyintoxhops•  Delayedpackets:arriveaNermorethan50+10ms

February16,2012 YairAmir 17

Page 18: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  TheSpinesOverlayMessagingsystem– AnOverlaysoNwarerouter(daemon)ontopofUDP–  RunningasanormalInternetapplicaQon

•  Easytouseprogrammingplarorm–  TransparentinterfaceidenQcaltothesocketinterface,givingTCP,UDPandIPMulQcastfuncQonality

•  “Commercialgrade”deployablesystem–  ImprovingapplicaQonperformanceovertheInternet–  Enablingnewservices– Opensource(www.spines.org)

February16,2012 YairAmir 18

Page 19: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

February16,2012 YairAmir 19

Page 20: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  TheOverlayNetworkParadigm•  TheDARPANetworkingChallenge(99‐03)

– OverlayArchitecture–  Low‐latencyreliabletransport

•  TheSiemensVoIPChallenge(03‐06)– Almost‐reliable,real‐Qmetransport

•  TheLiveTimeNetTVChallenge(08‐…)–  FromOverlaystoClouds– UlQmateresiliency,automatedmonitoringandcontrol

•  ThechallengesaheadFebruary16,2012 YairAmir 20

Page 21: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Canwemaintaina“goodenough”phonecallqualityovertheInternet?

•  Highqualitycallsdemandpredictableperformance–  VoIPisinteracQve.Humansperceivedelaysat100ms–  Thebest‐effortserviceofferedbytheInternetwasnotdesignedto

offeranyqualityguarantees–  CommunicaQonsubjecttodynamicloss,delay,jiber,pathfailures

February16,2012 YairAmir 21

PSTN

50msnetworkdelay

Page 22: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Localizedreal‐Qmerecoveryonoverlayhops–  Retransmissionisabemptedonlyonce

•  EachOverlaynodekeepsahistoryofthepacketsforwardedinthelast100ms–  Whentheotherendofahopdetectsaloss,itrequestsa

retransmissionandmoveson

–  IftheupstreamnodesQllhasthepacketinitshistory,itresendsit

•  Notareliableprotocol–  NoACKs.Noduplicates.Noblocking.

•  Recoveryworksforhopsshorterthanabout30ms–  Thisisok:overlaylinksareshort!

February16,2012 YairAmir 22€

loss ≈ 2 ⋅ p2

retr_ delay = 3 ⋅T + Δ

Page 23: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Spinesoverlay–5linksof10mseach•  10VoIPstreamssendinginparallel•  LossonmiddlelinkC‐D

February16,2012 YairAmir 23

50msnetworkdelay

Page 24: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  RouQngalgorithmthattakes

intoaccountretransmissions•  WhichpathmaximizesthenumberofpacketsarrivingatnodeEinunder100ms?

•  FindingthebestpathbycompuQnglossanddelaydistribuQononallthepossibleroutesisveryexpensive

•  Weightmetricforlinksthatapproximatesthebestpath

February16,2012 YairAmir 24

Exp_ latency = (1− p) ⋅T + (p − 2 ⋅ p2) ⋅ (3 ⋅T + Δ) + 2 ⋅ p2 ⋅Tmax

Page 25: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Localizedreal‐Qmerecoveryonoverlayhops– Retransmissionisabemptedonlyonce

•  FlexiblerouQngmetricavoidscurrentlycongestedpaths– Costmetricbasedonmeasuredlatencyandlossrateofthelinks

– Linkcostequivalenttotheexpectedpacketlatencywhenretransmissionsareconsidered

February16,2012 25YairAmir

Page 26: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  TheOverlayNetworkParadigm•  TheDARPANetworkingChallenge(99‐03)

– OverlayArchitecture–  Low‐latencyreliabletransport

•  TheSiemensVoIPChallenge(03‐06)– Almost‐reliable,real‐Qmetransport

•  TheLiveTimeNetTVChallenge(08‐…)–  FromOverlaystoClouds– UlQmateresiliency,automatedmonitoringandcontrol

•  ThechallengesaheadFebruary16,2012 YairAmir 26

Page 27: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  CantheInternetbeanunderlyingnetworkforaliveTVservice?–  Livechanneltransport(BusinesstoBusiness)–  Thevirtualcablecompany(BusinesstoConsumer)–  NextGeneraQonTV(InteracQvity)

•  Requirements–  Scalability:Highcapacityflows,manyany‐to‐manyflows

–  Highavailabilityanduniformdelivery

•  Technologytrends–  Cheaplong‐haulaccessbandwidth–  BroadbandInternetconnecQvitytothehome

–  MulQ‐corecomputerarchitectureFebruary16,2012 27YairAmir

Page 28: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Theserviceproviderpointofview•  AserviceratherthansoNwareorhardware•  Controloverwhereoverlaynodesarelocated

•  MulQplenetworkprovidersineachoverlaynode(SuperNodes)

•  Guaranteedcapacitywithadmissioncontrol

•  MonitoringandControl–nearautomaQon

February16,2012 28YairAmir

Page 29: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

February16,2012 29YairAmir

Page 30: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

February16,2012 30YairAmir

Page 31: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

February16,2012 31YairAmir

Page 32: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

February16,2012 32YairAmir

Page 33: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Scalableoverlaynetworkarchitecture– Paralleloverlays

•  Real‐Qmemonitoringandcontrol– Automated–takethehumanoutoftheloop

•  ThreelevelsofprotecQon– Linklevel:real‐QmeprotocolforHD‐TV

– Overlaylevel:responsiveoverlayrouQng– Cloudlevel:NxWayfailoverforoverlayrouters

February16,2012 33YairAmir

Page 34: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Network packet loss on one link (assuming 66% burstiness)

Loss experienced by flows on the LTN Network

2% < 0.0003%

5% < 0.003%

10% < 0.03%

February16,2012 34YairAmir

Page 35: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Availablelink

SuperNode

35February16,2012 YairAmir

• UQlizesmulQpleTier1IPbackbones• OpQmizedoverlaypathsdetermineselectedlinks• AutomaQcallyandinstantaneouslyswitchtoabeberpath

Page 36: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Availablelink Selectedlink

SuperNode

36February16,2012 YairAmir

• UQlizesmulQpleTier1IPbackbones• OpQmizedoverlaypathsdetermineselectedlinks• AutomaQcallyandinstantaneouslyswitchtoabeberpath

Page 37: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Availablelink Selectedlink DeterioraQnglink

SuperNode

37February16,2012 YairAmir

• UQlizesmulQpleTier1IPbackbones• OpQmizedoverlaypathsdetermineselectedlinks• AutomaQcallyandinstantaneouslyswitchtoabeberpath

Page 38: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Availablelink Selectedlink DeterioraQnglink

SuperNode

38February16,2012 YairAmir

• UQlizesmulQpleTier1IPbackbones• OpQmizedoverlaypathsdetermineselectedlinks• AutomaQcallyandinstantaneouslyswitchtoabeberpath

Page 39: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

Availablelink Selectedlink DeterioraQnglink

SuperNode

39February16,2012 YairAmir

• UQlizesmulQpleTier1IPbackbones• OpQmizedoverlaypathsdetermineselectedlinks• AutomaQcallyandinstantaneouslyswitchtoabeberpath

Page 40: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

• UQlizesmulQpleTier1IPbackbones• OpQmizedoverlaypathsdetermineselectedlinks• AutomaQcallyandinstantaneouslyswitchtoabeberpath

Availablelink Selectedlink DeterioraQnglink

SuperNode

40February16,2012 YairAmir

Page 41: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  TheOverlayNetworkParadigm•  TheDARPANetworkingChallenge(99‐03)

– OverlayArchitecture–  Low‐latencyreliabletransport

•  TheSiemensVoIPChallenge(03‐06)– Almost‐reliable,real‐Qmetransport

•  TheLiveTimeNetTVChallenge(08‐…)–  FromOverlaystoClouds– UlQmateresiliency,automatedmonitoringandcontrol

•  ThechallengesaheadFebruary16,2012 YairAmir 41

Page 42: Yair Amir - Johns Hopkins Universityyairamir/YairAmir_IPL.pdf · Distributed Systems and Networks lab Department of Computer Science, Johns Hopkins University  Yair Amir

•  Resiliency‐allthewaytointrusiontolerance– Resilientclouds– CriQcalinfrastructure– SKYDA(SCADAintheSky)

•  Timelinessandquality–noendtothat – RemotemanipulaQon

– Remotesurgery– Remotemusictraining?

February16,2012 YairAmir 42