52
Network Hardware and Software ECE 544: Computer Networks II Spring 2014 Dr. Reininger

Network Hardware and Software

  • Upload
    randy

  • View
    51

  • Download
    0

Embed Size (px)

DESCRIPTION

Network Hardware and Software. ECE 544: Computer Networks II Spring 2014 Dr. Reininger. Part I: What’s inside a router?. Router Architecture (Functional View). IP Router Architecture: An Overview, James Aweya , Nortel Networks. Router Architecture (Component View). Input Ports. - PowerPoint PPT Presentation

Citation preview

Page 1: Network  Hardware and Software

Network Hardware and Software

ECE 544: Computer Networks IISpring 2014Dr. Reininger

Page 2: Network  Hardware and Software

Part I: What’s inside a router?

Page 3: Network  Hardware and Software

Router Architecture (Functional View)

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 4: Network  Hardware and Software

Router Architecture (Component View)

Page 5: Network  Hardware and Software

Input Ports

Page 6: Network  Hardware and Software

Router Table Lookup

• Search through the routing table, looking for a destination entry that matches the destination address of the datagram, or a default route if the destination entry is missing.

• Backbone routers need to perform millions of lookups per second. – Desired for the input port processing to be able to process at line speed, that is lookup can

be done in less than the time needed to receive a packet at the input port. – In this case, input processing of a received packet can be completed before the next

receive operation is complete.

• OC-48 link runs at 2.5Gbps. With a 256 byte long packet -> ~ 1 million lookups a second.

Page 7: Network  Hardware and Software

Routing Table Data Structure• Linear search through a large routing table is impossible.• Store the routing table entries in a tree data structure.

– Each level in the tree can be thought of as corresponding to a bit in the destination address.

– To lookup an address, start at the root node of the tree. If the first address bit is a zero, then the left subtree will contain the routing table entry for destination address; otherwise it will be in the right subtree.

– The appropriate subtree is then traversed using the remaining address bits – if the next address bit is a zero the left subtree of the initial subtree is chosen; otherwise, the right subtree of the initial subtree is chosen.

– In this manner, one can lookup the routing table entry in N steps, where N is the number of bits in the address.

Page 8: Network  Hardware and Software

Increasing Lookup Speed• Even with N=32 steps, the lookup speed of binary search is not fast enough for

today’s backbone routing requirements.• Assuming a memory access at each step, less than a million address lookups/sec

could be performed with 40ns memory access times.• Several techniques can be used to increase lookup speed:

– Router explicitly checks each incoming packet against a table of all of the router’s addresses to see if there is a match. Routing table is never checked for local traffic (packets destined to the router’s network).

– Content addressable memories (CAMs) allow a 32-bit IP address to be presented to the CAM, which then returns the content of the routing table entry for that address in essentially constant time.

– Caching recently accessed routing table entries.• For an OC-3 speed link, approximately 256,000 source-destination pairs might be seen in one minute in

a backbone router.– Faster data structures which allow routing table entries to be located in log(N) steps.

[Waldvogel 1997]– Routing table compression techniques [Degemark 1997]

• Exploit the sparseness of actual entries in the space of all possible routing table entries. – Hardware techniques (higher performance at lower cost, but less flexible)

Page 9: Network  Hardware and Software

Output Port Processing

Page 10: Network  Hardware and Software

First Generation IP Routers:Bus-based router architecture

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 11: Network  Hardware and Software

Improvement to first generation:Route cache in the network interface

Maintain a very fast access subnet of the routing topology information.

Page 12: Network  Hardware and Software

Second Generation Router:Multiple Parallel Forwarding Engines

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 13: Network  Hardware and Software

Third Generation:Switched-based IP Router Architecture

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 14: Network  Hardware and Software

Limitation of Route Caching

• “Demand caching” schemes subjected to cache misses– Default to classical software-based route lookup

(“slow path”).• Routing changes invalidate the cache• Enterprise backbone and public networks

subjected to highly random traffic patterns and frequent topology changes eliminate the benefit of route caching.

Page 15: Network  Hardware and Software

Overcoming Cache Churn

• Use a forwarding database in each network interface which mirrors the entire content of the IP routing table maintained by the CPU– Eliminates the cache and the “slow path”.

• Offers significant benefits in terms of performance, scalability, network resilience and functionality, particularly in large complex networks with dynamic flows.– Accommodate network dynamics of short flows associated

with Web-based applications and interactive type sessions.

Page 16: Network  Hardware and Software

Switched-based Router with fully-distributed processors

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 17: Network  Hardware and Software

IP Router Layer 2/3 entities

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 18: Network  Hardware and Software

IP Router functional partitioning

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 19: Network  Hardware and Software
Page 20: Network  Hardware and Software
Page 21: Network  Hardware and Software

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Fast Path or Slow Path?

Page 22: Network  Hardware and Software

Distributed Router Architecture Functional Diagram

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 23: Network  Hardware and Software

Address lookup and forwarding

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 24: Network  Hardware and Software

IP Packet Router Lifecycle

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 25: Network  Hardware and Software
Page 26: Network  Hardware and Software
Page 27: Network  Hardware and Software
Page 28: Network  Hardware and Software
Page 29: Network  Hardware and Software
Page 30: Network  Hardware and Software
Page 31: Network  Hardware and Software
Page 32: Network  Hardware and Software
Page 33: Network  Hardware and Software

Forwarding databases for Future Internet Services

• Non-SQL DB – GET/SET KVPs – Content Routing lookup as Map-Reduce computation

• REDIS – Hardware:

• 4 GB 1.7 GHz DDR3 RAM • i7 dual core 1.7 GHz with 4 MB shared L3 Cache• 128 GB Flash storage

– Operating System: Ubuntu 13.04 – SET: 403.1 KRPS (kilo requests per second)– GET: 508.4 KRPS– 16 MB/MKVPs (encoded into 1,000 hashes of 1,000 sub-keys each)

Page 34: Network  Hardware and Software
Page 35: Network  Hardware and Software

Shared Medium TDM bus

Page 36: Network  Hardware and Software

Shared Memory Switch Fabric

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 37: Network  Hardware and Software

Distributed Output Buffered Switch Fabric

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 38: Network  Hardware and Software

Space Division Switch Fabric

IP Router Architecture: An Overview, James Aweya, Nortel Networks

Page 39: Network  Hardware and Software
Page 40: Network  Hardware and Software

Where does queuing occur?

• Packet queues can form at both the input ports and the output ports.

• As queues grow larger, the router’s buffer space will eventually be exhausted and packet loss will occur.

• The actual location of packet loss (either at input or output port queues) will depend on the traffic load, the relative speed of the switching fabric and the line speed.

Page 41: Network  Hardware and Software

Output Queuing (OQ)• 100% throughput• Internal speedup of N– Impractical for large N

Input 1

Input 2

Input 3

Input 4

Output 1

Output 2

Output 3

Output 4

3

3

3

3

Page 42: Network  Hardware and Software

Input Queuing (IQ)

• Easy to implement• HOL Blocking, throughput 58.6%

Head of Line Blocking

Input 1

Input 2

Input 3

Input 4

Output 1

Output 2

Output 3

Output 4

12

32

34

24

Page 43: Network  Hardware and Software

Output Port Queuing

Page 44: Network  Hardware and Software

HOL blocking

Page 45: Network  Hardware and Software

Reducing the effects of HOL blocking

• Increase the speed of the I/O channel (i.e. speed up the switch fabric)– Under certain assumptions on traffic statistics a

speed up (ratio of the switch fabric bandwidth and the bandwidth of the input links) of 4 or 5 leads to 99% throughput.

– Recently, a speedup as low as 2 may be sufficient to obtain performance comparable to output buffered switches.

Page 46: Network  Hardware and Software

Virtual Output Queuing (VOQ)

• Virtual Output Queuing (VOQ)– Overcome HOL blocking– No speedup requirement– Need scheduling

algorithms to resolve contention• Complexity• Performance guarantee

1234

1234

1234

1234

Page 47: Network  Hardware and Software

Terabit and Petabit Packet Switching (TPS and PPS)

• Two main transmission/switching technologies– optical-based

• advantage: data carrying capacity and longer transmission distances (>1km) – electrical-based

• more suited to short distance, e.g. 10GbASE-T is capable of 10 Gb/s over distances of less than 100m• lower cost, higher density, easier control of switching cross points. 10GbASE-T is highly cost competitive for

interconnection within data centers.• Cabling uses four pairs of wires in parallel with each pair transmitting at either 0.25 Gbps for 1 GbASE-T or 2.5

Gbps for 10 GbASE-T.

• Ethernet at 1 Gb/s and beyond uses both– Electrical allows higher density of crosspoints allow much more capable switch in terms of number

of input and output ports.– Multistage Interconnection Networks (MIN) and memory elements are used to construct larger

switches– Optical solutions lack fast all-optical data storage for switching and data processing, this

necessitating a conversion to electrical signals.– Optical Ethernet does not lend itself to carrier sensing and is often deployed in a point-to-point

manner without multiaccessing.

Page 48: Network  Hardware and Software

Terabit and Petabit Packet Switching (TPS and PPS)

• Content Distribution Service– 100 Mbps multimedia streams to subscribers– 100,000 subscribers with 1 Gbps access -> 100 Tbps– Core switch requires 5,000 10 Gbps ports assuming a 20:1

access switch concentration ratio (20 1 Gbps input lines at the access switch concentrate into 1 10 Gbps input line at the core switch)

– Assuming a 1:1 fan in/out for the core switch, the core switch requires 10,000 ports

– If we use modules of 100 inputs each, the core switch consists of 100 modules

Page 49: Network  Hardware and Software

Terabit and Petabit Packet Switching (TPS and PPS)

• Data center LAN motivation example– GbE Host Bus Adapters (HBA) is giving way to 10

GbE NICs– 100 x 10 GbE = 1 TbE– After aggregation data can be transported over

short distances of less than 30m to end-of-row core switches

– PPS needed for interconnecting LANs in data centers.

Page 50: Network  Hardware and Software

Router with 10 GbE ports and three physical layer module types

http://en.wikipedia.org/wiki/10-gigabit_Ethernet

Page 51: Network  Hardware and Software

Data Center Aggregation

http://www.datacenter.rdm.com/global/en/data-center-solutions/data-center-topologies/end-of-row-vs_-top-of-rack.html

Page 52: Network  Hardware and Software

References• Ivan Seskar, Network Hardware and Software, ECE Computer Networks II, Spring 2009• IP Router Architectures: An Overview, James Aweya, Nortel Networks• What’s inside a router? http://www3.gdin.edu.cn/jpkc/dzxnw/jsjkj/chapter4/4-6.htm• Anatomy of a Broadband Router, Gabriel Torres, http

://www.hardwaresecrets.com/article/Anatomy-of-a-Broadband-Router/420• A 50-Gb/s IP Router, Craig Partridge et. al., IEEE/AC Transactions of Networking, Vol. 6, No. 3, June 1998,

http://nms.lcs.mit.edu/6829-papers/50gb.pdf• Introduction to Clos Networks

– http://en.wikipedia.org/wiki/ Clos_network– http://www.stanford.edu/class/ee384y/Handouts/ clos_networks.pdf

• Terabit Ethernet: Access and Core Switching Using Time-Space Carrier Sensing, Joseph Hui and David Daniel, IEEE Systems Journal, Vol. 4, No4. Dec. 2010

• Storing hundreds of millions of simple key-value pairs in Redis, http://instagram-engineering.tumblr.com/post/12202313862/storing-hundreds-of-millions-of-simple-key-value-pairs

• How fast is Redis? http://redis.io/topics/benchmarks• What kind of problems does MapReduce solve?

– http://programmers.stackexchange.com/questions/144787/what-kind-of-problems-does-mapreduce- solve– http://www.analyticbridge.com/profiles/blogs/what-mapreduce-can-t- do