Algorithms for Advanced Packet Classification with TCAMs Karthik Lakshminarayanan UC Berkeley Joint...

Algorithms for Advanced Packet Classification with TCAMs

Karthik LakshminarayananUC Berkeley

Joint work withAnand Rangarajan and Srinivasan Venkatachary

(Cypress Semiconductors Inc.)

Packet Processing Environment

• Packet matches a set of rules based on the header• Examples: routers, intrusion detection systems

Rule: acl-id src-addr src-port dst-addr dst-port proto(e.g. acl1231 128.32.0.0/8 0-1023 32.12.1.1/16 1024 tcp)

Hdr Payload

Search Key

permit/deny,update counter

Rule Action

… …

ACL Database

Ternary Content Addressable Memory

• Memory device with fixed width arrays• Each bit is 0, 1 or x (don’t care)• Search is performed against all entries in parallel

and the first result is returned

width = W bits

00100x1x001110x0x

01110xxx001100xxx

1111101x1101000xxwidth = W bits

Search key

011101xx001100x10 Outputis row2

TCAM: Benefits and Disadvantages

• Benefits:– Deterministic Search Throughput—O(1) search

• Disadvantages:– Cost

– Power consumption

• Current TCAM usage:– 6 million TCAM devices deployed

– Used in multi-gigabit systems that have O(10,000) rules

– TCAMs can support a table of size 128K ternary entries and 133 million searches per second for 144-bit keys

Problems

• Range Representation Problem

• Multimatch Classification Problem

Algorithms in software easier to implement

Problems

• Range Representation Problem

Range Representation Problem

• Representing prefixes in ternary is trivial– IP address prefixes present in rules

• Representing arbitrary ranges is not easy though– port fields might contain ranges– e.g. some security applications may allow ports

1024-65535 only

Earlier Approaches – I

Prefix expansion of ranges:– express ranges as a union of prefixes– have a separate TCAM entry for each prefix– expansion: the number of entries a rules expands to

• Example: the range [3,12] over a 4-bit field would expand to:– 0011 (3), 01xx (4-7), 10xx (8-11) and 1100 (12)

• Worst-case expansion for a W-bit field is 2W-2– example: [1,14] would expand to 0001, 001x, 01xx,

10xx, 110x, 1110– 16-bit port field expands to 30 entries

Earlier Approaches – II

Database-dependent encoding: – observation: TCAM array has some unused bits– use these additional bits to encode commonly

occurring ranges in the database

• TCAMs with IP ACLs have ~ 36 extra bits– 144-bit wide TCAMs– 104-bits + 4-bits for IP ACL rules

• Example:

Address Port …12.123.0.0/16 20-24 …32.12.13.0/24 1024- …128.0.0.0/8 20-24 …

Set extra bit to 1

Set extra bit to 1Set extra bit to x

If search key falls in 20-24, set extra bit to 1, else set it to 0

• Improved version: Region-based Range Encoding

• Disadvantages:– database dependent incremental update is hard

Database-Independent Range Pre-Encoding

• Key insight: use additional bits in a database independent way– wider representation of ranges– reduce expansion in the worst-case

• Fence encoding (W bits):– total of 2W-1 bits– encoding of i has i ones

preceded by 2W-i-1 zeros– e.g. W=3, f(0) = 0000000,

f(2) = 0000011

• With 2W-1 bits, fence encoding achieves an expansion of 1

Theorem: For achieving a worst-case row expansion of 1 for a W-bit range, 2W-1 bits are necessary

• Procedure:– split W-bit field into multiple chunks– encode each chunk using fence encoding– “combine” the chunks to form ternary entries

Combining chunks: analogous to multi-bit tries

W bits

k1 bitsk0 bits k2 bits

Unibit view of DIRPE (Prefix expansion)

• W=3 divided into 3 one-bit chunks• R=[1,6]—prefixes = {001,01x,10x,110}• Each level can contribute to at most 2 prefixes

(but the top level)xxx

0xx 1xx

00x 01x 11x10x

000 001 010 011 100 101 110 111

Multi-bit view of DIRPE

Worst caseexpansion= 2W/k – 1

Number of extrabits needed

= (2k-1)W/k - W

Legend:•8-bit field (W=8)•k0=2, k1=3, k2=3•R=[11,54] =[013, 066]•split chunk = 1

Comparison of Expansion

Worst-caseexpansion

Real-lifeexpansion

DIRPE + DB-dependent Net expansion was 1.12

MetricPrefix

Expansion

Region-basedEncoding

(with r regions)

DIRPE(with k-bitchunks)

DIRPE +Region-based

Extra bits

Worst-casecapacity

degradation

Cost of anincremental

update

Overhead onthe packetprocessor

0 F(log2r + 2n-1 r

) F(W(2k-1)

k - W) 2n-1

F( (2k-1) log2r

(2W-2)F (2log2r)F ( )F 2W

k - 1 ( )F 2log2r

O(( )O(WF) O(N) O(N)

None O((log2r+2n-1

r) F.2W)

Pre-computedtable of size:

( or )O(nF) comparators

of width W bits

logic gates

Both piecesof logic from

previous two columns

DIRPE: Summary

↑ Database independent

↑ Scales well for large databases

↑ Good incremental update properties

↓ Additional bits needed

↓ Small logic needed for modifying search key

Problems

• Range Expansion Problem

Multimatch Classification Problem

• TCAM search primitive: return first matching entry for a key

• Multimatch requirement: return k matches (or all matches) for a key– security applications where all signatures that

match this packet need to be found– accounting applications where counters have to

be updated for all matching entries

Earlier Approaches

Entry Invalidation scheme:– maintain state of multimatch using an

additional bit in TCAM called “valid” bit

TCAM array

00100x1x001110x0x

01110xxx001100xxx

1111101x1101000xx

Search key

011101xx001100x10

valid bit

0 match

Earlier Approaches

Entry Invalidation scheme:– maintain state of multimatch using an

additional bit in TCAM called “valid” bit

• Disadvantage:– ill-suited for multi-threaded environments

• need one bit for each thread

Earlier Approaches

Geometric intersection scheme:– construct geometric intersection (cross-

products) of the fields and place in TCAM– pre-processing step is expensive– search is fast

• Disadvantage:– does not scale well in capacity– for router dataset: expansion of 25—100

Multimatch Using Discriminators (MUD)

• Observation: after index j is matched, the ACL has to be searched for all indices >j

• Basic idea:– store a discriminator field with each row that

encodes the index of the row– to search rows with index >j, the search key is

expanded to prefixes that correspond to >j– multiple searches are then issued

MUD: Example

TCAM array

discriminator field

Search key

011101xx00 xxxx

discriminator

001x01xx1xxx

MetricEntry

InvalidationGeometric

Intersection-basedMUD

Multi-threadingsupport

Cycles fork multi-matches

Overhead onthe packetprocessor

No Yes Yes

Small state machinelogic; can be

implemented using a few hundred gatesor a few microcode

instructions

1 + d + (d-1)(k-2)

Update cost O(NF) O(N)O(N)

NWorst-case TCAMentries for N rules

NO(NF)

Small state machinelogic; can be

implemented using a few hundred gatesor a few microcode

instructions

Extra bits 0 0without DIRPE: d

with DIRPE: 1 + d(k-1)r

with DIRPE:log2(d/r) + (d-r) + (2r-1)

MUD: Summary

↑ No per-search state—multi-threaded

↑ Incremental updates fast

↑ Scales well to large databases

↓ Additional bits needed

↓ Extra search cycles↑ Can still support Gbps speeds

Conclusion

• Range expansion problem: proposed DIRPE, a database independent encoding– scales to large number of ranges– good incremental update properties

• Multimatch classification problem: MUD– suitable for multithreaded environments– scales to large databases

• No change to hardware easy to deploy

Algorithms for Advanced Packet Classification with TCAMs Karthik Lakshminarayanan UC Berkeley Joint...

Documents

A Smart Pre-Classifier to Reduce Power Consumption of TCAMs for Multi-dimensional Packet Classification Yadi Ma, Suman Banerjee University of Wisconsin-Madison

Applied Research Laboratory Edward W. Spitznagel 24 October 20151 Packet Classification using Extended TCAMs Edward W. Spitznagel, Jonathan S. Turner,

Generative Adversarial Network - developer.download.nvidia.com fileMihaela Rosca, Balaji Lakshminarayanan, David Warde-Farley, Shakir Mohamed, “Variational Approaches for Auto-Encoding

AUTOMATIC TARGET RECOGNITION AND DATA FUSION March 9 th, 2004 Bala Lakshminarayanan

Analyzing Cooperative Containment Of Fast Scanning Worms Jayanthkumar Kannan Joint work with Lakshminarayanan Subramanian, Ion Stoica, Randy Katz

1 Routing as a Service Karthik Lakshminarayanan (with Ion Stoica and Scott Shenker) Sahara/i3 retreat, January 2004

Lakshminarayanan AP.No.1 of 2013 - tnerc.gov.in AP.N… · Title: Microsoft Word - Lakshminarayanan AP.No.1 of 2013 Author: TNERC Created Date: 8/1/2013 11:25:16 AM

1 TCAM Razor: A Systematic Approach Towards Minimizing Packet Classifiers in TCAMs Department of Computer Science and Information Engineering National

X Non-Transitive Connectivity and DHTs Mike Freedman Karthik Lakshminarayanan Sean Rhea Ion Stoica WORLDS 2005

Oasis: Anycast for Any Service Michael J. Freedman Karthik Lakshminarayanan David Mazières in NSDI 2006 Presented by: Sailesh Kumar

1 Route Table Partitioning and Load Balancing for Parallel Searching with TCAMs Department of Computer Science and Information Engineering National Cheng

balajiln@ Balaji Lakshminarayanan Reliable Deep Anomaly

Bandwidth Estimation in Broadband Access Networks Venkat Padmanabhan Systems & Networking Group Microsoft Research Joint work with: Karthik Lakshminarayanan

Beyond TCAMs: An SRAM-based Parallel Multi-Pipeline Architecture for Terabit IP Lookup Author: Weirong Jiang ViktorK.Prasanna Publisher: Infocom 08 Present:

MyRadius CS4222 PROJECT – GROUP 8 By: Abbinayaa Subramanian (U099080Y) V andhanaa Lakshminarayanan (U099161R)

Error Detection and Correction in SRAM Emulated TCAMs

Cure for the Hunchback - by Janhavi Lakshminarayanan and Kamya Sandeep Subramanya

A Hybrid DL and GM (cont’d) Applications inComputerVision · •“Learning in Implicit Generative Models,” ShakirMohamed, Balaji Lakshminarayanan, 2016 •“VariationalInference

Packet Classification using Extended TCAMs

Bala Lakshminarayanan AUTOMATIC TARGET RECOGNITION April 1, 2004