[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures

To: Tony Li <tli@cisco.com>
Subject: Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
From: Dino Farinacci <dino@cisco.com>
Date: Thu, 10 Jan 2008 01:45:39 -0800
Cc: raszuk@juniper.net, Yakov Rekhter <yakov@juniper.net>, Brian Dickson <briand@ca.afilias.info>, Brian E Carpenter <brian.e.carpenter@gmail.com>, "Templin, Fred L" <Fred.L.Templin@boeing.com>, Routing Research Group list <rrg@psg.com>
In-reply-to: <6A14AAEB-49C0-46CB-BFF3-259150B0577A@cisco.com>
References: <200801081412.m08ECd922940@magenta.juniper.net> <1EC863D3-A092-4654-AB4F-5EDD299A41C6@cisco.com> <4784C973.1010108@juniper.net> <F2D485A8-9A1C-41C6-BDAB-0EE9BABA7D74@cisco.com> <6A14AAEB-49C0-46CB-BFF3-259150B0577A@cisco.com>

CEF was not created because caching failed. It was created becausethe way caching was implemented was not optimal.
Excuse me? Caching failed miserably. Per-host caching was growingto be larger than the number of prefixes in the RIB, and per-prefixcaching would have covered 80% of the RIB.

That's because of the granularity of the cache which required moreentries. If the original fastswitching design used prefix-basedpopulation, it would have suffice.

And from today's statistics that in an average router, only 10% of theFIB entries are typically in use, the forwarding cache could be muchsmaller than the RIB.

Caching is NOT worthwhile when the working set is 80% of the fulltable.


I agree with that.

It was not a question of having a partial table (i.e. cache) versusa full table (i.e. a RIB's worth of data), but how the forwardingtable was populated.
The population was expensive, but originally it was deemedworthwhile because the cached lookups were faster and certain folksdidn't know how to do a tree walk and there was no regard for theresulting space complexity when used in the core. The original hostcache was simply designed for the enterprise and made no senseanywhere else. In fact, even in large enterprises, there wereissues simply due to the number of hosts and the smallish number ofcache buckets originally allocated.


Yes, I am well aware there were more than one problem.

Who do you think reviewed for original CEF code?   ;-)

1) Populate the mapping cache for active flows only.
2) Install a default mapping cache entry that points to an ETR thathas more mapping
  entries.
3) Put all mapping entries from the Internet into the site ITR.
DNS does 1), CEF does 3), and 2) is done by doing a hybrid. We canswing on this pendulum based on traffic patterns. But here are thedisadvantages of each:
1) Data-plane induced requests cause population. So latency isincurred. But latency
  is only for the first destination site request.
2) Stretch is increased to not require all ITRs must have a fulltable.3) Requires lots of memory to store all mappings so latency can beeliminated. Solving
  this in one in scalable way will help scale xTRs in 2).
So choose your poison, you have to make a hard decision. So westart with caching and if the cache gets as large as full table,then we have moved the pendulum full swing.
Or.... you cache in some locations, where the working set is smalland you can afford latency (which, as we've discussed many times,can be further eliminated by piggybacking the mapping request withthe DNS request) and you have other locations that get a partial orfull feed. [Sean Doran will note that this is Yet Another ProblemThat Was Already Solved Once For NNTP.]

As you know, all the LISP mapping database mechanisms touches on alltradeoffs. We know how to do it each way, what's left isexperimentation and a decision to pick one, or blend two.

LISP 2.0 is this design but many objected to having routing depend ondirectory when directory needed to depend on routing.

The nice thing about this approach is that you make this "harddecision" on a per-box basis, so that you don't have to make onedecision for the entire 'net. Ya pays your money and you getservice proportional to the money that you put in...

We can do that with a blend of LISP-ALT and NERD. Or LISP-ALT in thecore with ITRs having a default mapping (so it's the default-mapperapproach).


We presented this at the last two RRGs.

Dino


--
to unsubscribe send a message to rrg-request@psg.com with the
word 'unsubscribe' in a single line as the message text body.
archive: <http://psg.com/lists/rrg/> & ftp://psg.com/pub/lists/rrg

Follow-Ups:
- Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
  - From: Tony Li <tli@cisco.com>

References:
- Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
  - From: Yakov Rekhter <yakov@juniper.net>
- Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
  - From: Dino Farinacci <dino@cisco.com>
- Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
  - From: Robert Raszuk <raszuk@juniper.net>
- Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
  - From: Dino Farinacci <dino@cisco.com>
- Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
  - From: Tony Li <tli@cisco.com>

Prev by Date: Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
Next by Date: [RRG] Properties of mapping solutions
Previous by thread: Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
Next by thread: Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
Index(es):
- Date
- Thread