nanog mailing list archives

Re: fuzzy subnet aggregation


From: Joe Maimon <jmaimon () jmaimon com>
Date: Mon, 28 Oct 2019 12:23:48 -0400

At this time I am just trying to get an idea of whether the whole exercise is worth it: whether the processing time is feasible for 5k, 50k, 100k, or 200k entries, and whether the results reduce the prefix count measurably at acceptable collateral levels.

Because RTBH scaling to 100k is one thing, and from there it could go higher. It works best if it can be spread to as many edge devices as possible, including ones with limited TCAM, such as customer-edge L3 switches, and to customers/friends who have redundant connections with other providers, including broadband, possibly with tiny routers, even DD-WRT.

50k routes here and there, and soon you are talking about real table bloat.

I try to start every project as a script if at all possible. If that works even somewhat, real code is promising.

Joe

Mark Leonard wrote:
You could modify a radix tree to include a consolidation function and a resulting confidence. Then walk the nodes of the tree and check whether each subtree meets the requirements for consolidation; if so, prune and record the confidence. You would need to re-run the consolidation from the original data every time an individual IP was added to or removed from the list, since the consolidation function is lossy.
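To make that concrete, here is a rough Python sketch of such a pruning pass. This is entirely hypothetical and not from the thread: confidence is taken to be the fraction of possible hosts actually present under a node, and any density threshold (say 0.9) is illustrative.

    # Binary trie keyed on IPv4 address bits; assumes deduplicated input.
    class Node:
        def __init__(self):
            self.children = [None, None]   # child for bit 0, child for bit 1
            self.count = 0                 # /32s present under this node

    def insert(root, addr):
        node = root
        for shift in range(31, -1, -1):
            bit = (addr >> shift) & 1
            if node.children[bit] is None:
                node.children[bit] = Node()
            node = node.children[bit]
            node.count += 1

    def consolidate(node, prefix, plen, threshold, out):
        # Emit (network, plen, confidence) wherever a subtree is dense
        # enough, pruning the walk below it; otherwise recurse.
        if node is None:
            return
        confidence = node.count / (1 << (32 - plen))
        if confidence >= threshold or plen == 32:
            out.append((prefix << (32 - plen), plen, confidence))
            return
        for bit in (0, 1):
            consolidate(node.children[bit], (prefix << 1) | bit,
                        plen + 1, threshold, out)

A call like consolidate(root, 0, 0, 0.9, results) after inserting the /32s yields (network, prefix-length, confidence) tuples. As noted above, additions and removals force a rebuild from the original data, because pruning discards which hosts were missing.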

Alternatively, you could do the consolidation on the fly, losslessly, if you had a custom tree-walk algorithm. That's probably the way I would do it. I'm not a programmer, so I assume there are better ways out there.

Your processing time for 5k IPs should be measured in seconds (i.e., less than one) rather than minutes on any modern core. Based on your pseudocode (sort -n | uniq), I get the impression that you're using Bash, which isn't ideal for performing this sort of operation at high speed.
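For comparison, that step stays in-process as a one-liner in something like Python (assuming addrs already holds the addresses as integers):

    uniq = sorted(set(addrs))   # the sort -n | uniq step, without forking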

On the flip side, I think an extra 100k routes isn't that much unless you're suffering from hardware routing table limitations.  In my world the cost of a false positive match would far outweigh the cost of upgrading hardware.  YMMV.

Do you have a git repo?

On Sun, Oct 27, 2019 at 9:58 PM Joe Maimon <jmaimon () jmaimon com> wrote:

    So I went back to the drawing board, and I think I have something
    that seems to work much better.

    - convert input prefixes to single IPs expressed as integers
    - sort -n | uniq
    - write into a temporary list file
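
    A minimal Python sketch of that preprocessing (hypothetical, and
    using the stdlib ipaddress module in memory rather than a temporary
    file):

        import ipaddress

        def expand(prefixes):
            # Expand input prefixes into sorted, unique integer addresses.
            addrs = set()
            for p in prefixes:
                for host in ipaddress.ip_network(p.strip(), strict=False):
                    addrs.add(int(host))
            return sorted(addrs)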

    begin

    read addresses sequentially until maxhosts (or minhosts) is reached,
    or the next address falls outside the current subnet

    if enough single addresses matched, output the subnet (and the
    missing hosts, without terminating the loop early)

    delete all subnet addresses read

    loop
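
    A minimal Python sketch of the loop above (hypothetical names; fixed
    to /24 candidates for clarity since the prefix length isn't specified
    here, and scanning each subnet's full run rather than terminating
    early at maxhosts):

        def aggregate(addrs, maxmissing=8, plen=24):
            size = 1 << (32 - plen)          # hosts per candidate subnet
            mask = ~(size - 1) & 0xFFFFFFFF
            subnets, singles = [], []
            uniq = sorted(set(addrs))        # the sort -n | uniq step
            i = 0
            while i < len(uniq):
                net = uniq[i] & mask
                j = i
                while j < len(uniq) and uniq[j] & mask == net:
                    j += 1                   # read until the next subnet
                if size - (j - i) <= maxmissing:
                    subnets.append(net)      # matched enough: output subnet
                else:
                    singles.extend(uniq[i:j])  # too sparse: keep the /32s
                i = j                        # delete all addresses read
            return subnets, singles

    Because the input is sorted, the pass is a single linear scan, which
    suggests even 100k-200k entry inputs should be feasible.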

    Total process time on a VM on old hardware: less than two minutes
    for a 5,500-line input. Now to verify the results, positive and
    negative....

    Results are still raw, but anyone who wishes is welcome to it.

    Joe

    Joe Maimon wrote:
    > Does anyone have, or has anyone seen, any such tool? I have a
    > script that seems to work, but it's terribly slow.
    >
    > Currently I can produce aggregated subnets that may be missing up
    > to a specified number of individual addresses, and the output can
    > be fed back in for multiple passes.
    >
    > Doing RTBH on individual /32s does not scale well if you are
    > eyeing collaboration with external lists. I have found likely
    > sources that could easily produce another 100k prefixes.
    >
    > Joe


