In this video from Dell's Network Field Day 11 (#NFD11) presentation, Madhu Santhanam demonstrates an interesting use case for the new OS10 switch operating system that was introduced at the event.
The core of OS10 is an unmodified Linux kernel with an application development environment for Control Plane Services (CPS). These APIs allow software running on the switch: native linux applications, third party applications, and native OS10 applications to run on the core OS10 operating system.
The FIB Optimization application consists of three components: an sFlow agent to provide network visibility, Quagga for BGP routing, and the Selective Route Push agent which provides a REST API for selectively populating the hardware routing tables in the switch ASIC. The FIB Optimization application allows an inexpensive data center switch to replace a much more expensive high capacity Internet router.
In this use case, the data center is connected to a single transit provider and multiple additional peer networks. Initially all traffic is sent via a default route to the transit provider. The full Internet routing table consists of nearly 600,000 prefixes - far too many to fit in the switch hardware forwarding tables which in typical low cost switches can only handle 20,000 - 30,000 routes.
However, for any given site, only a small number the advertised prefixes are destinations for traffic. The challenge is to identify the most active prefixes so that they can be selectively installed in hardware tables. In this demonstration sFlow-RT is used to process the industry standard sFlow telemetry exported by the sFlow agent on the switch and compute the top N active prefixes. Every minute active prefixes installed and inactive prefixes are removed from the hardware using the Selective Route Push REST API.
In the demo, an IXIA traffic generator simulates a mixture of traffic to 1 peer and to other prefixes handled by the transit provides. Initially, all traffic is sent to the transit provider, but after a minute 76% of the traffic is being directed to the peer, and after two minutes over 90% of the traffic is being directed to the peer. As traffic patterns change, prefixes are automatically added and removed from the hardware forwarding table.
Analytics driven SDN is poised to disrupt the router market, replacing expensive, custom hardware with commodity switches. Solutions like the one Dell presented are already in production. Last year, David Barroso gave a talk showing that 99% of Spotify’s Internet traffic was contained in only 20,000 prefixes and started the SDN Internet Router project which similarly combines BGP routing information with sFlow telemetry, but only updates the active routes every hour. In the Dell demo, updates occur every minute. The speed of response can be increased even further, the sFlow-RT Active Route Manager application detects and updates active routes within a second. Speed is important since the faster the application can react to changing traffic, the more effectively the hardware forwarding, opening up additional use cases for SDN routing.