Comments
Richard Davies wrote: The UK has a good crop of technology pioneers in cloud computing - for example ElasticHosts, FlexiScale, Flexiant, OnApp - and also some strong government initiatives such as G-Cloud. We will have to see whether this kind of technical leadership converts into swift mass-market adoption or not.
Cloud Expo on Google News

SYS-CON.TV
Cloud Expo & Virtualization 2009 East
PLATINUM SPONSORS:
IBM
Smarter Business Solutions Through Dynamic Infrastructure
IBM
Smarter Insights: How the CIO Becomes a Hero Again
Microsoft
Windows Azure
GOLD SPONSORS:
Appsense
Why VDI?
CA
Maximizing the Business Value of Virtualization in Enterprise and Cloud Computing Environments
ExactTarget
Messaging in the Cloud - Email, SMS and Voice
Freedom OSS
Stairway to the Cloud
Sun
Sun's Incubation Platform: Helping Startups Serve the Enterprise
POWER PANELS:
Cloud Computing & Enterprise IT: Cost & Operational Benefits
How and Why is a Flexible IT Infrastructure the Key To the Future?
Click For 2008 West
Event Webcasts
The Emergence of the Universal Appliance
Containers are an example of a universal solution - one that revolutionized the shipping industry

In 1956, Malcolm McLean invented a shipping system that revolutionized cargo shipping forever, namely the container. The shipping container provides a standard, universal packing solution that can be used for transporting whatever you need to ship. Containers can be transported on trucks, trains or on ships, because they are of standard size.

Containers are an example of a universal solution that revolutionized the shipping industry. Can the emergence of the standard PC server platform as a universal computing platform herald the proliferation of even more innovative dedicated network appliances in IP networks?

The growth of network appliances
Over the last decade, we have all become familiar with packet networks including equipment like routers and switches. But packet networks have become more intelligent over the last few years as more and more demanding, real-time services have migrated to IP. While intelligence has been added to routers to deal with these new service types, a need has developed for dedicated network appliances that can perform specific functions in the IP network.

A good example is network performance monitoring. Routers have the capability to create and collect netflow data, which can be used to monitor the performance of IP networks. However, processing this data on many sessions places an unacceptable load on the router, which diverts attention away from the main task of the router, which is to forward packets. It therefore makes sense to off-load this task to dedicated network performance monitoring appliances.

Another example is network security. Many routers provide network security features, but as we move to higher speed networks, there is interest in using dedicated network security devices, such as intrusion detection and prevention systems (IDS/IPS) to detect threats in real-time and take immediate action.

This is a trend seen throughout the network with dedicated appliances for network analysis, network forensics, network test and measurement, network optimization and network security. These are highly intelligent solutions, which have the ability to process a vast amount of data in real-time. They are essential in establishing IP networks as multi-service and intelligent transport networks.

From proprietary to standard hardware platforms
Until recently, the high performance required by these solutions dictated a system design similar to the routing products they were designed to off-load, namely a proprietary hardware design. The doctrine has been that only a customized, proprietary design can provide the performance you need to meet the real-time demands of high-speed network monitoring.

But an alternative system design approach has been gathering momentum over the last few years based on standard off-the-shelf platforms. Standard PC servers have established themselves as a credible hardware platform alternative to in-house proprietary design and have been embraced by a number of network monitoring solution vendors who recognize that the value of their solutions lies in the application software provided. The hardware platform just needs to provide the raw computing power, memory bandwidth and fast input/output of data that these solutions require.

With the latest server platforms based on new multi-core CPU architectures, the raw processing power and memory bandwidth is available to perform even the most demanding tasks. However there is one area that these standard platforms are not able to address – fast input/output of data, especially for real-time network analysis applications.

Standard Network Interface Cards (NICs) provided with standard servers do not have the real-time throughput capacity and efficiency needed to for high-speed network monitoring. NICs can provide fast input/output for data packets to a specific server MAC/IP address, but cannot provide the same performance for all traffic when monitoring of all MAC/IP addresses is required. This is especially the case when moving to 10 Gbps networking.

Fortunately, specialist network adapters have emerged to fill the gap.

Focusing R&D effort
The combination of standard server platforms and intelligent real-time network adapters establishes the universal appliance platform for high-performance network monitoring or any other application that requires real-time packet capture, analysis and re-transmission at speeds up to 10 Gbps without losing packet data.

The emergence of such a universal appliance is significant. It effectively separates the application software from the hardware supporting it. This allows a multitude of dedicated application software solutions to be supported by a single hardware platform where addition of features or even a total replacement of application software supported by the server is possible. Vendors of network monitoring, analysis, test & measurement, optimization and security solutions can thus concentrate on the application and focus their R&D investment on software development rather than diverting attention to hardware development.

Not only does this mean more focus, but it is also comes at a lower cost! Standard PC server platforms enjoy economies of scale leading to relatively low unit prices. A standard server for a few thousand dollars is more than adequate in providing the CPU power and memory performance requirements for 10G applications. It is therefore possible to provide a lower cost hardware platform with a high performance with zero investment in hardware development.

But to make it work, you need an intelligent real-time network adapter. Let’s take a look at the fast input/output challenge for real-time network monitoring and how intelligent real-time network adapters help to meet these challenges.

The limitations of standard NICs
Fast input/output in real-time network monitoring requires that all data is captured no matter the packet size, link utilization or line-speed. Standard Network Interface Cards (NICs) have been used for this task in the past, but as the graphs in figure 1a and 1b show below, they face significantly challenges in a 10Gbps real-time network monitoring:

Figure 1a: Real throughput on a 10 Gbps port for standard NICs (Source: CESNET performance tests)

Figure 1b: CPU load handling 10 Gbps data traffic on 10 Gbps port (Source: CESNET performance tests)

The graph shown in figure 1a is referring to the effective throughput that can be achieved without losing packets at the port. It refers to Ethernet frames, which are used to transport IP packets in IP networks. Ethernet frames (and IP packets) can have any size. The size is determined by the application, but also conditions on the network – if the network or parts of the network are heavily loaded, then this can result in the use of smaller packets/frames as these have a better chance of reaching the destination in a congested newtork.

Table 1 below shows the theoretical limit for the throughput one should expect on a 10 Gbps port. Note that throughput naturally falls as the frame size is reduced. With smaller frame sizes, there are more frames to be handled and the preamble and inter-frame gap associated with each frame becomes more significant. This is pure overhead and reduces the effective throughput.

Table 1: Theoretical maximum throughput for a 10 Gbps Ethernet port As can be seen in figure 1a, for large Ethernet frame sizes, throughput is close to the theoretical limit. However, as frame sizes decrease, the effective throughput drops off dramatically to less than 1 Gbps at small frame sizes.

Typical frame sizes for Internet communication lie in the range from 128 to 1024 bytes with 300 bytes an often referenced frame size for tests. In this range, it can be seen that throughput is at best 6 Gbps and can be as low as 1 Gbps!

The graphs above are based on 10 Gbps port throughput, but the issue is the same for 1 Gbps ports. What distinguishes these two cases is the additional load that is placed on the CPU for handling of data traffic. For 1 Gbps ports, the CPU load is high, but acceptable, whereas for 10 Gbps ports, as figure 1b shows, almost 2/3 of the CPU resources are used just in handling Ethernet frames. This is not acceptable for many of the compute- and data-intensive network applications that are now becoming common in the network.

The explanation for this considerable work-load is that standard NICs are designed to interrupt the CPU each time a frame is received and needs to be handled. The CPU must decide what to do with the frame, to re-order and de-duplicate frames received, to discard frames that are invalid etc. This, obviously, is a distraction for CPUs, which should be busy running the network application in question.

Intelligent real-time network adapters, on the other hand, are designed for real-time network monitoring. In particular, they are designed to provide full throughput at the theoretical limit without losing packets no matter the packet size. They are also designed to do this without overloading the CPU by off-loading many of the tasks normally performed by the CPU. The results can be seen below (see figure 2a and 2b):

Figure 2a: Napatech NT20E throughput performance

Figure 2b: Napatech NT20E CPU load performance

As can be seen, the throughput can be maximized to theoretical limits while CPU load can be reduced to less than 1%. A lower CPU load ensures that there is more processing power delivered back to the application. This means a faster application with the ability to process more data. Intelligent real-time network adapters, such as Napatech’s can bridge the performance gap making standard off-the-shelf servers a viable and powerful universal platform for network appliances.

Parallel processing using multiple CPU cores
The latest CPUs provide multiple cores, effectively 2, 4 or 8 CPUs in one chip. However, to take advantage of this, it must be possible to run multiple instances of one application or several different applications on the available CPU cores. It must also be possible to direct the right traffic to each application instance. Now, instead of one flow of data being processed by a single application, 2, 4 or 8 flows can be processed in parallel.

While methods exist to implement multi-threading or multiple instances of the same application software on multiple CPU cores, standard NICs are not designed for providing data to multiple application instances in an intelligent way. In standard NIC implementations, Ethernet frames are treated on a frame-by-frame basis as a single flow. It is up to the operating system to copy the frames to all of the relevant application instances, which is both a time consuming and wasteful process.

Napatech network adapters provide a unique capability to intelligently define multiple data flows based on an examination of the Ethernet frames received. The flows can be defined based on the source and destination ports and addresses in the Ethernet, IP and TCP/UDP headers, but also on tunnel identifiers if a tunneling protocol has been used, such as SCTP, GRE or GTP.

Once these flows are defined, they can be directed to up to 32 different CPU cores for processing by an application instance. A Direct Memory Access (DMA) process is used, which means that the operating system does not need to be involved and no copying of frames is necessary. This removes delays and does not waste memory leading to a faster, more efficient data transfer.

The net result is real-time, parallel processing of multiple flows of data where each flow can be processed and managed differently, if one so chooses.

From standard server to universal appliance
The pieces are now in place to provide a universal appliance platform that can support any real-time network analysis application. This not only provides a relatively cheap, but powerful and reliable platform, but also provides flexibility in the type of server platform to use and the application to run on the platform thanks to the separation of hardware from software. More importantly, it allows providers of network monitoring, analysis, test & measurement, optimization and security solutions to focus their energy on software development rather than on hardware development.

Just as containers revolutionized the shipping industry, can the Universal Appliance concept do the same for dedicated network appliances and IP networks?

About Dan Joe Barry
Dan Joe Barry is VP of Marketing at Napatech. Napatech develops and markets the world's most advanced programmable network adapters for network traffic analysis and application off-loading. Napatech is the leading OEM supplier of Ethernet network acceleration adapter hardware with an installed base of more than 40,000 ports.

In order to post a comment you need to be registered and logged in.

Register | Sign-in

Reader Feedback: Page 1 of 1

Latest Cloud Developer Stories
As a result, it said, of “customer feedback and evolving usage patterns,” Microsoft cut the price of its cloud-ified SQL Azure database 48%–75% for databases larger than 1GB and introduced a new entry-level 100MB model. It blogged that it’s noticed that many projects start smal...
Wide and cheap availability of cloud-based media services is upon us. With the transformations these services are already bringing to the consumption of music, video and interactive media, change has likewise come to professional workflows. Documents in 2012 are read, written, co...
With Cloud Expo 2012 New York (10th Cloud Expo) just four months away, what better time to start introducing you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference... We have technical ...
Fresh off a happy quarter, Rackspace said Thursday that it’s bought SharePoint911, one of those you-never-heard-of-them outfits that does SharePoint consulting, training and JumpStart services so it can deliver newfangled SharePoint services along with its existing SharePoint hos...
Cloud is a shift from the focus on underlying technology implementation to leveraging existing implementations and further building upon them. Cloud orchestration or a network of clouds is the wave of the future where these clouds can operate with elasticity, scalability, and eff...
Subscribe to the World's Most Powerful Newsletters
Subscribe to Our Rss Feeds & Get Your SYS-CON News Live!
Click to Add our RSS Feeds to the Service of Your Choice:
Google Reader or Homepage Add to My Yahoo! Subscribe with Bloglines Subscribe in NewsGator Online
myFeedster Add to My AOL Subscribe in Rojo Add 'Hugg' to Newsburst from CNET News.com Kinja Digest View Additional SYS-CON Feeds
Publish Your Article! Please send it to editorial(at)sys-con.com!

Advertise on this site! Contact advertising(at)sys-con.com! 201 802-3021

SYS-CON Featured Whitepapers
ADS BY GOOGLE