Tag Archives: editorial

AppClassNet: A commercial-grade dataset for application identification research

Wang Chao, Alessandro Finamore, Lixuan Yang, Kevin Fauvel, Dario Rossi

Abstract

The recent success of Artificial Intelligence (AI) is rooted into several concomitant factors, namely theoretical progress coupled to practical availability of data and computing power. Therefore, it is not surprising that the lack of high quality data is often recognized as one of the major factors limiting AI research in several domains, and the networking domain is not excluded. Large companies have access to large data assets, that would constitute interesting benchmarks for algorithmic research in the broader scientific community. However, such datasets are private assets that are generally very difficult to share due to privacy or business sensitivity concerns.

Following numerous requests we received from the scientific community, we release AppClassNet, a commercial-grade dataset for benchmarking traffic classification and management methodologies. AppClassNet is significantly larger than the datasets generally available to the academic community in terms of both the number of samples and classes, and reaches scales similar to the popular ImageNet dataset commonly used in computer vision literature.

To avoid leak of user- and business-sensitive information, we opportunely anonymized the dataset, while empirically showing that it still represents a relevant benchmark for algorithmic research. In this paper, we describe the public dataset as well as the steps we took to avoid leakage of sensitive information while retaining relevance as a benchmark. We hope that AppClassNet can be instrumental for other researchers to address more complex commercial-grade problems in the broad field of traffic classification and management.

Download from ACM

The multiple roles that IPv6 addresses can play in today’s Internet

Maxime Piraux, Tom Barbette, Nicolas Rybowski, Louis Navarre, Thomas Alfroy, Cristel Pelsser, François Michel, Olivier Bonaventure

Abstract

The Internet use IP addresses to identify and locate network interfaces of connected devices. IPv4 was introduced more than 40 years ago and specifies 32-bit addresses. As the Internet grew, available IPv4 addresses eventually became exhausted more than ten years ago. The IETF designed IPv6 with a much larger addressing space consisting of 128-bit addresses, pushing back the exhaustion problem much further in the future.

In this paper, we argue that this large addressing space allows reconsidering how IP addresses are used and enables improving, simplifying and scaling the Internet. By revisiting the IPv6 addressing paradigm, we demonstrate that it opens up several research opportunities that can be investigated today. Hosts can benefit from several IPv6 addresses to improve their privacy, defeat network scanning, improve the use of several mobile access networks and their mobility as well as to increase the performance of multicore servers. Network operators can solve the multihoming problem more efficiently and without putting a burden on the BGP RIB, implement Function Chaining with Segment Routing, differentiate routing inside and outside a domain given particular network metrics and offer more fine-grained multicast services.

Download from ACM

The July 2022 issue

This July 2022 issue contains one technical paper and two editorial notes.

The technical paper, The Packet Number Space Debate in Multipath QUIC, by Quentin De Coninck, deals with how QUIC packets should be numbered over multiple paths. This work provides a comparison between the usage of a single (shared) or multiple packet space numbers for QUIC multipath. The main outcome of the evaluation is that using multiple packet number spaces has the advantage that packet losses can be detected while maintaining a significantly lower state at the receiver. Also, it allows using fewer signalling frames at the cost of a more profound modification of the QUIC protocol.

We have two editorial notes. The first one, The multiple roles that IPv6 addresses can play in today’s Internet, by Maxime Piraux and his colleagues, argues that the large IPv6 addressing space allows reconsidering how IP addresses are used and enables improving, simplifying and scaling the Internet. The second, AppClassNet: A commercial-grade dataset for application identification research by Wang Chao and his colleagues, releases a commercial-grade dataset for benchmarking traffic classification and management methodologies. AppClassNet is significantly larger than the datasets generally available to the academic community.

I hope that you will enjoy reading this new issue and welcome comments and suggestions on CCR Online (https://ccronline.sigcomm.org) or by email at ccr-editor at sigcomm.org.

Recommendations for Designing Hybrid Conferences

Vaibhav Bajpai, Oliver Hohlfeld, Jon Crowcroft, Srinivasan Keshav, Henning Schulzrine, Jorg Ott, Simone Ferlin-Reiter, Georg Carle, Andrew Hines, Alexander Raake

Abstract

During the Covid-19 pandemic, many smaller conferences have moved entirely online and larger ones are being held as hybrid events. This reduces the carbon footprint of conference travel and
makes events more accessible to parts of the research community that have difficulty traveling long distances. Hybrid events will become an attractive alternative in the future since they make meetings broadly available without the need for travels, while preserving all elements of in-person gatherings. While we have developed a solid understanding of how to design virtual events, we do not yet know how to properly run hybrid events. We present guidelines and considerations–spanning technology, organization and social factors–for organizing successful hybrid conferences. This is the output of a Dagstuhl seminar on “Climate Friendly Internet Research” held in July 2021.

Download from ACM

A Case for an Open Customizable Cloud Network

Dean H. Lorenz, David Breitgand, Kathy Barabash, Etai Lev-Ran, Danny Raz

Abstract

Cloud computing is transforming networking landscape over the last few years. The first order of business for major cloud providers today is to attract as many organizations as possible to their own clouds. To that end cloud providers offer a new generation of managed network solutions to connect the premises of the enterprises to their clouds. To serve their customers better and to innovate fast, major cloud providers are currently on the route to building their own “private Internets”, which are idiosyncratic. On the other hand, customers that do not want to stay locked by vendors and who want flexibility in using best-for-the-task services spanning multiple clouds and, possibly, their own premises, seek for solutions that will provide smart overlay connectivity across clouds.

The result of these developments is a multiplication of closed idiosyncratic solutions rather than an open standardized ecosystem. In this editorial note we argue for desirability of such an ecosystem, outline the main requirements and sketch possible solutions. We focus on enterprise as our primary use case and illustrate the main ideas through it, but the same principles apply to various different use cases.

Download from ACM

The April 2022 issue

This April 2022 issue contains five technical papers and two editorial notes.

The first technical paper, Data-Plane Security Applications in Adversarial Settings, by Liang Wang and colleagues, investigates security issues that may arise when creating and running data-plane applications for programmable switches. This work moves security analysis and design forward in this particular area. This paper also calls for a more thorough rethinking of security for data-plane applications for programmable switches.

The second technical paper, One Bad Apple Can Spoil Your IPv6 Privacy, by Said Jawad Saidi and colleagues, leverages IPv6 passive measurements to pinpoint that a non-negligible portion of devices encodes their MAC address in their IPv6 address. This threatens users’ privacy, allowing content providers and CDNs to consistently track users and their devices across multiple sessions and locations. Overall, the paper is an excellent contribution toward privacy-by-design solutions and a nicely executed measurements study that clarifies the problem and provides solid suggestions to mitigate the problem.

The third technical paper, Hyper-Specific Prefixes: Gotta Enjoy the Little Things in Interdomain Routing, by Khwaja Zubair Sediqi and colleagues, investigates the presence of high-specific prefixes (HSP) on the BGP Internet routing during the last decade. These prefixes are more-specific than /24 (/48) for IPv4 (IPv6) and are commonly filtered by Autonomous Systems operators. Overall this paper offers a nice contribution to the understanding of the BGP universe, with a clear message and a nice quantification of the phenomenon. The authors clearly present and motivate the work, offering also to not experts a nice view of the routing complexity of the internet nowadays.

The fourth technical paper, Programming Socket-Independent Network Functions with Nethuns, by Nicola Bonelli and colleagues, proposes a new solution to transparently develop packet-processing programs on top of different network I/O frameworks. The authors design and develop an open-source library, nethuns, serving as a unified programming abstraction for network functions that natively supports multi-core programming. Not only is this work very relevant to our community, but also the code is released open-source through a BSD license, which can be used to foster more research in the area, towards unifying programming mechanisms of end-host networking.

The fifth technical paper, Measuring DNS over TCP in the Era of Increasing DNS Response Sizes: A View from the Edge, by Mike Kosek and colleagues, studies one of the foundations of today’s Internet: the Domain Name Service (DNS). The original RFC document of DNS instructs to send queries either over UDP (DoUDP) or TCP (DoTCP). This paper presents a measurement study on DoTCP focusing on two perspectives: failure rates and response times.

Finally, we have two editorial notes. A Case for an Open Customizable Cloud Network, by Dean H. Lorenz and his colleagues, argues for the desirability of the new ecosystem of managed network solutions to connect to the Cloud, outlines the main requirements and sketches possible solutions. Recommendations for Designing Hybrid Conferences, by Vaibhav Bajpai and colleagues, presents guidelines and considerations–spanning technology, organization and social factors–for organizing successful hybrid conferences.

I hope that you will enjoy reading this new issue and welcome comments and suggestions on CCR Online (https://ccronline.sigcomm.org) or by email at ccr-editor at sigcomm.org.

Answering three questions about networking research

Jennifer Rexford, Scott Shenker

Abstract

Researchers often talk about specific technical trends or research topics. But we rarely talk about how and why we do the research that we do. The process of submitting and reviewing papers puts our ideas through a particular kind of filter that may make all of the research seem like it follows some standard rubric, a SIGCOMM Normal Form if you will. During a panel at HotNets’21, five researchers—Hari Balakrishnan, Jon Crowcroft, Jennifer Rexford, Scott Shenker, and David Tennenhouse—each answered three questions about how they pick their own research topics, what areas they would like to see more research on, and how they evaluate conference papers. Due to the unexpectedly positive response to that panel, CCR will be publishing a series of answers to these three questions, starting with two participants from the panel but reaching out to others to provide answers from a broader cross-section of the SIGCOMM community.

Download from ACM

Important concepts in data communications

Craig Partridge

Abstract

The data communications field recently marked the 50th anniversary of the start of the ARPANET, which was one of the first and certainly the most influential of the early data communications networks. The anniversary provoked discussions about which concepts or ideas in data communications have proven to be enduring in the evolution of data communications. This paper presents one perspective.

Download from ACM

M-Lab: user initiated internet data for the research community

Phillipa Gill, Christophe Diot, Lai Yi Ohlsen, Matt Mathis, Stephen Soltesz

Abstract

Measurement Lab (M-Lab) is an open, distributed server platform on which researchers have deployed measurement tools. Its mission is to measure the Internet, save the data and make it universally accessible and useful. This paper serves as an update on the MLab platform 10+ years after its initial introduction to the research community [5]. Here, we detail the current state of the M-Lab distributed platform, highlight existing measurements/data available on the platform, and describe opportunities for further engagement between the networking research community and the platform.

Download from ACM

Roadmap for edge AI: a Dagstuhl perspective

Aaron Yi Ding, Ella Peltonen, Tobias Meuser, Atakan Aral, Christian Becker, Schahram Dustdar, Thomas Hiessl, Dieter Kranzlmüller, Madhusanka Liyanage, Setareh Maghsudi, Nitinder Mohan, Jörg Ott, Jan S. Rellermeyer, Stefan Schulte, Henning Schulzrinne, Gürkan Solmaz, Sasu Tarkoma, Blesson Varghese, Lars Wolf

Abstract

Based on the collective input of Dagstuhl Seminar (21342), this paper presents a comprehensive discussion on AI methods and capabilities in the context of edge computing, referred as Edge AI. In a nutshell, we envision Edge AI to provide adaptation for data-driven applications, enhance network and radio access, and allow the creation, optimisation, and deployment of distributed AI/ML pipelines with given quality of experience, trust, security and privacy targets. The Edge AI community investigates novel ML methods for the edge computing environment, spanning multiple sub-fields of computer science, engineering and ICT. The goal is to share an envisioned roadmap that can bring together key actors and enablers to further advance the domain of Edge AI.

Download from ACM