On the Applicability of Network Digital Twins in Generating Synthetic Data for Heavy Hitter Discrimination

Abstract

Differentiating between benign and malicious heavy hitter (HH) flows is a significant challenge for telecommunication infrastructure, as they cause significant congestion and degraded quality of service. Accurate identification is essential for effective mitigation, but current methods lack the granularity needed to distinguish between legitimate activity and malicious distributed denial-of-service (DDoS) traffic. To address this using machine learning (ML), a large labeled dataset is required, yet obtaining such datasets from live networks is infeasible due to privacy policies and operational constraints. To address these challenges, network digital twin (NDT) is proposed as an innovative approach to generate synthetic labeled data for ML applications tailored to complex network problems by emulating diverse and realistic network environments and traffic conditions. To demonstrate this approach, Telefonica’s Mouseworld NDT is extended for automated data collection and labeling of benign and malicious HH flows along with normal traffic. Results show that the ML model trained on this NDT-generated data accurately detects benign and malicious HH flows, validating the effectiveness of the proposed approach in creating realistic labeled data for the application of ML to complex network management solutions. For reproducibility and further research, the dataset and code are openly available.

Type
Publication
IEEE Communications Magazine
Amit Karamchandani
Amit Karamchandani
Predoctoral Researcher

Amit Karamchandani Batra, a predoctoral researcher and Ph.D. student at the Universidad Politécnica de Madrid, has contributed to EU-funded 5G cybersecurity projects, co-authored research papers, and received multiple academic awards, including for his B.Sc. and M.Sc. degrees.

Luis de la Cal
Luis de la Cal
Predoctoral researcher

I’m a PhD candidate focused on smart cities, collective intelligence, and innovative technologies like augmented reality and biosensors, with experience in research and entrepreneurship.

Alberto Mozo
Alberto Mozo
Head of the research group
Full professor

I am a Full Professor at the Technical University of Madrid (Universidad Politécnica de Madrid) and lead the Research Group on Mathematical Modeling and Biocomputing at the same institution.