Communications

Practical AI-Driven Traffic Classification for Next Gen Service-Aware RAN

By Sunhyun Kim Samsung Research

By Sangho Lee Samsung Research

By Daeun Ko Samsung Research

By Jeonga Lim Samsung Research

By Hyungwoo Ku Samsung Research

1. Introduction

Modern mobile networks are carrying increasingly diverse types of traffic, including video streaming, cloud gaming, Exteded Reality (XR) / Virtual Reality (VR) services, and emerging AI-driven applications. Each service has its own traffic characteristics and quality-of-service (QoS) requirements, making real-time analysis of traffic behavior essential for efficient Radio Access Network (RAN) operation. Traffic classification allows the network to identify service characteristics and optimize network control accordingly. For example, service-awareness, leveraging traffic pattern information, enables RAN operations to adaptively control Radio Resource Control (RRC) state. This approach can improve radio resource allocation, optimize QoS management, reduce unnecessary signaling, and enhance UE power efficiency.

With the growing diversity of mobile services and increasingly dynamic network conditions, network optimization can no longer rely solely on static configurations and predefined policies. This requires the RAN to continuously understand traffic behavior and adapt its operation accordingly. As networks evolve toward 6G, such service-aware operation is becoming an essential component of intelligent and autonomous RAN systems.

Recently, AI-RAN has emerged as a key direction for enabling self-aware and adaptive network operation [1]. By integrating artificial intelligence (AI) into the RAN, networks can dynamically analyze traffic patterns and adaptively optimize network behavior without relying on static rule-based policies. In particular, AI-based traffic classification has garnered significant attention because it can infer traffic characteristics even in encrypted traffic using statistical and temporal traffic features [2][3].

However, deploying AI-driven traffic classification in RAN systems presents significant challenges. Current methods typically depend on computationally intensive deep learning models or large-scale manually labeled datasets [4], rendering them unsuitable for real-time RAN environments with stringent latency and resource limitations. Furthermore, the continuous collection and maintenance of labeled traffic datasets become increasingly impractical as new applications and traffic patterns rapidly emerge.

To address these challenges, this blog introduces a practical AI-driven traffic classification framework for service-aware RAN operations. The proposed scheme combines clustering-based pseudo-labeling with lightweight flow-level inference to enable practical deployment in the Central Unit (CU). Furthermore, the classification results are utilized for adaptive RRC state control to reduce UE power consumption while minimizing signaling overhead. Experimental results in a testbed demonstrate the feasibility and effectiveness of the proposed approach. Beyond RRC optimization, the proposed framework can also be extended to various service-aware RAN functions that require real-time traffic understanding in future AI-native 6G networks.

2. Technical Challenges

Traffic classification systems in RAN environments encounter challenges at both the data processing and the model deployment levels. Achieving accurate, real-time classification while maintaining low computational overhead demands carefully designed algorithms and system architectures. Below we identify challenges in more detail.

Data Processing Level:

Feature Extraction while maintaining User Plane Throughput: Extracting meaningful features from raw packet data in real-time without degrading the throughput is a key requirement. Efficient processing pipelines are required to minimize overhead while ensuring real-time operation.

Temporal Dynamics: Traffic patterns present complex temporal dependencies that diverge across different applications. Capturing these patterns necessitates carefully chosen measurement windows and effective feature engineering.

Label Scarcity: Obtaining accurate labels for training supervised models is costly and time-consuming. The rapid emergence of new applications and services further intensifies this challenge.

Model Deployment Level:

Computational Budget: The CU has limited computational resources shared among various functions. Traffic classification models must operate efficiently within strict CPU and memory constraints.

Inference Latency: Classification decisions must be made quickly enough to support RRC state control. The entire pipeline, from packet reception to classification output, must complete within a time budget to be applicable for RRC state control.

Model Adaptability: Traffic characteristics evolve as applications update and new services emerge. The classification system must adapt to these changes without relying on frequent retraining with labeled data.

3. The Proposed Scheme

To address these technical challenges, we propose a lightweight AI-driven traffic classification framework designed for practical deployment in the CU. This approach integrates clustering-based pseudo-labeling and lightweight flow-level inference to enable real-time, service-aware RAN optimization.

3.1 System Overview

The overall architecture of the proposed scheme consists of two main stages as illustrated in Figure 1:

Figure 1. Overall architecture

Model Training Stage: Traffic data is first collected from the CU, capturing packet-level information including timestamps, packet sizes, and flow identifiers (5-tuple information: source IP, destination IP, source port, destination port, protocol). Clustering algorithms are then applied to generate labeled training data. A traffic classification model is trained using this data and subsequently deployed to the CU for RRC state control.

Model Inference and RRC State Control Stage: The deployed model performs real-time traffic classification on incoming flows and dynamically adjusts RRC state control parameters, based on the classified traffic type.

This design enables a data-driven pipeline that eliminates the need for manual labeling and supports real-time deployment in the CU. The separation of training and inference stages allows that model updates can be performed without disrupting ongoing operations.

3.2 Clustering-based Labeling

Since manual labeling is costly and does not scale well with increasing traffic diversity, we adopt an unsupervised approach using clustering. The labeling process consists of the following steps:

Step 1 - Data Collection: Network traffic data is collected from the CU, capturing packet-level information including timestamps, packet sizes, and flow identifiers.

Step 2 - Clustering: Flows are grouped based on statistical features using an un-supervised clustering algorithm. The clustering algorithm is particularly suitable for this task because it does not require pre-specifying the number of clusters, can identify clusters of arbitrary shape, and naturally handles noise and outliers.

Step 3 - Flow Selection: Clusters corresponding to target services are identified based on cluster characteristics such as average packet size, inter-arrival time distribution, and temporal patterns. Representative flows from selected clusters are assigned labels (e.g., video), while other flows are labeled as other traffic. To improve label quality, flow selection considers temporal information - specifically, flows that do not overlap in time are greedily selected to avoid redundant sampling.

Figure 2. Clustering-based labeling process

3.3 Traffic Classification

Traffic is measured by collecting packet data within a defined measurement window, which is then segmented into flows based on 5-tupe information. Statistical features of downlink and uplink packets are extracted to capture temporal traffic patterns. The labeled dataset is used to train a lightweight flow-level classification model capable of distinguishing between:

Video on Demand (VoD): Characterized by intermittent burst patterns

Live Streaming: Characterized by continuous traffic flow

Other Traffic: Represents other traffic types that do not align with the above categories.

3.4 Adaptive RRC State Control

Different traffic types exhibit distinct patterns, which can be leveraged to optimize RRC state control. The key insight is that the RRC release timing can be dynamically adjusted based on the classified traffic type. As shown in Figure 3, when traffic is classified as VoD with intermittent bursts, a shorter inactivity timer allows the UE to quickly transition to idle state during gaps, thereby saving power. For continuous traffic like live streaming, a longer timer prevents unnecessary frequent state transitions, ensuring efficient network operation. Therefore, this approach enhances UE power efficiency while minimizing the increase in signaling overhead.

Figure 3. Adaptive RRC state control based on traffic classification showing different timer values for VoD and live streaming

Table 1. Adaptive inactivity timer based on traffic type

4. Implementation Results

4.1 Experimental Environment

Experiments were conducted in an in-house testbed using a commercial smartphone and base station. The detailed setup is shown in Table 2:

Table 2. Experimental environment specification

The core network and CU/DU are built on a commercially deployed CU package, reflecting practical deployment conditions. Traffic data was collected using YouTube applications on the smartphone, including both VoD and live streaming sessions, along with background traffic.

4.2 Traffic Classification Performance

Overall, the results show reliable classification performance, effectively distinguishing between target and non-target traffic. The classification accuracy of 98% demonstrates that the clustering-based labeling approach produces high-quality training data.

The UE power consumption and the classification accuracy of the proposed scheme were measured through repeated tests of the same video using a power monitoring tool in a controlled environment.

By applying adaptive RRC state control, the proposed scheme allows the UE to transition to the idle state more frequently. Despite the power consumption associated with VoD playback, the results validate that optimized RRC control is effective in reducing UE power consumption while maintaining minimal signaling overhead. The observed smartphone power saving of over 2%, though seemingly modest, translated to a significant extension in battery life when accumulated over typical daily usage patterns.

5. Conclusion

This blog outlines the motivation and concept of AI-driven traffic classification for RRC state control in 6G systems. As a practical solution addressing the challenges of real-time traffic classification in RAN environments, we have developed a traffic classification and RRC state control framework that incorporates clustering-based labeling and lightweight inference. Its classification accuracy, inference latency, and power-saving benefits have been validated through extensive experiments in an in-house testbed with commercial setup.

Key Contributions:

Self-aware Traffic Understanding: RAN automatically identifies traffic characteristics without manual intervention, enabling intelligent resource management.

Adaptive RRC State Control: Reduces UE power consumption without increasing signaling overhead, achieving measurable power savings in our experiments.

Practical Deployment: Demonstrated high classification accuracy and performance in a commercial environment.

The future work will focus on expanding the traffic classification to support more traffic types, improving the clustering-based labeling with semi-supervised learning approaches, and integrating the framework with other RAN optimization functions that improve the actual quality of experience for each traffic type.

References

[1] AI-RAN Alliance, “AI-RAN Alliance Vision and Mission White Paper,” 2025.
[2] Aceto, Giuseppe, et al. "Mobile encrypted traffic classification using deep learning: Experimental evaluation, lessons learned, and challenges." IEEE transactions on network and service management 16.2 (2019): 445-458.
[3] Shapira, Tal, and Yuval Shavitt. "FlowPic: A generic representation for encrypted traffic classification and applications identification." IEEE Transactions on Network and Service Management 18.2 (2021): 1218-1232.
[4] Lin, Xinjie, et al. "Et-bert: A contextualized datagram representation with pre-training transformers for encrypted traffic classification." Proceedings of the ACM Web Conference 2022. 2022.

#6G #AI-RAN