In this blog post, we introduce our new work [1], presented and published at the IEEE GLOBECOM 2023 conference.
Diverse services and devices generate heterogeneous traffic patterns, so selecting appropriate ML models and allocating ML resources according to the network’s current ML resource usage is crucial [2]. Managing these aspects manually is difficult, because the operator has to weigh ML resource availability, specific service requirements, the problem type for the ML/AI application, ML periodicity (data collection interval and prediction frequency), acceptable error thresholds, and desired model accuracy. Manual provisioning can result in inadequate model selection and suboptimal allocation of ML resources, which in turn leads to ineffective problem mitigation, degraded network QoE/QoS, increased OPEX for network operators, or subscriber loss. Hence, there is a need to automate ML model selection, the choice of related parameters (periodicity, errors, accuracies), and ML resource provisioning. The problem statement is depicted in Figure 1.
In 5G networks, where a large number of services and cells generate vast amounts of data, manual management of ML services can lead to suboptimal or inappropriate selection of ML packages. Such choices use ML resources inefficiently and degrade the performance of ML training and prediction. The resulting suboptimal or inappropriate mitigation solutions may degrade QoS/QoE and violate service level agreements (SLAs) [3]. These degradations or SLA violations can lead to subscriber churn as users abandon the network due to poor service quality [3], and they can also increase the operator’s OPEX.
Figure 1. Problem Statement
To tackle the challenges mentioned above, we introduce AutoMLPoweredNetworks: an automated solution for Machine Learning (ML) service provisioning in next-generation networks.
Figure 2. Proposed AutoMLPoweredNetworks Framework
AutoMLPoweredNetworks saves ML resources by training a model only for the representative cell of each group and reusing that trained model for the remaining cells in the group, which need no training of their own.
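To make the reuse idea concrete, here is a minimal sketch of how grouping cells and training once per group could reduce the number of trainings. It is not the grouping or representative-selection algorithm from the paper: the Pearson-correlation rule, the `threshold` value, the `group_cells` helper, and the synthetic traffic series are all assumptions made for illustration.

```python
import numpy as np


def group_cells(series, threshold=0.9):
    """Greedy grouping (hypothetical rule, not the paper's algorithm).

    A cell joins the first group whose representative correlates with it
    at or above `threshold`; otherwise it starts a new group and becomes
    that group's representative.
    """
    corr = np.corrcoef(series)  # pairwise Pearson correlation, (n_cells, n_cells)
    groups = {}                 # representative index -> list of member indices
    for cell in range(series.shape[0]):
        for rep in groups:
            if corr[cell, rep] >= threshold:
                groups[rep].append(cell)
                break
        else:
            groups[cell] = [cell]
    return groups


# Synthetic data: 6 cells, 48 hourly samples, two underlying traffic shapes.
rng = np.random.default_rng(0)
t = np.arange(48)
daily = np.sin(2 * np.pi * t / 24)  # cells with a daily cycle
ramp = t / 48.0                     # cells with steadily growing load
series = np.stack(
    [daily + 0.01 * rng.normal(size=t.size) for _ in range(3)]
    + [ramp + 0.01 * rng.normal(size=t.size) for _ in range(3)]
)

groups = group_cells(series)

# Only one model is trained per group (on the representative); the other
# members reuse it, so each of them counts as one saved training.
trainings_saved = series.shape[0] - len(groups)
print(groups, trainings_saved)
```

In this toy setup the six cells collapse into two groups, so two trainings replace six. A real deployment would also need to check that the reused model is accurate enough on each non-representative cell.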
We conducted our experiments using 961 cells from a real-field 5G operator dataset, which included three network Key Performance Indicators (KPIs): Cell Data Rate, Resource Usage Percentage, and the number of connected UEs. The datasets consisted of one month’s worth of data collected at hourly intervals. We divided the datasets using an 80-20 split, with 80% for training and 20% for validation. For our experiments, we employed two deep learning models: a Long Short-Term Memory (LSTM) model and a neural network (NN) model. The experiments were performed on an Intel Xeon Platinum 8268 CPU (8 cores, 16 GB RAM). We used the Keras RNN API for implementation. The experiment parameters are outlined in Table 1.
Table 1. Experiment Parameters.
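The data preparation described above can be sketched as follows. This is only an illustration under stated assumptions: the split is taken to be chronological (the text gives only the 80-20 ratio), the month is assumed to have 30 days, the 24-hour `lookback` is a hypothetical choice, and the KPI series is synthetic. The helpers `chronological_split` and `make_windows` are not from the paper.

```python
import numpy as np


def chronological_split(values, head_fraction=0.8):
    """Split a time series into a leading part and a trailing validation part."""
    cut = int(round(len(values) * head_fraction))
    return values[:cut], values[cut:]


def make_windows(values, lookback):
    """Build (X, y) pairs: `lookback` past samples predict the next sample."""
    X = np.stack([values[i:i + lookback] for i in range(len(values) - lookback)])
    y = values[lookback:]
    # Add a trailing feature axis to get the (samples, timesteps, features)
    # layout that Keras recurrent layers such as LSTM take as input.
    return X[..., np.newaxis], y


hours = 30 * 24                                  # assumed 30-day month, hourly samples
kpi = np.sin(2 * np.pi * np.arange(hours) / 24)  # synthetic stand-in for one cell's KPI

fit_part, val_part = chronological_split(kpi)    # first 80%, last 20%
X_fit, y_fit = make_windows(fit_part, lookback=24)
X_val, y_val = make_windows(val_part, lookback=24)

print(X_fit.shape, y_fit.shape, X_val.shape, y_val.shape)
```

Splitting by time rather than at random keeps the validation samples strictly after the samples used for fitting, which is a common precaution for forecasting tasks.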
We evaluated three applications on the 5G data against the proposed ML measurement KPIs.
Figure 3. Average Gains
Figure 4. Reduction in Trainings
Figure 3 shows the average gains across the ML measurement KPIs for 5G network data. Averaged over the network KPIs, we saved 39.94% of server processing time, 60.46% of server memory consumption, and 75.11% of server utilization. For the Resource Usage Percentage KPI, 97.89% of cells achieved good training accuracy when the similar cells in a group used the model trained on the group’s representative cell.
Figure 4 depicts the number of trainings saved for each KPI on 5G data. We saved 651 trainings for the Cell Data Rate KPI, 571 trainings for the Resource Usage Percentage KPI, and 725 trainings for the number of connected UEs KPI. On average, we saved 649 trainings across the network KPIs.
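The reported average can be checked directly, and the savings can be put in proportion to the 961 cells in the dataset. The percentages below rest on an assumption that the text does not state explicitly: that the baseline is one training per cell per KPI, i.e. 961 trainings.

```python
total_cells = 961  # cells in the 5G dataset

# Trainings saved per network KPI, as reported in the text.
saved = {
    "Cell Data Rate": 651,
    "Resource Usage Percentage": 571,
    "Connected UEs": 725,
}

average_saved = sum(saved.values()) / len(saved)  # (651 + 571 + 725) / 3 = 649.0

# Share of trainings avoided, ASSUMING a baseline of one training per cell.
share_saved = {kpi: 100 * n / total_cells for kpi, n in saved.items()}

for kpi, n in saved.items():
    print(f"{kpi}: {n} saved ({share_saved[kpi]:.1f}% of {total_cells})")
print(f"Average: {average_saved:.0f} saved")
```

Under that baseline assumption, roughly 59% to 75% of the per-cell trainings are avoided, depending on the KPI.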
This work introduces a framework that automates the selection of ML models and the optimization of ML resources. We evaluated the framework on real 5G operator data, and the results showed significant improvements on the proposed ML measurement KPIs.
As future work, we aim to enhance the cell grouping algorithms by including more parameters. We intend to automate learning from the evaluation application so that the re-training layers are decided dynamically, rather than relying on a fixed number of re-training epochs. We also plan to enhance the representative cell selection algorithm by including more similarity metrics.
[1] Sukhdeep Singh et al., “AutoMLPoweredNetworks: Automated Machine Learning Service Provisioning for NexGen Networks”, in 2023 IEEE Global Communications Conference (GLOBECOM), Kuala Lumpur, Malaysia, Dec. 2023.
[2] J. Thaliath et al., “Predictive Closed-Loop Service Automation in O-RAN Based Network Slicing”, in IEEE Communications Standards Magazine, vol. 6, no. 3, pp. 8-14, September 2022, doi: 10.1109/MCOMSTD.0001.2200017.
[3] V. P. Kafle et al., “Automation of 5G Network Slice Control Functions with Machine Learning”, in IEEE Communications Standards Magazine, vol. 3, no. 3, pp. 54-62, September 2019, doi: 10.1109/MCOMSTD.001.1900010.