This repository contains the code and models for our paper "Enhancing User Experience in Home Networks with Machine Learning-Based Classification", published in the ITU Journal on Future and Evolving Technologies. This work won the Best Student Solution Award in the 2022 ITU AI/ML in 5G Challenge.
Our time series characteristic-based method extracts thousands of descriptive statistics from time series sequences and achieves 67% validation accuracy, a 3% improvement over conventional models on this dataset. We also explored a Recurrent Neural Network (RNN) model, which yielded promising results with a validation accuracy of 58%.
Figure 1: Overview of the downlink network side architecture
- Introduction
- Dataset
- Data Processing
- Methodology
- Results
- Repository Structure
- Installation and Usage
- Acknowledgements
- Citation
With the rapid development of mobile Internet, home broadband quality has become a key factor in determining market competitiveness. This project aims to develop an efficient machine learning model to accurately evaluate home user network experiences, enabling network operators to proactively identify potential dissatisfied users and implement timely corrective measures.
The dataset, provided by ZTE, encompasses network indicator data from 500 anonymized users. It presents several challenges:
- Non-standard sampling rate and time range
- Uneven distribution of observations
- Multiple recorded observations for identical timestamps
- Constrained sample size
- Subjective definition of Internet experience
- Lack of essential information regarding the data collection setup
We use linear interpolation to resample the data to fixed intervals, making it more manageable and easier to work with. This technique is particularly useful for handling:
- Dense time series data
- Unevenly spaced sampling rates
- Non-standard time ranges
Figure 3: Illustration of linear interpolation process
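The resampling step can be sketched with NumPy's `np.interp`. This is a minimal stand-in for the actual preprocessing in `ts_regularization.ipynb`; the helper name, interval, and sample values below are illustrative, not taken from the repository:

```python
import numpy as np

def resample_linear(timestamps, values, interval):
    """Resample an unevenly spaced series onto a fixed-interval grid
    using linear interpolation (np.interp)."""
    grid = np.arange(timestamps[0], timestamps[-1] + interval, interval)
    return grid, np.interp(grid, timestamps, values)

# Irregular observations at t = 0, 1, 4, 6 seconds
t = np.array([0.0, 1.0, 4.0, 6.0])
x = np.array([10.0, 12.0, 18.0, 22.0])

grid, resampled = resample_linear(t, x, interval=2.0)
print(grid)       # [0. 2. 4. 6.]
print(resampled)  # [10. 14. 18. 22.]
```

The same idea extends to duplicate timestamps (aggregate them first) and per-user time ranges.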
Z = (x - μ) / σ
We apply Z-normalization to transform input vectors so their mean is approximately zero and standard deviation is close to one. This helps:
- Enable focus on structural patterns rather than amplitude differences
- Handle different units and scales across indicator columns
- Make comparisons between indicators more meaningful
Figure 4: Time series before and after Z‑normalization
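The Z-normalization formula above maps directly to a few lines of NumPy. The small `eps` term is an assumption added here to guard against constant (zero-variance) series; the input values are made up:

```python
import numpy as np

def z_normalize(x, eps=1e-8):
    """Apply Z = (x - mean) / std so the series has mean ~0, std ~1."""
    return (x - x.mean()) / (x.std() + eps)

latency_ms = np.array([20.0, 25.0, 30.0, 35.0, 40.0])
z = z_normalize(latency_ms)
print(z.mean())  # ~0
print(z.std())   # ~1
```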
For handling unevenly-sized time series data, we use low-noise padding by:
- Adding low-amplitude noise (1e-6) to shorter time series
- Padding sequences to match the length of the longest time series
- Preserving signal integrity while enabling uniform analysis
This approach was chosen over alternatives like uniform scaling, truncation, and ARIMA-based forecasting for its:
- Ease of implementation
- Superior performance on our specific problem
- Compatibility with our preprocessing pipeline
Figure 5: Time series before and after low‑noise padding
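The padding steps above can be sketched as follows. The 1e-6 noise amplitude comes from the text; the helper name and the fixed seed are assumptions for illustration:

```python
import numpy as np

def pad_with_noise(series_list, noise_amplitude=1e-6, seed=0):
    """Right-pad each series with low-amplitude Gaussian noise so all
    sequences match the length of the longest one."""
    rng = np.random.default_rng(seed)
    target = max(len(s) for s in series_list)
    padded = []
    for s in series_list:
        pad = rng.normal(0.0, noise_amplitude, target - len(s))
        padded.append(np.concatenate([s, pad]))
    return np.stack(padded)

batch = pad_with_noise([np.ones(3), np.ones(5)])
print(batch.shape)  # (2, 5)
```

Padding with near-zero noise rather than exact zeros avoids degenerate constant segments that can break variance-based feature extractors downstream.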
We explored various machine learning approaches:
- Traditional MTSC (Multivariate Time Series Classification) models:
  - ROCKET classifier
  - DTW-KNN
  - HIVE-COTE
- Deep learning models:
  - Convolutional Neural Networks (CNN)
  - Long Short-Term Memory (LSTM) networks
- Time Series Characteristic (TS-Char) models:
  - Manual feature extraction + XGBoost
  - TSFresh + PCA + XGBoost
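To illustrate the TS-Char idea, here is a tiny manual stand-in for TSFresh-style feature extraction: each series is summarized as a fixed-length vector of descriptive statistics that a downstream classifier (e.g. XGBoost) can consume. The helper and the specific statistics chosen are illustrative, not the actual feature set from `tsfresh.ipynb` or `manual_features.ipynb`:

```python
import numpy as np

def extract_ts_features(x):
    """Summarize one time series as a fixed-length feature vector."""
    diffs = np.diff(x)
    return np.array([
        x.mean(), x.std(), x.min(), x.max(),
        np.median(x),
        diffs.mean(), diffs.std(),          # trend / volatility
        np.corrcoef(x[:-1], x[1:])[0, 1],   # lag-1 autocorrelation
    ])

series = np.sin(np.linspace(0, 4 * np.pi, 100))
features = extract_ts_features(series)
print(features.shape)  # (8,)
```

TSFresh automates this at scale, extracting thousands of such statistics per series, after which PCA reduces the dimensionality before the XGBoost classifier.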
Our TSFresh + XGBoost model achieved the highest performance with 67% validation accuracy, outperforming other approaches in the 2022 ITU AI/ML in 5G Challenge. The LSTM model also showed promise with 58% accuracy.
Figure 6: TSFresh + XGBoost model confusion matrix
├── models/ # Trained models
│ ├── rnn_checkpoint/ # Best RNN checkpoint
│ └── xgboost_model/ # TSFresh + PCA + XGBoost model
├── notebooks/ # Implementation notebooks
│ ├── rocket.ipynb # ROCKET classifier implementation
│ ├── lstm_rnn.ipynb # LSTM RNN implementation
│ ├── manual_features.ipynb # Manual feature extraction
│ ├── tsfresh.ipynb # TSFresh + PCA + XGBoost
│ └── ts_regularization.ipynb # Data preprocessing
├── requirements.txt # Dependencies
└── docs/ # Documentation and paper
- Clone this repository
- Download the competition data from this Google Drive link and unzip it in the repository root
- Install the required packages:
pip install -r requirements.txt
- Navigate to the `notebooks/` directory to run the Jupyter notebooks
Note: A GPU is highly recommended for running the LSTM RNN notebook.
We would like to thank ITU for organizing the 2022 AI/ML in 5G Challenge and ZTE for providing the problem statement and dataset. Special thanks to the Telecommunication Standardization Bureau (TSB) of ITU for their support and collaboration in co-authoring the research paper.
If you use this code or our findings in your research, please cite:
@article{rai2024enhancing,
title={Enhancing User Experience in Home Networks with Machine Learning-Based Classification},
author={Rai, Rushat and Basikolo, Thomas},
journal={ITU Journal on Future and Evolving Technologies},
volume={5},
number={1},
year={2024},
publisher={International Telecommunication Union}
}
This project is licensed under CC BY-NC-ND 3.0 IGO.
This research has been published in the ITU Journal on Future and Evolving Technologies, Volume 5, Issue 1, March 2024. For more details, please refer to the full paper: https://www.itu.int/pub/S-JNL-VOL5.ISSUE1-2024-A12