Grievance Recognition System Based on Text/Speech

Introduction

This project develops an innovative Grievance Recognition System that utilizes advanced NLP and machine learning techniques to automate the processing of grievances via text and speech inputs. Aimed at improving efficiency in handling user grievances, the system is a part of our Major Project submission, Department of Electronics and Computer Engineering, Pulchowk Campus, Tribhuvan University.

Project Team

Ashwini Mandal
Mahima Dhakal
Nadika Poudel
Aashish Kumar Sah

Supervisor

Dr. Basanta Joshi, Assistant Professor, Department of Electronics and Computer Engineering, Pulchowk Campus

System Overview

The system is structured into three core components:

Automatic Speech Recognition (ASR): Utilizes the state-of-the-art wav2vec 2.0 model, fine-tuned for the Nepali language, to convert speech into text.
Text Classification: Employs both traditional machine learning models and deep learning approaches including LSTM, BiLSTM, and transformer-based models like NepaliBERT and MuRIL for efficient classification of the grievances.
Web Interface: A user-friendly web interface that facilitates easy submission and tracking of grievances.

Setup and Installation

Detailed instructions on how to set up and run the project locally.

Requirements

python version used -> 3.9.6

requirements.txt

Installation Steps

Clone the repository
Install Python dependencies
Setup Database
Run the backend server : port
Access the web interface at http://localhost:port.

Technical Description:

Project Overview

The Grievance Recognition System is designed to automate the handling of grievances received in text or speech formats using advanced machine learning and natural language processing techniques. This system is especially tuned to understand and process inputs in the Nepali language, accommodating the linguistic nuances essential for effective grievance handling.

System Architecture

Components

The system comprises three major components:

Automatic Speech Recognition (ASR)
Text Classification
Web Interface

Automatic Speech Recognition (ASR)

Model: wav2vec 2.0, fine-tuned for Nepali language.
Purpose: Converts speech input into text. It allows users to submit their grievances orally, making the system accessible to a broader user base, including those unable to write.
Process:
- The speech audio is captured and processed using a feature extractor.
- The processed features are then fed into the wav2vec 2.0 model to generate the corresponding text output.
Try our ASR model
- Try ASR model here

Text Classification

Models Used:
- Traditional ML models: SVM, Decision Trees.
- Deep Learning models: LSTM, BiLSTM.
- Transformer models: NepaliBERT, MuRIL.
Purpose: Classifies the text into categories such as complaints, queries, and suggestions, which can then be routed to the appropriate departments.
Process:
- Text input is tokenized and vectorized.
- The vectorized text is processed through the model to classify into predefined categories.
Try our fine-tuned models:
Nepali-bert fine-tuned for text classification
Muril fine-tuned for text classification

Web Interface

Technologies: Django
Features:
- Users can submit grievances through a text form or voice recording.
- Dashboard for tracking the status of submitted grievances.
- Admin panel for response management and analytics.

Data Flow

Input Acquisition: Users submit their grievances via the web interface using either text or speech.
Preprocessing:
- Speech to Text: Audio inputs are converted to text using the ASR component.
- Text Normalization: Text inputs are cleaned and normalized for processing.
Classification: The preprocessed text is then classified into categories using the text classification models.
Response Generation: Based on the classification, automated responses or routing decisions are made.
Output Delivery: Responses or status updates are communicated back to the user through the web interface.

Technologies

Backend: Python, Django, PyTorch, Hugging Face Transformers.
Frontend: HTML-CSS, Bootstrap.
Database: MySQL.
Machine Learning/Deep Learning: scikit-learn, TensorFlow, PyTorch.
Speech Processing: Librosa, Hugging Face Transformers.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.vscode		.vscode
Results		Results
backend		backend
major		major
media/audio		media/audio
notebook		notebook
static		static
templates		templates
.gitignore		.gitignore
db.sqlite3		db.sqlite3
manage.py		manage.py
oldrequirements.txt		oldrequirements.txt
readMe.md		readMe.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Grievance Recognition System Based on Text/Speech

Introduction

Project Team

Supervisor

System Overview