Credit Card Fraud Detection on Streamlit

Overview

Credit card fraud is a major concern for both consumers and financial institutions. Fraudulent transactions can lead to financial losses and damage to the reputation of financial institutions. Machine learning techniques have been used extensively to detect fraudulent transactions. In this project, we use logistic regression to classify transactions as either legitimate or fraudulent based on their features.

Dataset

The data used in this project is a CSV file containing credit card transaction data. The data has 31 columns and 284,807 rows. The "Class" column is the target variable, which indicates whether the transaction is legitimate (Class = 0) or fraudulent (Class = 1).

Link to the dataset: (https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud)

Preprocessing

Before training the model, we first separate the legitimate and fraudulent transactions. Since the data is imbalanced, with significantly more legitimate transactions than fraudulent transactions, we undersample the legitimate transactions to balance the classes. We then split the data into training and testing sets using the train_test_split () function.

Model Used

We use logistic regression to classify transactions as either legitimate or fraudulent based on their features. Logistic regression is a widely used classification algorithm that models the probability of an event occurring based on input features. The logistic regression model is trained on the training data using the LogisticRegression () function from scikit-learn. The trained model is then used to predict the target variable for the testing data..

Evaluation

The performance of the model is evaluated using the accuracy metric, which is the fraction of correctly classified transactions. The accuracy on the training and testing data is calculated using the accuracy_score() function from scikit-learn.

Validation Set:

Accuracy Score: 0.975
Balanced Accuracy Score: 0.736
ROC AUC Score: 0.736

Test Set:

Accuracy Score: 0.966
Balanced Accuracy Score: 0.686
ROC AUC Score: 0.686

Streamlit Application:

We use Streamlit to create a user interface for the credit card fraud detection project. The Streamlit application allows the user to upload a CSV file containing credit card transaction data, and the uploaded data is used to train the logistic regression model. The user can also input transaction features and get a prediction on whether the transaction is legitimate or fraudulent.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Credit Card Fraud Detection using Machine Learning.ipynb		Credit Card Fraud Detection using Machine Learning.ipynb
README.md		README.md
app.py		app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Credit Card Fraud Detection on Streamlit

Overview

Dataset

Preprocessing

Model Used

Evaluation

Validation Set:

Test Set:

Streamlit Application:

About

Releases

Packages

Languages

aishikaranjan/CCFraud_Streamlit

Folders and files

Latest commit

History

Repository files navigation

Credit Card Fraud Detection on Streamlit

Overview

Dataset

Preprocessing

Model Used

Evaluation

Validation Set:

Test Set:

Streamlit Application:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages