Fake-News-Detection

Aim

This project aims to build a fake news classifier that can accurately distinguish fake news from genuine news. We also developed a simple UI that lets users quickly verify news articles.

Technologies Used:

Frontend - HTML, CSS, and JavaScript for building the web app
Backend - Flask, for serving the app on localhost
Framework - Jupyter Notebook for the machine learning and deep learning models
Open-source environment - Spyder

Dataset: 20,800 samples, of which 10,387 are real news and 10,413 are fake news. The dataset is publicly available here

Machine Learning / Deep Learning Algorithms used

1. Naive Bayes 
2. Decision Tree
3. SVM (Support Vector Machine)
4. TF-IDF Tokenizer with Passive Aggressive Classifier
5. BERT

Figure 1: Pipeline of the approach used

Pipeline & Output:

a) Main Model 1: BERT Tokenizer with BERT model

Figure 2: Pipeline for BERT Tokenizer with BERT model

b) Main Model 2: TF-IDF Tokenizer with Passive Aggressive Classifier

Figure 3: Pipeline for TF-IDF Tokenizer + Passive Aggressive Classifier
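
To make the mechanics of Main Model 2 concrete, here is a minimal, dependency-free sketch of TF-IDF weighting followed by a Passive Aggressive (PA-I) classifier. It is an illustration under stated assumptions, not the code in this repository, which may well rely on a library implementation instead. The function names, the whitespace tokenizer, the toy headlines, and the label convention (+1 for fake, -1 for real) are all made up for the example.

```python
import math
from collections import Counter

def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()

def fit_idf(docs):
    """Smoothed inverse document frequency for every term in the corpus."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}

def tfidf(doc, idf):
    """L2-normalised TF-IDF vector as a sparse dict; unseen terms are dropped."""
    counts = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def train_passive_aggressive(vectors, labels, C=1.0, epochs=5):
    """PA-I: do nothing when the margin is at least 1 (passive); otherwise
    shift the weights just enough to fix the example, capped by C (aggressive)."""
    w = {}
    for _ in range(epochs):
        for x, y in zip(vectors, labels):  # y is +1 (fake) or -1 (real), an assumed convention
            margin = y * sum(w.get(t, 0.0) * v for t, v in x.items())
            loss = max(0.0, 1.0 - margin)
            if loss > 0.0 and x:
                tau = min(C, loss / sum(v * v for v in x.values()))
                for t, v in x.items():
                    w[t] = w.get(t, 0.0) + tau * y * v
    return w

def predict(w, x):
    """Return +1 (fake) or -1 (real) from the sign of the linear score."""
    score = sum(w.get(t, 0.0) * v for t, v in x.items())
    return 1 if score >= 0 else -1

# Toy corpus, invented for illustration only.
docs = [
    "shocking secret cure doctors hate",
    "you will not believe this shocking miracle",
    "senate passes budget bill after debate",
    "city council approves new transit budget",
]
labels = [1, 1, -1, -1]

idf = fit_idf(docs)
vectors = [tfidf(d, idf) for d in docs]
w = train_passive_aggressive(vectors, labels)
print(predict(w, tfidf("shocking miracle cure", idf)))
```

Because the update is applied one example at a time, a model of this kind can keep learning as new articles arrive, which is one common reason to pair it with TF-IDF features for news text.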

WebApp:

https://samyaksand-fakenewsnlp.pythonanywhere.com/

An end-to-end deployed tool that lets users verify news articles in a click.
This project is a first step toward a mechanism for limiting the spread of fake news on social networks. Its results are intended to inform the design of future online social networks.

Features

Users can enter input directly.
Users can load random news articles.
Users can see the text processing pipeline live.
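
As a rough idea of how a live view of the text processing pipeline can be produced, the sketch below records the output of each preprocessing step so a UI could display them in order. The specific steps, the function name `preprocess_with_trace`, and the tiny stopword list are assumptions made for illustration; the steps the web app actually runs are not documented here.

```python
import re

# Tiny illustrative stopword list (an assumption); a real app would use a fuller one.
STOPWORDS = {"a", "an", "the", "is", "are", "in", "on", "of", "to", "and", "this"}

def preprocess_with_trace(text):
    """Return (tokens, trace), where trace lists each step name with its output."""
    trace = []

    lowered = text.lower()
    trace.append(("lowercase", lowered))

    # Replace anything that is not a lowercase letter or whitespace with a space.
    letters_only = re.sub(r"[^a-z\s]", " ", lowered)
    trace.append(("strip punctuation and digits", letters_only))

    tokens = letters_only.split()
    trace.append(("tokenize", tokens))

    kept = [t for t in tokens if t not in STOPWORDS]
    trace.append(("remove stopwords", kept))

    return kept, trace

tokens, trace = preprocess_with_trace("BREAKING: The Senate is voting on 3 bills!")
for step, output in trace:
    print(f"{step}: {output}")
```

Returning the trace alongside the final tokens keeps the display logic separate from the preprocessing itself, so the same function can feed both the classifier and the UI.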

