AI and REAL Image Classification

Introduction

An image classification model that takes an image as an input, and predicts whether the image is real or created by A.I. based on the results from two models.

Models Developed/Implemented

A Convolutional Neural Network (CNN) architecture was developed by our team
An EfficientNet-based model architecture implemented by our team.

Team

Names	Primary Roles	Secondary Roles
Farhikhta Farzan	Data collector and cleaner	Model Interpreation/ Visualization and deployment
Keira James	Feature Engineer	Model Developer
Tesneem Essa	Feature Engineer	Model Developer

Deployed Version

Try the project here 🎨 !

https://huggingface.co/spaces/Digital-Detectives/AI-vs-Real-Image-Detection

UI

_demo_.mp4

Project Outline

The goal of this project is to develop a deep learning model that can accurately distinguish between real images and AI-generated images. We will collect datasets of real images and fake images. The data will be preprocessed, normalized, and augmented to enhance training. Using TensorFlow and Keras, we will design a Convolutional Neural Network (CNN) for classification, and validating performance through a confusion matrix. Finally, the project will include documentation of the process, findings, and suggestions for future improvements.

Datasets Used

CIFAKE: Real and AI-Generated Synthetic Images
CIFAKE is a dataset that contains 60,000 synthetically-generated images and 60,000 real images (collected from CIFAR-10). The dataset contains two classes - REAL and FAKE. For REAL, images are collected from Krizhevsky & Hinton's CIFAR-10 dataset. For the FAKE images, the equivalent of CIFAR-10 with Stable Diffusion version 1.4 was generated. There are 100,000 images for training (50k per class) and 20,000 for testing (10k per class)

Paintings from 10 different popular artists
This data set is about paintings from 10 different artists. The artists are davinci, frida kahlo, henri matisse, jackson pollock, johannes vermeer, picasso, piere auguste, raphael, rembrandt, and van gough

ArtStyles-Dataset
ArtStyles dataset contains 360 images containing 3 different digital art styles. The art style includes Anime, Comic and Semi Realism

DALLE Art March, 2023
These are AI-generated art images using Midjourney for the month of March 2023. It contains around 660 files

Detecting AI-generated Artwork
This dataset was produced as part of the study "AI Generated Art: Latent Diffusion-Based Style and Detection". It contains 1705 fake images and 1705 real images.

Midjourney Images & Prompt
This dataset is an extensive collection of pictures produced by using the mid-journey idea. It provides a wide range of varied photos with corresponding prompts that are created automatically by means of an advanced captioning system.This dataset, designed primarily for diffusion model training, is a valuable resource for improving machine learning capabilities in image generation. Using the picture and accompanying cues, researchers can delve into the complexities of training stable diffusion models capable of producing visuals resembling mid-journey scenarios.

BEAR x DALLE - Robot Illustrations
This is a dataset for illustrations of robots in the styles of well-known artists, art genre styles, perspectives, formats, and lighting. This is a dataset of illustrations of robots in the styles of well-known artists, art genre styles, perspective, formats, and lighting, as part of the BEAR x DALLE project, which focuses on computational art.

Generated Abstract Art Gallery
This dataset used Generative x Diffusion models to generate 512x512 AI images with abtract style art.

Anime Chibi Datasets
Anime Chibi Characters Datasets : Scraped From safebooru.org/Scraped with tags "chibi standing white_background solo -translation_request -text" Aims to be used in gans to generate Chibi characters

Cats and Dogs Cartoons
This is a great dataset for learning image classification. The dataset has 400 images (200 cats and 200 dogs) in hand drawn cartoon style. Each image is 1024x1024 pixels. The data was generated in MidJourney and was proved to be very useful in my Machine Learning classes.

Indian Paintings Dataset
Dive into the vibrant kaleidoscope of Indian art with our meticulously curated Indian Painting Styles Dataset! Number of classes - 8 (gond, kalighat, kangra, kerala mural, madhubani, mandana, pichwai, warli paintings) Total Images - 2249 images. File Formats - .jpg, .jpeg, .png, .webp

Technologies Used

Kaggle
Numpy
Panda
cv2
Pillow
Seaborn
Tensorflow/Keras
Scikit-Learn
Streamlit

Model Performance

CNN Model:

  Accuracy: 97.62%
  Precision: 97.6%
  Recall : 97.6%
  F1 score : 97.6%

Efficientnet Model:

  Accuracy: 97.72%
  Precision: 97.4%
  Recall : 98.05%
  F1 score: 97.72%

Efficientnet Art Model:

  Accuracy: 98%
  Precision: 97%
  Recall: 98%
  F1 score: 98%

Model Evaluation

CNN Model 📸

The model demonstrates strong performance in classifying real images, particularly those depicting nature, animals, and humans. However, it faces challenges in accurately identifying certain fake images. The model struggles with discerning AI-generated human headshots, occasionally misclassifying them as real. It also misclassifies certain art images, including fantasy-style art and specific scenic art, as real. Despite these limitations, the model excels in detecting fake images overall, making it a reliable tool for identifying real versus fake content in various domains.

Efficientnet Model 🎨

The model exhibits exceptional performance in identifying real images, including high-quality real art images. It also demonstrates strong capabilities in detecting fake images, particularly AI-generated headshots, art portraits, and nature landscapes. However, it encounters difficulties with certain AI-generated digital art styles, such as 3D animated art, neon art, cyberpunk art etc.. Despite these challenges, the model remains highly effective in accurately classifying real images and provides reliable results in detecting fake ones overall.

Note The EfficientNet model performs better than the CNN model in areas where CNN struggles. Conversely, the CNN model shows better performance in areas where EfficientNet has limitations.

Efficientnet art Model 🥸

This model demonstrates exceptional performance in identifying fake images, particularly in categories such as animals, nature, and art. However, it faces challenges in accurately identifying real images.

Practical Application

Our project is crucial in today’s world, where AI-generated content is increasingly prevalent. This model can be used by social media platforms, news organizations, and even everyday people to verify the authenticity of images, helping to fight against misinformation and ensure the integrity of visual media.

Set up

Navigate to desired library

  cd your_directory

Clone repository

  git clone https://github.com/KeiraJames/CTP-Project-2024.git

Navigate to repo

 cd CTP-Project-2024

Create virtual environment

 python -m venv .aivsreal-venv

Activate virtual environment (MAC)

 source .aivsreal/bin/activate

Install the requirements

  pip install -r requirements.txt

Run streamlit

  streamlit run app.py

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
CNN_model_weight		CNN_model_weight
EfficientNet_Models		EfficientNet_Models
Models		Models
data_cleaning		data_cleaning
datasets		datasets
styles		styles
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI and REAL Image Classification

Introduction

Models Developed/Implemented

Team

Deployed Version

UI

Project Outline

Datasets Used

Technologies Used

Model Performance

Model Evaluation

CNN Model 📸

Efficientnet Model 🎨

Efficientnet art Model 🥸

Practical Application

Set up

About

Releases

Packages

Contributors 3

Languages

KeiraJames/Real-vs-AI-Image-Detector

Folders and files

Latest commit

History

Repository files navigation

AI and REAL Image Classification

Introduction

Models Developed/Implemented

Team

Deployed Version

UI

Project Outline

Datasets Used

Technologies Used

Model Performance

Model Evaluation

CNN Model 📸

Efficientnet Model 🎨

Efficientnet art Model 🥸

Practical Application

Set up

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages