This project implements a machine learning pipeline for generating police sketches from textual descriptions using fine-tuned Stable Diffusion and CLIP models.
```bash
# Create virtual environment
python -m venv .venv

# Activate virtual environment
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```
- Download the CUHK Face Sketch FERET Dataset:

  ```bash
  python download_data.py
  ```

  This will download the dataset to the `data/` directory.
- Generate text descriptions using OpenAI's API:

  ```bash
  cd chatgpt_descriptions
  python gen_descriptions.py
  ```

  This script uses OpenAI's GPT model to generate detailed descriptions for each sketch. The descriptions are saved in `data/descriptions/`.
Note: These steps have already been completed, and the data is included in the repository.
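For reference, a minimal sketch of how per-sketch descriptions could be requested from the OpenAI API is shown below. The model name, prompt wording, and file paths are illustrative assumptions, and whether the actual script sends the sketch image itself is defined in `chatgpt_descriptions/gen_descriptions.py`, not here.

```python
# Hypothetical sketch only; the real prompts and logic live in gen_descriptions.py.
import base64
from pathlib import Path
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def describe_sketch(image_path: Path) -> str:
    """Ask a vision-capable GPT model for a detailed facial description of one sketch."""
    image_b64 = base64.b64encode(image_path.read_bytes()).decode()
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumption: any vision-capable model works here
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Describe this police sketch in detail: face shape, hair, "
                         "eyes, nose, mouth, and distinguishing features."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content

for sketch in sorted(Path("data/sketches").glob("*.png")):
    out = Path("data/descriptions") / f"{sketch.stem}.txt"
    out.write_text(describe_sketch(sketch))
```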
```bash
cd ..
python sd_fine_tune.py
```
This script performs an ablation study on different LoRA configurations:

- Self-attention only (`ablation_study_self_only/`)
- Cross-attention only (`ablation_study_cross_only/`)
- Both attention types (`ablation_study_both/`)
Key Finding: The study concluded that applying LoRA to both self- and cross-attention layers produces the most sketch-like and consistent results. This configuration was used for the final model.
Note: This study has been completed, and results are available in the respective directories.
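As a rough illustration of how the three variants can be expressed, the sketch below uses `peft` to attach LoRA adapters to the UNet's self-attention (`attn1`) or cross-attention (`attn2`) projections. The base checkpoint ID, rank, and module lists are assumptions; the actual configuration and training loop live in `sd_fine_tune.py`.

```python
# Illustrative LoRA targeting for the three ablation variants (not the exact script).
from diffusers import StableDiffusionPipeline
from peft import LoraConfig, get_peft_model

# Assumed base checkpoint; the repository may use a different one.
pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

# In the diffusers UNet, attn1.* are self-attention projections and
# attn2.* are cross-attention projections.
TARGETS = {
    "self_only":  ["attn1.to_q", "attn1.to_k", "attn1.to_v", "attn1.to_out.0"],
    "cross_only": ["attn2.to_q", "attn2.to_k", "attn2.to_v", "attn2.to_out.0"],
    "both":       ["to_q", "to_k", "to_v", "to_out.0"],  # matches attn1 and attn2
}

def add_lora(unet, variant: str, rank: int = 8):
    """Wrap the UNet with LoRA adapters on the chosen attention projections."""
    config = LoraConfig(
        r=rank,
        lora_alpha=rank,
        target_modules=TARGETS[variant],
        init_lora_weights="gaussian",
    )
    return get_peft_model(unet, config)

unet = add_lora(pipe.unet, "both")  # the configuration chosen for the final model
```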
```bash
python clip_fine_tune.py
```
Fine-tunes the CLIP model for better text-image alignment specific to police sketches. The final checkpoint is available at `clip_checkpoint_epoch_20.pt`.
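One common way to fine-tune CLIP on paired data is a symmetric contrastive (InfoNCE) objective over sketch/description pairs. The sketch below shows what a single training step could look like, assuming the `openai/clip-vit-base-patch32` checkpoint and the Hugging Face `transformers` API; the actual objective, schedule, and data loading are defined in `clip_fine_tune.py`.

```python
# Minimal sketch of one contrastive training step on (sketch, description) pairs.
import torch
import torch.nn.functional as F
from transformers import CLIPModel, CLIPProcessor

device = "cuda" if torch.cuda.is_available() else "cpu"
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").to(device)  # assumed base
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

def training_step(images, texts):
    """Pull matching sketch/description pairs together, push mismatches apart."""
    inputs = processor(text=texts, images=images, return_tensors="pt",
                       padding=True, truncation=True).to(device)
    outputs = model(**inputs)
    logits = outputs.logits_per_image                  # (batch, batch) similarities
    labels = torch.arange(len(images), device=device)  # diagonal entries are matches
    loss = (F.cross_entropy(logits, labels) +
            F.cross_entropy(logits.t(), labels)) / 2   # symmetric InfoNCE loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```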
- Test base Stable Diffusion:

  ```bash
  python basic_sd_test.py
  ```

  Generates sample images using the base model for comparison.

- Test integrated pipeline:

  ```bash
  python clip_sd_pipeline.py
  ```

  Tests the combination of fine-tuned CLIP and Stable Diffusion models.
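The exact integration is defined in `clip_sd_pipeline.py`. Purely as an illustration of one way a fine-tuned CLIP model can steer img2img generation, the sketch below re-ranks several img2img candidates by CLIP score; the base model ID, file paths, checkpoint format, and the re-ranking strategy itself are assumptions, not necessarily what this repository does.

```python
# Illustrative combination of fine-tuned CLIP + img2img Stable Diffusion (re-ranking).
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline
from transformers import CLIPModel, CLIPProcessor

device = "cuda" if torch.cuda.is_available() else "cpu"
pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # assumed base model
    torch_dtype=torch.float16 if device == "cuda" else torch.float32,
).to(device)

clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").to(device)
# Assumes the checkpoint stores a plain state_dict for this CLIP architecture.
clip.load_state_dict(torch.load("clip_checkpoint_epoch_20.pt", map_location=device))
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

prompt = "pencil sketch of a middle-aged man with a square jaw and short hair"
init_image = Image.open("data/sketches/example.png").convert("RGB")  # hypothetical file

# Generate several candidates, keep the one the fine-tuned CLIP scores highest.
candidates = pipe(prompt=prompt, image=init_image, strength=0.6,
                  num_images_per_prompt=4).images
inputs = processor(text=[prompt], images=candidates,
                   return_tensors="pt", padding=True).to(device)
with torch.no_grad():
    scores = clip(**inputs).logits_per_image.squeeze(1)  # one score per candidate
best = candidates[int(scores.argmax())]
best.save("generated_sketches/best_candidate.png")  # hypothetical output name
```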
Run the test suite in the `tests/` directory:
Each test is provided as a standalone Jupyter Notebook. To run a test, simply execute the cells in the corresponding notebook:
- `img2clip_finetuned.ipynb`: Executes the fine-tuned CLIP-to-img2img Stable Diffusion pipeline on a single image.
- `img2imgclip.ipynb`: Executes the CLIP-to-img2img Stable Diffusion pipeline on a single image.
- `img2imgtest.ipynb`: Executes the baseline img2img Stable Diffusion pipeline on a single image.
- `iteration_img2clip.ipynb`: Runs the CLIP-to-img2img Stable Diffusion pipeline with iterative steps and plots metrics (SSIM, PSNR, CLIP Score, LPIPS) across iterations.
- `iteration_stable.ipynb`: Runs the baseline img2img Stable Diffusion pipeline with iterative steps and plots metrics across iterations.
- `iteration_finetuned_img2clip.ipynb`: Runs the fine-tuned CLIP-to-img2img Stable Diffusion pipeline with iterative steps and plots metrics across iterations.
- `final_metrics`: Runs all iterative tests and computes performance comparisons across models.
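For reference, the per-iteration metrics listed above (SSIM, PSNR, CLIP Score, LPIPS) can be computed with `torchmetrics`; the sketch below assumes that package is installed and that images are `(1, 3, H, W)` tensors in `[0, 1]`. The notebooks contain the actual evaluation and plotting code.

```python
# Sketch of how the four image metrics could be computed with torchmetrics.
import torch
from torchmetrics.image import StructuralSimilarityIndexMeasure, PeakSignalNoiseRatio
from torchmetrics.image.lpip import LearnedPerceptualImagePatchSimilarity
from torchmetrics.multimodal.clip_score import CLIPScore

ssim = StructuralSimilarityIndexMeasure(data_range=1.0)
psnr = PeakSignalNoiseRatio(data_range=1.0)
lpips = LearnedPerceptualImagePatchSimilarity(net_type="vgg")
clip_score = CLIPScore(model_name_or_path="openai/clip-vit-base-patch16")

def evaluate(generated: torch.Tensor, reference: torch.Tensor, prompt: str) -> dict:
    """Both image tensors are (1, 3, H, W) with values in [0, 1]."""
    return {
        "ssim": ssim(generated, reference).item(),
        "psnr": psnr(generated, reference).item(),
        # LPIPS expects inputs scaled to [-1, 1]
        "lpips": lpips(generated * 2 - 1, reference * 2 - 1).item(),
        # CLIPScore expects uint8 images in [0, 255]
        "clip_score": clip_score((generated * 255).to(torch.uint8), prompt).item(),
    }
```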
The following files were created during initial development but were not used in the final implementation:
- `app/` directory: Contains a Flask web application that was initially planned for deployment
- `run.py`: Web application entry point
- `tables.py`: Database schema definitions
- `test_self_attention.py`: Standalone test for self-attention mechanism
```
.
├── app/                  # Unused web application
├── data/                 # Dataset and descriptions
│   ├── sketches/         # CUHK Face Sketch FERET Dataset
│   └── descriptions/     # Generated text descriptions
├── tests/                # Test files
├── ablation_study_*/     # Ablation study results
├── generated_sketches/   # Model outputs
└── requirements.txt      # Project dependencies
```
- Ablation study results: `ablation_study_*/`
- Generated samples: `generated_sketches/`
- Training metrics: `training_plots/`
- CLIP checkpoint: `clip_checkpoint_epoch_20.pt`
See `requirements.txt` for the complete list of dependencies. Key requirements:
- PyTorch
- Diffusers
- Transformers
- OpenAI API (for description generation)
- NumPy
- Matplotlib