Image Extension Using Stable Diffusion

This project utilizes the Stable Diffusion model to extend images based on user-provided text prompts and specified directions (e.g., left, right, top, bottom). The pipeline processes the input image and generates an extended version according to the directions specified.

Features

Extend images based on a text prompt.
Specify one or more directions for the image extension: left, right, top, bottom.
Simple and interactive user interface via Gradio.

Usage Guide

Open the Google Colab notebook.
Run the setup code and run app code cells in the notebook.
After execution, click on the Gradio link displayed in the notebook output.
The Gradio link opens a live website where you can:
1. Upload an Image: Provide an image you want to extend.
2. Enter a Prompt: Describe the desired extension (e.g., "Lake and grass fields").
3. Select Directions: Choose one or more directions (left, right, top, bottom) for the extension.
4. Generate: Click the button to process and view the extended image.

How It Works

Input:
- An image to be extended.
- A text prompt describing how the extension should look.
- The direction(s) in which the image should be extended.
- No. of inference images to be generated.
Processing:
- The given image is masked w.r.t. the given direction.
- The masked image and prompt are passed to the Stable Diffusion model.
- The model generates an extended version of the image based on the given prompt and direction(s).
Output:
- The extended image is displayed on the live website.

Example

Input:
- Image :
- Prompt : Lake and grass fields
- Directions : left, right
Output:

Input:
- Image :
- Prompt : Cafe and cozy outdoor
- Directions : top
Output:
Input:
- Image :
- Prompt : coffee shop
- Directions : top
Output:

Technical Details

Model: Stable Diffusion, fine-tuned for image extension tasks.
Framework: Gradio for building the interactive interface.
Deployment: The system runs entirely within Google Colab for easy setup and execution.

Project Link

Google Colab Notebook

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
models/pretrained		models/pretrained
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
main.py		main.py
mask_image.py		mask_image.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Extension Using Stable Diffusion

Features

Usage Guide

How It Works

Example

Technical Details

Project Link

License

About

Contributors 2

Languages

License

OSSML/Zero_shot_Image_Synthesis

Folders and files

Latest commit

History

Repository files navigation

Image Extension Using Stable Diffusion

Features

Usage Guide

How It Works

Example

Technical Details

Project Link

License

About

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages