VisionaryAI (formerly GeminiFusion)

VisionaryAI is a Streamlit web application that uses Gemini Pro, Gemini Pro Vision, DALL-E 3, and Stable Diffusion XL to provide three features: Chatbot Interaction, Image Captioning, and Text-to-Image Generation.

Features

ChatBot: Engage in real-time conversations with the AI, powered by the Gemini Pro model.
Image Captioning: Generate descriptive captions for your images using the Gemini Pro Vision model.
Text to Image: Generate images using either DALL-E 3 or Stable Diffusion XL.

Installation

Clone the repository:

git clone https://github.com/Abhrankan-Chakrabarti/GeminiFusion.git
cd GeminiFusion

Create a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Install dependencies:
```
pip install -r requirements.txt
```
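To confirm the install worked before launching the app, a quick check like the one below can help. This is a minimal sketch and not part of the repository: the `missing_modules` helper is hypothetical, and `streamlit` is the only package name taken from this README, so adjust the list to match what `requirements.txt` actually contains.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # "streamlit" comes from this README; any other entries are assumptions.
    missing = missing_modules(["streamlit"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All checked packages are importable.")
```

If anything is reported missing, re-run the `pip install` command inside the activated virtual environment.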
Set up environment variables:
- Create a .env file in the root directory.
- Add your Google API key:
```
api_key=YOUR_GOOGLE_API_KEY
```
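For reference, the snippet below sketches one way a Python script could read that `api_key` entry using only the standard library. It is an illustration under stated assumptions, not the code in `app.py`, which may well use a different loader (for example a dotenv package). The helper names `read_env_file` and `get_api_key` are hypothetical; only the `api_key` variable name comes from this README.

```python
import os
from pathlib import Path


def read_env_file(path=".env"):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_api_key(path=".env"):
    """Prefer an exported `api_key` variable, then fall back to the file."""
    key = os.environ.get("api_key")
    if not key and Path(path).exists():
        key = read_env_file(path).get("api_key")
    if not key:
        raise RuntimeError("api_key is not set; add it to .env")
    return key
```

Failing early with a clear error when the key is absent tends to be easier to debug than an authentication failure later in a request.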

Usage

Run the application:
```
streamlit run app.py
```
Once the app is running, choose a feature:
- ChatBot: Navigate to the ChatBot section to start a conversation with the AI.
- Image Captioning: Upload an image and enter a prompt to generate a caption.
- Text to Image: Enter a text prompt to generate images using either DALL-E 3 or Stable Diffusion XL.

Technology Stack

Python
Streamlit
Google Gemini Pro
Google Gemini Pro Vision
DALL-E 3
Stable Diffusion XL

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines and CODE_OF_CONDUCT.md for community standards.

License

This project is licensed under the MIT License. See the LICENSE file for details.
