Skip to content

A sleek, modern iOS app that uses Google's Gemini AI to analyze images and engage in natural conversations. Take a photo or select an image, and watch as Gemini describes what it sees and answers your questions about it.

License

Notifications You must be signed in to change notification settings

SohanRaidev/Gemini-Wrapper-SwiftUI

Repository files navigation

Gemini Vision Wrapper for SwiftUI

A modern SwiftUI application that leverages Google's Gemini AI to analyze images and engage in natural conversations.

Screenshots

Overview

This app creates an interactive AI assistant that can analyze images and respond to text inputs. Simply take a photo or select an image from your photo library, and ask Gemini questions about what it sees.

Features

  • 📱 Clean, modern SwiftUI interface
  • 📸 Integrated camera for taking photos
  • 🖼️ Photo library integration for selecting images
  • 🧠 Gemini AI for intelligent image analysis
  • 💬 Persistent chat history
  • 🔄 Robust error handling with automatic retries

Getting Started

Prerequisites

Setup

  1. Clone the repository:
git clone https://github.com/SohanRaidev/Gemini-Vision-SwiftUI.git
cd Gemini-Vision-SwiftUI
  1. Open the project in Xcode:
open SwiftUI-AI-Wrapper.xcodeproj
  1. Add your Gemini API key:

    • Open SwiftUI-AI-Wrapper/Models/ChatModel.swift
    • Replace <YOUR_API_KEY> with your actual Gemini API key
  2. Build and run the app on your device or simulator

How to Use

  1. Launch the app
  2. Take a photo with the camera or select an image from your photo library
  3. Gemini will automatically analyze what's in the image
  4. Continue the conversation by asking follow-up questions
  5. Access your chat history through the history button

Requirements

The app requires the following permissions:

  • Camera access (for taking photos)
  • Photo Library access (for selecting images)

Customization

You can customize various aspects of the application:

  • Adjust the geminiModel property in ChatModel.swift to use different Gemini models
  • Modify UI appearance in the view files
  • Adjust image compression settings for different performance profiles

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

A sleek, modern iOS app that uses Google's Gemini AI to analyze images and engage in natural conversations. Take a photo or select an image, and watch as Gemini describes what it sees and answers your questions about it.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages