This repository contains the code for generating captions and answering questions on images using Mixtral-8x7B and LangChain. The code is written in Python 3.11.5
resp = agent("Explain this image: https://images.hindustantimes.com/auto/img/2023/07/23/1600x900/Tesla_Cybertruck_1688887534001_1690087911053.jpeg")
resp = agent('What is the color of the car?')
resp = agent('What is the brand of the car?')
resp = agent('How many peoples are these in the car?')
resp = agent('Explain the feature of this car?')