PaliGemma Model Training and Inference

This repository provides scripts for training and performing inference using the PaliGemma model. The model is designed for visual question answering (VQA) tasks. The scripts were made by our team "Attack On Python".

Task

Given the images of online products on Amazon with various measurements of physical quantities (e.g., height, width, weight) specified, extract the numerical values corresponding to the physical quantities given as input.

Results

Our solution achieved a maximum F1-Score of 0.661 and secured a 26th position finish (link) among over 2000 participating teams.

Requirements

Ensure you have the following dependencies installed:

Install the dependencies by running:

pip install -r requirements.txt

Training

Download the images in a directory by passing the list of links from train.csv to util.download_images(<list of link of images>)
Add the path of the images directory in the data_dir and the path to train.csv in csv_filename
Run the PaliGemma_Training_AttackOnPython.py file after setting up the NUM_EPOCHS and BATCH_SIZE hyperparameter

Testing

Download the images in a directory by passing the list of links from test.csv to util.download_images(<list of link of images>)
Add the path of the images directory in the data_dir and the path to train.csv in metadata_df's pd.read_csv(<test.csv path>)
Run the PaliGemma_Training_AttackOnPython.py file after setting up the batch_size and test_id hyperparameter.

Credits

A shout-out to my amazing friends Anurakt, Vibhu and Shivanshu for the great work!

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Report		Report
data		data
test_splits		test_splits
.gitignore		.gitignore
Final_Certificate.pdf		Final_Certificate.pdf
PaliGemma_Inference_AttackOnPython.py		PaliGemma_Inference_AttackOnPython.py
PaliGemma_Training_AttackOnPython.py		PaliGemma_Training_AttackOnPython.py
README.md		README.md
constants.py		constants.py
requirements.txt		requirements.txt
sanity.py		sanity.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PaliGemma Model Training and Inference

Task

Results

Table of Contents

Requirements

Training

Testing

Credits

About

Releases

Packages

Languages

jena-shreyas/Amazon-ML-Challenge

Folders and files

Latest commit

History

Repository files navigation

PaliGemma Model Training and Inference

Task

Results

Table of Contents

Requirements

Training

Testing

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages