SigPhi-Med is a lightweight vision-language model designed for biomedical applications. It leverages compact architectures while maintaining strong performance in visual question answering (VQA) and related multimodal tasks. This repository provides code for training, evaluation, and model deployment.
To set up the environment, follow the setup instructions from TinyLLaVA Factory and install the dependencies listed in the `requirements.txt` file.
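As a minimal sketch of such a setup, assuming a conda-based workflow (the environment name and Python version below are illustrative, not prescribed by this repository):

```bash
# Minimal environment sketch -- the environment name and Python version
# are illustrative; follow TinyLLaVA Factory's instructions for specifics.
conda create -n sigphi-med python=3.10 -y
conda activate sigphi-med

# Install the dependencies pinned by this repository.
pip install -r requirements.txt
```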
SigPhi-Med is trained and evaluated on the following biomedical multimodal datasets:
To train SigPhi-Med, modify the training script as needed:
- Edit the configuration in `scripts/train/train_phi.sh` (see the sketch after these steps).
- Run the training script:

```bash
sh scripts/train/train_phi.sh
```
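Which knobs `scripts/train/train_phi.sh` exposes depends on the script itself; as a sketch only, TinyLLaVA-style training scripts usually collect paths and hyperparameters in shell variables near the top. Every name and value below is hypothetical, so open the actual script to confirm before editing:

```bash
# Hypothetical configuration block inside scripts/train/train_phi.sh.
# Variable names and defaults are illustrative only.
DATA_PATH=/path/to/train_annotations.json   # multimodal training annotations
IMAGE_FOLDER=/path/to/images                # root directory of training images
OUTPUT_DIR=./checkpoints/sigphi-med         # where checkpoints are written
LEARNING_RATE=2e-5                          # tune for your dataset size
BATCH_SIZE=8                                # per-GPU batch size; lower if memory is tight
```

Adjusting values like these before launching is typically all that step one requires; the script then forwards them to the underlying training entry point.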
To evaluate the model on biomedical VQA tasks, use:
```bash
sh scripts/eval/VQA.sh
```
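The evaluation script typically needs to know which checkpoint to score and where the benchmark files live; the variables below are hypothetical placeholders, not the script's actual names:

```bash
# Hypothetical values to set inside scripts/eval/VQA.sh before running it.
MODEL_PATH=./checkpoints/sigphi-med     # trained checkpoint to evaluate
QUESTION_FILE=/path/to/vqa_test.json    # benchmark questions
ANSWERS_DIR=./eval_results              # where predictions are written
```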
We appreciate the contributions of the following projects:

- TinyLLaVA Factory