From Fiction to Function: Leveraging Fine-Tuned Open-Domain Models for Character Mimicry

Overview

This repository contains a collection of experiments and analyses focused on the parameter-efficient fine-tuning of Large Language Models (LLMs) to mimic fictional characters. We use Spongebob as an example. The experiments are primarily conducted using Python and Jupyter Notebooks.

Repository Structure

Below is the structure of the repository with a brief description of each component:

README.md: This file, providing an overview and instructions for the repository.
data: Directory containing datasets used in the experiments.
falcon_lora.ipynb: Jupyter Notebook for the Falcon LoRa experiment.
gpt_finetune.ipynb: Jupyter Notebook for fine-tuning GPT models.
graphing: Folder containing scripts and notebooks for graphing and visualizing data with custom formatting.
outputs: Directory where output files from experiments are stored.
peft.ipynb: Jupyter Notebook for experimenting with PEFT (Performance-Efficient Fine-Tuning).
prefix_tuning.ipynb: Jupyter Notebook for prefix tuning experiments.
prompt_tuning.ipynb: Jupyter Notebook for prompt tuning experiments.
requirements.txt: File listing the necessary Python packages.
scripts: Miscellaneous scripts used in various parts of the project.
soft_prompt_tuning.ipynb: Jupyter Notebook for soft prompt tuning experiments.
temp.py: Temporary Python script for auxiliary purposes.
testing: Directory for testing scripts and experimental code.

Getting Started

Installation

To get started, clone the repository and install the required packages:

git clone [repository URL]
cd [repository name]
pip install -r requirements.txt

Running the Experiments

Each experiment is contained within its own Jupyter Notebook. To run an experiment:

Navigate to the notebook file (e.g., gpt_finetune.ipynb).
Open the notebook using Jupyter Lab or Jupyter Notebook.
Run the cells in the notebook sequentially.

These notebooks were run on a set of 8GB GPUs and should take a couple of hours to execute completely.

Validating the Results

After running the experiments, you can validate the results by comparing the generated graphs and outputs with those in the outputs directory. The graphs and output metrics should be comparable to ours with reasonable accuracy if the experiments are executed correctly. We generally saved our data and and re-graphed them using the scripts in the graphing library, but this was purely for formatting purposes. Given constraints, we were unable to save weights.

Support

For any queries or issues, please contact us at [email protected], or any of the other listed authors.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

From Fiction to Function: Leveraging Fine-Tuned Open-Domain Models for Character Mimicry

Overview

Repository Structure

Getting Started

Installation

Running the Experiments

Validating the Results

Support

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data		data
graphing		graphing
outputs/checkpoint-500		outputs/checkpoint-500
scripts		scripts
README.md		README.md
falcon_lora.ipynb		falcon_lora.ipynb
gpt_finetune.ipynb		gpt_finetune.ipynb
last_layer_finetuning.ipynb		last_layer_finetuning.ipynb
peft.ipynb		peft.ipynb
peft_rlrf.py		peft_rlrf.py
prefix_tuning.ipynb		prefix_tuning.ipynb
prompt_tuning.ipynb		prompt_tuning.ipynb
requirements.txt		requirements.txt
soft_prompt_tuning.ipynb		soft_prompt_tuning.ipynb
temp.py		temp.py

rqchao/cs182-project

Folders and files

Latest commit

History

Repository files navigation

From Fiction to Function: Leveraging Fine-Tuned Open-Domain Models for Character Mimicry

Overview

Repository Structure

Getting Started

Installation

Running the Experiments

Validating the Results

Support

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages