GitHub - seldonian-toolkit/RoBERTa_reward_model: This repository contains the datasets and model code for running the Seldonian toolkit to ensure fairness of the RoBERTa hate speech reward model.

seldonian-toolkit / RoBERTa_reward_model Public

Repository contents (14 commits):
data
larger_dataset
.gitignore
startup.bash
