Commit

change the name to slicpairpm
Training and use of these models are similar to the approach in the SLiC paper, "SLiC-HF: Sequence Likelihood Calibration with Human Feedback".
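
For readers unfamiliar with the pairwise setup referenced here, a minimal sketch may help. It assumes, in the style of SLiC-HF's pairwise ranking model, that the model is shown a context with two candidate responses and emits the token "A" or "B", so the preference probability is a two-way softmax over the logits of those two tokens. The function name `prob_a_preferred`, the `temperature` parameter, and the example logits are hypothetical and are not taken from this commit.

```python
import math


def prob_a_preferred(logit_a: float, logit_b: float, temperature: float = 1.0) -> float:
    """Return P(response A is preferred) from the logits of tokens "A" and "B".

    Hypothetical helper: a two-way softmax with temperature, which is one
    common way to turn pairwise-ranking logits into a preference probability.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    a = logit_a / temperature
    b = logit_b / temperature
    # Subtract the max before exponentiating to avoid overflow.
    m = max(a, b)
    exp_a = math.exp(a - m)
    exp_b = math.exp(b - m)
    return exp_a / (exp_a + exp_b)


if __name__ == "__main__":
    # Made-up logits, for illustration only.
    p = prob_a_preferred(2.0, 0.5)
    print(f"P(A preferred) = {p:.3f}")
```

In a real pipeline the two logits would be read from the model's output at the final position for the token ids of "A" and "B"; that step is omitted here because it depends on the tokenizer and prompt template.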
WeiXiongUST committed May 10, 2024
1 parent a374ec1 commit f727b8c
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion rewardbench/models/__init__.py
@@ -35,7 +35,7 @@
     build_starling_rm,
 )
 from .ziya import ZiyaPipeline
-from .pair_pm import PairPMPipeline
+from .slicpairpm import SlicPairPMPipeline
 
 # Please open a PR if you need to add more custom modeling code / utilize existing code for you model
 REWARD_MODEL_CONFIG = {
@@ -9,7 +9,7 @@
 from typing import List, Optional, Tuple, Union
 
 
-class PairPMPipeline:
+class SlicPairPMPipeline:
 
     def __init__(self, model_path):
         self.model = AutoModelForCausalLM.from_pretrained(model_path,).cuda() #, attn_implementation="flash_attention_2", torch_dtype=torch.bfloat16
