You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The following error occurred while running cell 10 in 6. Tune language model using PPO with our preference model.
After adding __init__.py to /content/trlx/examples/summarize_rlhf/reward_model/, I still get the same error.
How can I fix it?
10 import torch
11 from datasets import load_dataset
---> 12 from reward_model.reward_model import GPTRewardModel
13 from tqdm import tqdm
14 from transformers import AutoTokenizer
ModuleNotFoundError: No module named 'reward_model.reward_model'; 'reward_model' is not a package
The text was updated successfully, but these errors were encountered:
The following error occurred while running cell 10 in 6. Tune language model using PPO with our preference model.
After adding
__init__.py
to/content/trlx/examples/summarize_rlhf/reward_model/
, I still get the same error.How can I fix it?
The text was updated successfully, but these errors were encountered: