MCTS Sampler #2967

Merged: 63 commits merged on Feb 8, 2025

lxline (Collaborator) commented Jan 23, 2025

PR type

  • New Feature

PR information

Write the detailed information that belongs to this PR.

Experiment results

Paste your experiment results here (if needed).

        if not child.terminated:
            self.active_children.append(child)

    def collect(self):
Review comment (Collaborator):

Documentation:
sample.md
采样.md
强化微调.md
reinforce_fine_tuning.md
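
For readers unfamiliar with the pattern in the hunk under review, here is a minimal sketch of how a search-tree node might keep only non-terminated children in an `active_children` list. The `Node` class, its fields, `add_child`, and the body of `collect` are all assumptions made for illustration; the PR's actual implementation is not shown in this thread.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical tree node; not the class defined in this PR."""
    text: str
    terminated: bool = False
    children: List["Node"] = field(default_factory=list)
    active_children: List["Node"] = field(default_factory=list)

    def add_child(self, child: "Node") -> None:
        # Every child is recorded, but only unfinished ones stay expandable.
        self.children.append(child)
        if not child.terminated:
            self.active_children.append(child)

    def collect(self) -> List[str]:
        # Assumed behavior: gather the text of every terminated node in the subtree.
        finished = [self.text] if self.terminated else []
        for child in self.children:
            finished.extend(child.collect())
        return finished


root = Node("root")
root.add_child(Node("step A"))
root.add_child(Node("answer B", terminated=True))
```

Keeping a separate list of active children means later expansion steps can skip finished branches without re-checking each child's state.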

split_dataset(ds, device_count, dataset_dir)

ts = time.time()
client_sample(server_model, orm, dataset_dir, 0, device_count, output_dir)
Review comment (Collaborator):

Please also add a shell script for the training process, so that developers can reproduce it easily.
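
As a rough illustration of the split-then-sample flow in the snippet under review, the sketch below shards a list of records into one JSONL file per device and times the step. `shard_records`, its round-robin strategy, and the `shard_{rank}.jsonl` file naming are hypothetical stand-ins, not the PR's `split_dataset`, whose implementation is not shown in this thread.

```python
import json
import os
import tempfile
import time
from typing import Dict, List


def shard_records(records: List[Dict], device_count: int, dataset_dir: str) -> List[str]:
    """Write records round-robin into device_count JSONL files (hypothetical helper)."""
    os.makedirs(dataset_dir, exist_ok=True)
    paths = []
    for rank in range(device_count):
        path = os.path.join(dataset_dir, f"shard_{rank}.jsonl")
        with open(path, "w", encoding="utf-8") as f:
            # Slice with a stride so each device gets every device_count-th record.
            for record in records[rank::device_count]:
                f.write(json.dumps(record, ensure_ascii=False) + "\n")
        paths.append(path)
    return paths


records = [{"id": i, "query": f"question {i}"} for i in range(10)]
dataset_dir = tempfile.mkdtemp()

ts = time.time()
shard_paths = shard_records(records, device_count=4, dataset_dir=dataset_dir)
elapsed = time.time() - ts
print(f"wrote {len(shard_paths)} shards in {elapsed:.3f}s")
```

Each sampling worker could then read only the shard matching its own rank, which is one way to keep the per-device workloads independent.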

tastelikefeet merged commit 8f0630e into modelscope:main on Feb 8, 2025
2 checks passed
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Feb 10, 2025
…ple_multi_modal

* commit 'a4d751356b36917e8d0e21c9e170418d8f35bd09':
  fix windows url (modelscope#3041)
  MCTS Sampler (modelscope#2967)
  fix docs
  support mistralai/Mistral-Small-24B-Instruct-2501 (modelscope#3030)

# Conflicts:
#	swift/llm/sampling/utils.py
#	swift/plugin/orm.py
#	swift/plugin/prm.py
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Feb 10, 2025
…edding

* commit '646023dcae858f0fa388f7663a217790604339fa':
  Support sample multi modal models (modelscope#3048)
  fix windows url (modelscope#3041)
  MCTS Sampler (modelscope#2967)
  fix docs
  support mistralai/Mistral-Small-24B-Instruct-2501 (modelscope#3030)

# Conflicts:
#	swift/plugin/prm.py
4 participants