Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Mixtral-8x22B-v0.1 model support #286

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

VinitP1102
Copy link

Summary

This PR introduces the integration of the Mixtral-8x22B model into the codebase. Specifically, the following changes have been made:

  • Added the Mixtral model in mixtral.py to support the 8x22B architecture.
  • Introduced a new engine for the Mixtral model in mixtral_engine.py
  • Updated __init__.py files in the models and engines directories to register the new Mixtral model.
  • Modified generation_config.yaml to include parameters for the Mixtral model's generation tasks.
  • Updated finetuning_config.yaml to configure Mixtral model-specific parameters for finetuning.
  • Updated documentation, including README.md and supported_models.md, to reflect the addition of the Mixtral-8x22B model, with its identifier key as "mixtral".

Checklist

  • Tested the integration of the Mixtral model with both generation and finetuning tasks.
  • Updated documentation files to reflect the changes.

Additional Information

  • A new example script, mixtral.py, has been added in the examples directory. This script demonstrates how to use the Mixtral model with xTuring and provides instructions for testing the model.
  • These changes enable the Mixtral-8x22B model's functionality for both generation and finetuning tasks within the xTuring project. The model's parameters have been incorporated into the respective configuration files.

- Added Mixtral engine to support the model.
- Added example of Mixtral model.
- Edited `README.md` to display the latest added model.
- Edited config files for Mixtral model.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant