Add Glaive conversation format support #1365

brianfitzgerald · 2024-03-06T03:00:40Z

Adds support for the Glaive function calling dataset. This dataset has 2 columns, system and chat; and an additional tool role in the conversation. This role contains the output from a tool call.

In this PR we add support for the Glaive dataset, which we convert to the ShareGPT format; tool calls are masked, and consecutive tool calls are merged to one message.

How has this been tested?

I SFT trained a TinyLlama lora with the config below.

base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
model_type: LlamaForCausalLM
tokenizer_type: LlamaTokenizer
is_llama_derived_model: true

load_in_8bit: true
load_in_4bit: false
strict: false

datasets:
  - path: glaiveai/glaive-function-calling-v2
    type: sharegpt.load_glaive
    conversation: chatml_glaive

winglian · 2024-03-06T15:08:27Z

@brianfitzgerald thanks for this PR. the ability to train models for function calling is very much needed. Currently, this PR breaks some of the existing tests for sharegpt datasets. I think if we can get the changes fixed so as not to break existing functionality as well as add new data fixtures and tests for the new functionality, we can get this merged. lmk if you'd like some help with this.

ehartford · 2024-03-06T15:24:21Z

This is an exciting development.
I will use this to add glaive to all my models

src/axolotl/prompt_strategies/sharegpt.py

Co-authored-by: Wing Lian <[email protected]>

winglian

thank for putting this all together! 🚀

…brianfitzgerald/axolotl into glaive-function-calling-support

hasan9090 · 2024-07-18T12:36:19Z

Thanks for the updated functionality. I would like to similarly treat any other tool handling dataset like for example this version of glaive lilacai/glaive-function-calling-v2-sharegpt , which already is in sharegpt, or similarly any other custom dataset of that format. When trying to do so and orientating at the above mentioned configs, I cannot make it work to recognize the tool role as 3rd role in axolotl and don't really know if it is possible and how the config has to look like.
From the above , I think setting type also to sharegpt.load_glaive would not work as the dataset is already in sharegpt and this function seems to convert it first to that format? But I do think I need to set the conversation to "chatml_glaive" since this is handling the tool role? When using the following config it is not working for me:

datasets:
-path: lilacai/glaive-function-calling-v2-sharegpt
type: sharegpt
conversation: chatml_glaive
field_human: human
field_model: gpt

It would be nice if someone could help.
Thanks Hasan

* Add Glaive conversation format support * fix black formatting errors * Fix black and pylint formatting errors * only set role_key_tool if provided in the dataset constructor * Update src/axolotl/prompt_strategies/sharegpt.py Co-authored-by: Wing Lian <[email protected]> * sharegpt test * tokenizer test * fix formatting --------- Co-authored-by: Wing Lian <[email protected]>

brianfitzgerald and others added 2 commits March 6, 2024 02:59

Add Glaive conversation format support

a37f5ce

Merge branch 'main' into glaive-function-calling-support

9f95656

brianfitzgerald and others added 4 commits March 6, 2024 18:16

Merge branch 'main' into glaive-function-calling-support

3ce96da

fix black formatting errors

0f06017

Fix black and pylint formatting errors

2da71da

only set role_key_tool if provided in the dataset constructor

91e2c63

winglian reviewed Mar 7, 2024

View reviewed changes

src/axolotl/prompt_strategies/sharegpt.py Outdated Show resolved Hide resolved

brianfitzgerald and others added 4 commits March 8, 2024 14:56

Update src/axolotl/prompt_strategies/sharegpt.py

f423d39

Co-authored-by: Wing Lian <[email protected]>

sharegpt test

9de68a9

tokenizer test

5d005a2

Merge branch 'main' into glaive-function-calling-support

4148575

winglian approved these changes Mar 8, 2024

View reviewed changes

brianfitzgerald added 2 commits March 8, 2024 22:46

fix formatting

e68202f

Merge branch 'glaive-function-calling-support' of https://github.com/…

732f0f8

…brianfitzgerald/axolotl into glaive-function-calling-support

winglian merged commit b7d8a7d into axolotl-ai-cloud:main Mar 11, 2024
6 checks passed

brianfitzgerald deleted the glaive-function-calling-support branch March 12, 2024 01:00

hasan9090 mentioned this pull request Jul 17, 2024

improve tool handling roles #1587

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Glaive conversation format support #1365

Add Glaive conversation format support #1365

brianfitzgerald commented Mar 6, 2024 •

edited

Loading

winglian commented Mar 6, 2024

ehartford commented Mar 6, 2024

winglian left a comment

hasan9090 commented Jul 18, 2024 •

edited

Loading

Add Glaive conversation format support #1365

Add Glaive conversation format support #1365

Conversation

brianfitzgerald commented Mar 6, 2024 • edited Loading

How has this been tested?

winglian commented Mar 6, 2024

ehartford commented Mar 6, 2024

winglian left a comment

Choose a reason for hiding this comment

hasan9090 commented Jul 18, 2024 • edited Loading

brianfitzgerald commented Mar 6, 2024 •

edited

Loading

hasan9090 commented Jul 18, 2024 •

edited

Loading