
supporting fine tuning for OAI #441

Closed
Intex32 opened this issue Sep 21, 2023 · 4 comments
Intex32 (Member) commented Sep 21, 2023

This issue contains multiple subtasks:

  • server: adapt query endpoint to accept custom model name (depends on [DRAFT] Server conversations #413)
  • xef-core: implement an API for querying fine-tuned models
  • clone OAI endpoints for fine tuning (to later intercept and collect metrics)

Fine-tuning guide: https://platform.openai.com/docs/guides/fine-tuning
Web API: https://platform.openai.com/docs/api-reference/fine-tuning/create
Cost estimation notebook: https://colab.research.google.com/drive/11Yl7cQ3vzYZzrzRaiQEH9Y9gAfn5-Pe6?usp=sharing
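
For reference, creating a job through that Web API is a single authenticated POST. A minimal sketch with the plain JDK HTTP client, independent of any xef integration; the file ID and model name are placeholders:

import java.net.URI
import java.net.http.HttpClient
import java.net.http.HttpRequest
import java.net.http.HttpResponse

// Sketch of the documented create call: POST /v1/fine_tuning/jobs.
// The training file must already be uploaded via the files endpoint;
// trainingFileId is a placeholder for its ID (e.g. "file-abc123").
fun createFineTuningJob(apiKey: String, trainingFileId: String): String {
    val body = """{"training_file": "$trainingFileId", "model": "gpt-3.5-turbo"}"""
    val request = HttpRequest.newBuilder()
        .uri(URI.create("https://api.openai.com/v1/fine_tuning/jobs"))
        .header("Authorization", "Bearer $apiKey")
        .header("Content-Type", "application/json")
        .POST(HttpRequest.BodyPublishers.ofString(body))
        .build()
    val response = HttpClient.newHttpClient()
        .send(request, HttpResponse.BodyHandlers.ofString())
    return response.body() // JSON describing the new job (id, status, ...)
}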

My experience with fine-tuning:

  • general high-level knowledge of ML applies (accuracy, learning rate)
  • overfitting is possible; after a couple of epochs there was no more progress, as training accuracy was already 1.0
  • basic validation happens on file upload (it complained about a double line break instead of a single one; see the sample training line below)
  • the actual validation is performed when the training job is started (e.g. too few lines)
  • training the ron-v2 model with 12 epochs and 50 lines took about half an hour
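
For context on the validation points above: training data is uploaded as JSONL with exactly one example per line (single line breaks between examples), which is why the upload check rejected double line breaks. A sample line in the chat format from the fine-tuning guide; the message contents are invented for illustration:

{"messages": [{"role": "system", "content": "You are Ron, a concise assistant."}, {"role": "user", "content": "Summarize this ticket."}, {"role": "assistant", "content": "The ticket asks for fine-tuning support."}]}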
Intex32 (Member, Author) commented Sep 26, 2023

As this issue partly depends on #413, querying fine-tuned models from the xef-server is not supported yet. Currently, all requests are streamed from OpenAI without going through the xef-core logic.

Resolving a model based on its name and its base model's name might look like this later:

fun spawnCustomModel(provider: Provider, baseModelName: String, fineTunedModelName: String): LLM {
    // look up the base model in the provider's catalogue of supported models
    val baseModel = when (provider) {
        Provider.OPENAI ->
            com.xebia.functional.xef.conversation.llm.openai.OpenAI()
                .supportedModels()
                .find { it.modelType.name == baseModelName }
        else -> TODO()
    } ?: error("base model $baseModelName not found")
    return if (baseModel is FineTuneable) baseModel.fineTuned(fineTunedModelName)
    else error("model $baseModelName does not support fine-tuning")
    // we cannot know at this point whether the fine-tuned model exists
}
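
A hypothetical call site for the sketch above; both model names are placeholders (OpenAI fine-tuned model IDs follow the ft:<base>:<org>::<id> pattern):

val model: LLM = spawnCustomModel(
    provider = Provider.OPENAI,
    baseModelName = "gpt-3.5-turbo",
    fineTunedModelName = "ft:gpt-3.5-turbo:my-org::abc123" // placeholder ID
)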

raulraja (Contributor) commented
This issue should not depend on #415; we are not following that approach for now. We need to add the fine-tuning endpoint to the Xef server, following what main is doing now, and forward directly to OpenAI. I am happy to discuss this online in Slack if you need further clarification.
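
A minimal sketch of such a pass-through endpoint, assuming a Ktor-based server and client (the route path mirrors OpenAI's; error handling, streaming, and client setup are simplified):

import io.ktor.client.HttpClient
import io.ktor.client.request.header
import io.ktor.client.request.post
import io.ktor.client.request.setBody
import io.ktor.client.statement.bodyAsText
import io.ktor.http.ContentType
import io.ktor.server.application.call
import io.ktor.server.request.receiveText
import io.ktor.server.response.respondText
import io.ktor.server.routing.Route
import io.ktor.server.routing.post

// Forward the fine-tuning request unchanged to OpenAI and relay the response.
fun Route.fineTuningRoutes(client: HttpClient, apiKey: String) {
    post("/v1/fine_tuning/jobs") {
        val upstream = client.post("https://api.openai.com/v1/fine_tuning/jobs") {
            header("Authorization", "Bearer $apiKey")
            header("Content-Type", "application/json")
            setBody(call.receiveText())
        }
        call.respondText(upstream.bodyAsText(), ContentType.Application.Json)
    }
}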

Intex32 linked a pull request Sep 28, 2023 that will close this issue
Intex32 added a commit that referenced this issue Oct 3, 2023
* query fine tuned models (from branch #441-fine-tuning-oai)

* spotless

* clean build, make tests and mocks compile

* changes according to pr comments

---------

Co-authored-by: José Carlos Montañez <[email protected]>
Intex32 (Member, Author) commented Oct 4, 2023

Aallam just closed my issue (aallam/openai-kotlin#236) about implementing the new fine-tuning API, so a new release is foreseeably coming soon. That would let us implement the actual fine-tuning more easily now. But I question whether this is actually of any value at this point; you @raulraja have to decide what has priority now. For later, I can imagine capturing the metrics (accuracy etc.) that OAI provides to us during training.
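
On capturing those metrics: the API exposes per-step training events via the documented events endpoint (GET /v1/fine_tuning/jobs/{id}/events), and the finished job references result files containing metrics such as train_loss. A rough sketch with the plain JDK client; the job ID is a placeholder:

import java.net.URI
import java.net.http.HttpClient
import java.net.http.HttpRequest
import java.net.http.HttpResponse

// List the events of a fine-tuning job; training metrics show up in the
// event messages and in the job's result files.
fun listFineTuningEvents(apiKey: String, jobId: String): String {
    val request = HttpRequest.newBuilder()
        .uri(URI.create("https://api.openai.com/v1/fine_tuning/jobs/$jobId/events"))
        .header("Authorization", "Bearer $apiKey")
        .GET()
        .build()
    return HttpClient.newHttpClient()
        .send(request, HttpResponse.BodyHandlers.ofString())
        .body()
}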

Intex32 (Member, Author) commented Oct 4, 2023

Intex32 closed this as completed Nov 24, 2023