
Defining arguments names to avoid issues with positional args #446

Open · wants to merge 5 commits into main
Conversation


@pedrogengo pedrogengo commented Nov 27, 2023

Fixes #418, #291, #355

Some models that don't use token_type_ids were producing different results when run with ONNX. This happened because the argument names were not stated explicitly in the call, so the model bound the positionally passed tensors to the wrong arguments.
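The failure mode can be illustrated with a toy forward signature (hypothetical names, chosen only to mirror the usual transformers ordering of `input_ids, token_type_ids, attention_mask`):

```python
# Toy illustration of the positional-argument bug (hypothetical signature).
def forward(input_ids, token_type_ids=None, attention_mask=None):
    return {
        "input_ids": input_ids,
        "token_type_ids": token_type_ids,
        "attention_mask": attention_mask,
    }

# Positional call: the attention mask is silently bound to token_type_ids.
buggy = forward([101, 7592, 102], [1, 1, 1])
assert buggy["token_type_ids"] == [1, 1, 1]
assert buggy["attention_mask"] is None

# Keyword call: every tensor lands on the intended argument.
fixed = forward([101, 7592, 102], attention_mask=[1, 1, 1])
assert fixed["attention_mask"] == [1, 1, 1]
assert fixed["token_type_ids"] is None
```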

@tomaarsen

@pedrogengo
Author

@tomaarsen should we include the softmax operation inside the ONNX model, or is returning the logits the desired behavior?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@tomaarsen
Member

I quite like the simplicity of this fix, and I do hope that this indeed resolves the problem. However, the tests seem to be problematic:
onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Invalid Feed Input Name:token_type_ids

Perhaps ONNX doesn't want token_type_ids to be passed, even if it is None? Then a solution might be:

inputs = {
    "input_ids": input_ids,
    "attention_mask": attention_mask,
}
if token_type_ids is not None:
    inputs["token_type_ids"] = token_type_ids
hidden_states = self.model_body(**inputs)
  • Tom Aarsen

@pedrogengo
Author

Don't run the CI yet, I'm still running some tests

@pedrogengo
Author

@tomaarsen the issue with the tests is a bit harder: for models that use token_type_ids, the tests run like a charm. However, for models that don't use token_type_ids, ONNX generates the graph without token_type_ids as an input. I tried your suggestion, but with no success.

It is also an issue in transformers: https://discuss.huggingface.co/t/deberta-v2-onnx-with-pipeline-does-not-work/35748

What I'm trying to do is force token_type_ids to appear in the graph; let's see whether it works.

Do you have any other suggestion?
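One workaround for the `Invalid Feed Input Name` error above is to restrict the feed dict to whatever inputs actually survived the export. This is a sketch with a hypothetical helper, not code from the PR; with onnxruntime, `graph_input_names` would come from `{i.name for i in session.get_inputs()}`:

```python
def build_feed(graph_input_names, input_ids, attention_mask, token_type_ids=None):
    """Feed only the input names the exported ONNX graph declares.

    Passing a name the graph doesn't declare raises:
    INVALID_ARGUMENT : Invalid Feed Input Name:token_type_ids
    """
    feed = {"input_ids": input_ids, "attention_mask": attention_mask}
    # Drop token_type_ids when the export pruned it from the graph.
    if token_type_ids is not None and "token_type_ids" in graph_input_names:
        feed["token_type_ids"] = token_type_ids
    return feed
```

This keeps a single call site working for both kinds of exports, at the cost of inspecting the session's inputs at runtime.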

@tomaarsen
Member

Hmm, that is tricky. I'm not extremely familiar with the inner workings of ONNX, so I don't have great suggestions I'm afraid.

  • Tom Aarsen

@pedrogengo
Author

I will try more things today, but for now I'm showing a message when the model doesn't use token_type_ids, just to validate that the whole flow works as expected.

@pedrogengo
Author

pedrogengo commented Nov 28, 2023

@tomaarsen I found a way to keep the same interface we use today and force token_type_ids to appear. It can't be an optional argument: once the ONNX graph is defined, you must fill all the inputs used during the export, which means we always need to pass token_type_ids even if the model doesn't use it.

On one hand it may look like "Why keep an argument that some models don't use?", but the answer is generalization. Keeping the parameter in all cases makes it possible to have export code that is general enough for all models.

Let me know what you think of the solution. Maybe in the future we can work on better interfaces for the export, but I tried to keep this PR as simple as possible.
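The idea of always passing token_type_ids can be sketched as follows (a hypothetical helper, not the PR's actual code): if the tokenizer didn't produce token_type_ids, substitute zeros shaped like input_ids, so every exported graph declares the same three inputs and the runtime feed never has to vary per model:

```python
def ensure_token_type_ids(encoded):
    """Guarantee a token_type_ids entry in a tokenizer-style batch dict.

    Models that ignore token_type_ids still receive an all-zeros tensor,
    so the exported ONNX graph always keeps the same three inputs.
    """
    if "token_type_ids" not in encoded:
        # Zeros with the same shape as input_ids (nested lists here;
        # in practice this would be a tensor of the same dtype/device).
        encoded["token_type_ids"] = [[0] * len(row) for row in encoded["input_ids"]]
    return encoded
```

Since all-zeros is what tokenizers emit for single-segment inputs anyway, models that do consume token_type_ids are unaffected.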

@pedrogengo
Author

@tomaarsen can we merge this? If yes, I will resolve the conflict.

@tomaarsen
Member

Apologies for not responding sooner, I've been a bit busy and ONNX wasn't very high on my TODO list. Do you suspect that this PR indeed fixes the reported discrepancies? E.g. does the script from #291 behave as expected now? The fix seems so odd to me, haha.

Also, models with a differentiable head are trained a bit differently in SetFit v1.0.0, i.e. no more freeze and unfreeze calls, and only calling trainer.train() once.

  • Tom Aarsen

@pedrogengo
Author

I will run the script with this branch; in my own tests I was also seeing a discrepancy between the results, and after the fix the two runs returned the same scores.

Give me the weekend to see the code for v1.0.0 and I can answer here.

Development

Successfully merging this pull request may close these issues.

Inconsistent ONNX Export with Differentiable Head
3 participants