Huggingface /generate in path to model is not expected #1727

nastyabakhshieva · 2024-11-12T15:33:56Z

Bug description
I was experimenting on Spring AI and was looking for a free model for ChatApi implementation. Upon investigation I found I can use HuggingFace microsoft/Phi-3-mini-4k-instruct model. So, I added the following dependency in my project and started to investigate:

// POM version is 1.0.0-M3
implementation 'org.springframework.ai:spring-ai-huggingface-spring-boot-starter'

Similarly, I added in application yaml:

spring:
  ai:
    huggingface:
      chat:
        api-key: hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
        url: https://api-inference.huggingface.co/models/microsoft/Phi-3-mini-4k-instruct

Once done, I called HuggingfaceChatModel

Unfortunately, The following exception was received:

org.springframework.web.client.HttpClientErrorException$NotFound: 404 Not Found: "{"error":"Model microsoft/Phi-3-mini-4k-instruct/generate does not exist"}"

What I found is that, /generate postfix is appended in the code & hardcoded in org.springframework.ai.huggingface.api.TextGenerationInferenceApi#generateWithHttpInfo, line 83
Whereas, with postman everything works as expected as per hugging face documentation - https://huggingface.co/docs/api-inference/tasks/text-generation?code=curl

Environment
Java version - 21
Build tool - gradle 8.3
Spring AI version - 1.0.0-M3

Steps to reproduce
Just a simple ChatClient setup with hugging face model provided along with access key generated in hugging face account

Expected behavior
/generate is not hardcoded hence can put whatever url is needed

Minimal Complete Reproducible example
I can't put my access key here, but all you need to do is

Create a simple project
Add huggingface-ai dependency
Define hugging face url & access key in application properties file
Call the ChatClient

The text was updated successfully, but these errors were encountered:

jitokim · 2024-11-13T01:07:32Z

I removed the path from the generated client file using the openapi.json and ran it because of this issue.

It seems like I might need to get the latest version of the OpenAPI file from Hugging Face. (I haven’t fully analyzed it yet.)

As a temporary workaround, it works if remove the path in the URI component builder for the path.

- Update GenerateResponse content schema type to array at openapi.json - Use CompatGenerateRequest instead of GenerateRequest for the TextGenerationInference API Request Signed-off-by: jitokim <[email protected]>

jitokim · 2024-11-16T01:12:49Z

@nastyabakhshieva Hi, thanks for reporting issue.
The PR for Hugging Face has been merged, and also #1733
Now you can close this issue. 😃

jitokim mentioned this issue Nov 13, 2024

fix huggingface generate text #1733

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Huggingface /generate in path to model is not expected #1727

Huggingface /generate in path to model is not expected #1727

nastyabakhshieva commented Nov 12, 2024

jitokim commented Nov 13, 2024 •

edited

Loading

jitokim commented Nov 16, 2024

Huggingface /generate in path to model is not expected #1727

Huggingface /generate in path to model is not expected #1727

Comments

nastyabakhshieva commented Nov 12, 2024

jitokim commented Nov 13, 2024 • edited Loading

jitokim commented Nov 16, 2024

jitokim commented Nov 13, 2024 •

edited

Loading