Skip to content

[Bot] Update Inference Providers documentation #1723

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 8 commits into from
May 5, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion docs/inference-providers/providers/fireworks-ai.md
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@ Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).

<InferenceSnippet
pipeline=text-generation
providersMapping={ {"fireworks-ai":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"accounts/fireworks/models/deepseek-v3-0324"} } }
providersMapping={ {"fireworks-ai":{"modelId":"Qwen/Qwen3-235B-A22B","providerModelId":"accounts/fireworks/models/qwen3-235b-a22b"} } }
conversational />


Expand Down
40 changes: 0 additions & 40 deletions docs/inference-providers/providers/hf-inference.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,36 +79,6 @@ Find out more about Feature Extraction [here](../tasks/feature_extraction).
/>


### Fill Mask

Find out more about Fill Mask [here](../tasks/fill_mask).

<InferenceSnippet
pipeline=fill-mask
providersMapping={ {"hf-inference":{"modelId":"google-bert/bert-base-uncased","providerModelId":"google-bert/bert-base-uncased"} } }
/>


### Image Classification

Find out more about Image Classification [here](../tasks/image_classification).

<InferenceSnippet
pipeline=image-classification
providersMapping={ {"hf-inference":{"modelId":"google/vit-base-patch16-224","providerModelId":"google/vit-base-patch16-224"} } }
/>


### Image Segmentation

Find out more about Image Segmentation [here](../tasks/image_segmentation).

<InferenceSnippet
pipeline=image-segmentation
providersMapping={ {"hf-inference":{"modelId":"jonathandinu/face-parsing","providerModelId":"jonathandinu/face-parsing"} } }
/>


### Text Classification

Find out more about Text Classification [here](../tasks/text_classification).
Expand Down Expand Up @@ -138,13 +108,3 @@ Find out more about Text To Image [here](../tasks/text_to_image).
providersMapping={ {"hf-inference":{"modelId":"black-forest-labs/FLUX.1-dev","providerModelId":"black-forest-labs/FLUX.1-dev"} } }
/>


### Translation

Find out more about Translation [here](../tasks/translation).

<InferenceSnippet
pipeline=translation
providersMapping={ {"hf-inference":{"modelId":"google-t5/t5-base","providerModelId":"google-t5/t5-base"} } }
/>

2 changes: 1 addition & 1 deletion docs/inference-providers/providers/novita.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).

<InferenceSnippet
pipeline=text-generation
providersMapping={ {"novita":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek/deepseek-v3-0324"} } }
providersMapping={ {"novita":{"modelId":"Qwen/Qwen3-235B-A22B","providerModelId":"qwen/qwen3-235b-a22b-fp8"} } }
conversational />


Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/audio-classification.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **audio-classification** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/chat-completion.md
Original file line number Diff line number Diff line change
Expand Up @@ -59,7 +59,7 @@ The API supports:

<InferenceSnippet
pipeline=text-generation
providersMapping={ {"cerebras":{"modelId":"meta-llama/Llama-3.3-70B-Instruct","providerModelId":"llama-3.3-70b"},"cohere":{"modelId":"CohereLabs/c4ai-command-a-03-2025","providerModelId":"command-a-03-2025"},"fireworks-ai":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"accounts/fireworks/models/deepseek-v3-0324"},"hf-inference":{"modelId":"Qwen/Qwen3-235B-A22B","providerModelId":"Qwen/Qwen3-235B-A22B"},"hyperbolic":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek-ai/DeepSeek-V3-0324"},"nebius":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek-ai/DeepSeek-V3-0324-fast"},"novita":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek/deepseek-v3-0324"},"sambanova":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"DeepSeek-V3-0324"},"together":{"modelId":"deepseek-ai/DeepSeek-R1","providerModelId":"deepseek-ai/DeepSeek-R1"}} }
providersMapping={ {"cerebras":{"modelId":"meta-llama/Llama-3.3-70B-Instruct","providerModelId":"llama-3.3-70b"},"cohere":{"modelId":"CohereLabs/c4ai-command-a-03-2025","providerModelId":"command-a-03-2025"},"fireworks-ai":{"modelId":"Qwen/Qwen3-235B-A22B","providerModelId":"accounts/fireworks/models/qwen3-235b-a22b"},"hf-inference":{"modelId":"Qwen/Qwen3-235B-A22B","providerModelId":"Qwen/Qwen3-235B-A22B"},"hyperbolic":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek-ai/DeepSeek-V3-0324"},"nebius":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"deepseek-ai/DeepSeek-V3-0324-fast"},"novita":{"modelId":"Qwen/Qwen3-235B-A22B","providerModelId":"qwen/qwen3-235b-a22b-fp8"},"sambanova":{"modelId":"deepseek-ai/DeepSeek-V3-0324","providerModelId":"DeepSeek-V3-0324"},"together":{"modelId":"deepseek-ai/DeepSeek-R1","providerModelId":"deepseek-ai/DeepSeek-R1"}} }
conversational />


Expand Down
5 changes: 1 addition & 4 deletions docs/inference-providers/tasks/fill-mask.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,10 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


<InferenceSnippet
pipeline=fill-mask
providersMapping={ {"hf-inference":{"modelId":"google-bert/bert-base-uncased","providerModelId":"google-bert/bert-base-uncased"}} }
/>
There are currently no snippet examples for the **fill-mask** task, as no providers support it yet.



Expand Down
6 changes: 1 addition & 5 deletions docs/inference-providers/tasks/image-classification.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,17 +24,13 @@ For more details about the `image-classification` task, check out its [dedicated

### Recommended models

- [google/vit-base-patch16-224](https://huggingface.co/google/vit-base-patch16-224): A strong image classification model.

Explore all available models and find the one that suits you best [here](https://huggingface.co/models?inference=warm&pipeline_tag=image-classification&sort=trending).

### Using the API


<InferenceSnippet
pipeline=image-classification
providersMapping={ {"hf-inference":{"modelId":"google/vit-base-patch16-224","providerModelId":"google/vit-base-patch16-224"}} }
/>
There are currently no snippet examples for the **image-classification** task, as no providers support it yet.



Expand Down
5 changes: 1 addition & 4 deletions docs/inference-providers/tasks/image-segmentation.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,10 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


<InferenceSnippet
pipeline=image-segmentation
providersMapping={ {"hf-inference":{"modelId":"jonathandinu/face-parsing","providerModelId":"jonathandinu/face-parsing"}} }
/>
There are currently no snippet examples for the **image-segmentation** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/image-to-image.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **image-to-image** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/object-detection.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **object-detection** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/question-answering.md
Copy link
Member

@julien-c julien-c May 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hopefully we want to have at least 1 model deployed for each task in the coming days (by either the automated or manual hf-inference deploys), cc @Vaibhavs10 @tomaarsen @oOraph

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes! deploying models as we speak.. we're skipping the following from automated deploys:

GPU_ONLY_TASKS = [
    "image-to-image",
    "text-to-image",
    "text-to-video",
    "text-generation",
    "automatic-speech-recognition",
    "object-detection",
]
NOT_IMPLEMENTED_TASKS = [
    "text-ranking",
]

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For GPU_ONLY_TASKS we can just deploy one of each manually imo

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(for the ones where we don't already have a third party provider)

Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **question-answering** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/summarization.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **summarization** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/table-question-answering.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **table-question-answering** task, as no providers support it yet.



Expand Down
1 change: 0 additions & 1 deletion docs/inference-providers/tasks/text-classification.md
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,6 @@ For more details about the `text-classification` task, check out its [dedicated

- [distilbert/distilbert-base-uncased-finetuned-sst-2-english](https://huggingface.co/distilbert/distilbert-base-uncased-finetuned-sst-2-english): A robust model trained for sentiment analysis.
- [ProsusAI/finbert](https://huggingface.co/ProsusAI/finbert): A sentiment analysis model specialized in financial sentiment.
- [cardiffnlp/twitter-roberta-base-sentiment-latest](https://huggingface.co/cardiffnlp/twitter-roberta-base-sentiment-latest): A sentiment analysis model specialized in analyzing tweets.

Explore all available models and find the one that suits you best [here](https://huggingface.co/models?inference=warm&pipeline_tag=text-classification&sort=trending).

Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/token-classification.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **token-classification** task, as no providers support it yet.



Expand Down
6 changes: 1 addition & 5 deletions docs/inference-providers/tasks/translation.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,17 +24,13 @@ For more details about the `translation` task, check out its [dedicated page](ht

### Recommended models

- [google-t5/t5-base](https://huggingface.co/google-t5/t5-base): A general-purpose Transformer that can be used to translate from English to German, French, or Romanian.

Explore all available models and find the one that suits you best [here](https://huggingface.co/models?inference=warm&pipeline_tag=translation&sort=trending).

### Using the API


<InferenceSnippet
pipeline=translation
providersMapping={ {"hf-inference":{"modelId":"google-t5/t5-base","providerModelId":"google-t5/t5-base"}} }
/>
There are currently no snippet examples for the **translation** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion docs/inference-providers/tasks/zero-shot-classification.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ Explore all available models and find the one that suits you best [here](https:/
### Using the API


No snippet available for this task.
There are currently no snippet examples for the **zero-shot-classification** task, as no providers support it yet.



Expand Down
2 changes: 1 addition & 1 deletion scripts/inference-providers/package.json
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@
"author": "",
"license": "ISC",
"dependencies": {
"@huggingface/inference": "^3.10.0",
"@huggingface/inference": "^3.11.0",
"@huggingface/tasks": "^0.19.1",
"@types/node": "^22.5.0",
"handlebars": "^4.7.8",
Expand Down
18 changes: 9 additions & 9 deletions scripts/inference-providers/pnpm-lock.yaml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,6 @@

{{else}}

No snippet available for this task.
There are currently no snippet examples for the **{{ task }}** task, as no providers support it yet.

{{/if}}