Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepSeek Load Balancing Bug #11160

Open
5 tasks done
walden668 opened this issue Nov 27, 2024 · 3 comments
Open
5 tasks done

DeepSeek Load Balancing Bug #11160

walden668 opened this issue Nov 27, 2024 · 3 comments
Labels
🐞 bug Something isn't working cloud When the version is cloud and it is a bug report

Comments

@walden668
Copy link

Self Checks

  • This is only for bug report, if you would like to ask a question, please head to Discussions.
  • I have searched for existing issues search for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
  • [FOR CHINESE USERS] 请务必使用英文提交 Issue,否则会被关闭。谢谢!:)
  • Please do not modify this template :) and fill in all the required fields.

Dify version

0.12.1

Cloud or Self Hosted

Cloud

Steps to reproduce

2024-11-26_6 52 02
2024-11-26_6 53 44

Reporting a bug: DeepSeek load balancer triggers an error approximately every two attempts.
I've verified that all keys are working fine - direct API calls to the official endpoint using curl work normally.

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

Copy link

dosubot bot commented Nov 27, 2024

@walden668 Hello there! I'm here to help you with bug reports, answer your questions, and assist you in becoming a contributor. Let's tackle this issue together while we wait for a human maintainer.

The issue you're experiencing with the DeepSeek load balancer might be related to its request distribution mechanism. The load balancer uses a Round-robin strategy, which can lead to intermittent failures if rate limits are triggered. When these limits are exceeded, a 1-minute cooldown period is applied, potentially causing temporary unavailability and resulting in errors approximately every two attempts [1].

To address this, you might want to check if the rate limits are being exceeded and adjust your request patterns accordingly. If the issue persists, further investigation into the specific error messages or logs generated by the load balancer could provide more insights.

To continue talking to Dosu, mention @dosu.

@dosubot dosubot bot added cloud When the version is cloud and it is a bug report 🐞 bug Something isn't working labels Nov 27, 2024
@walden668
Copy link
Author

I've confirmed that no rate limit was triggered, since the returned error is a 401 auth error

@crazywoola
Copy link
Member

Can you paste the text errors as well?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🐞 bug Something isn't working cloud When the version is cloud and it is a bug report
Projects
None yet
Development

No branches or pull requests

2 participants