Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A proper form for creating inference services #9

Open
markwinter opened this issue Oct 6, 2021 · 1 comment
Open

A proper form for creating inference services #9

markwinter opened this issue Oct 6, 2021 · 1 comment

Comments

@markwinter
Copy link
Member

markwinter commented Oct 6, 2021

Currently when you click "New Model Server" it just lets you paste YAML.

It would be nicer if it presented a form to create an Inference Service without YAML knowledge (and also allow pasting YAML if desired)

I can work on this if the idea is approved.

Form would be something like this, most fields would be optional:

  • Name Field [REQUIRED]

  • Model Format [REQUIRED]

    • Make backend call to get available formats from ClusterServingRuntime and ServingRuntime
  • Serving Runtime (auto set by model format but editable)

  • StorageURI Field [REQUIRED]

  • Resources

    • Requests
      • CPU
      • Memory
    • Limits
      • CPU
      • Memory
      • GPU
  • Custom Ports

    • Port field
  • Replicas

    • MinReplicas Field
    • MaxReplicas Field
  • Enable Autoscaling Checkbox

    • HPA, KPA Selector
      • KPA CPU Utilisation Field
      • HPA Metric+Target Field
    • ContainerConcurrency
  • Enable Transformer Checkbox

    • Container Image Field
    • Container Command Field
    • Resources
      • Requests
        • CPU
        • Memory
      • Limits
        • CPU
        • Memory
    • Custom ENV
    • Custom Ports
      • Add port field
  • ImagePullSecrets Ref Field

@ConverJens
Copy link

We would also love to have this!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants