Skip to content

Leaderworkerset v0.3.0

Compare
Choose a tag to compare
@liurupeng liurupeng released this 04 Jun 20:42
v0.3.0
f55ce01

Features:

  • RollingUpdate with MaxSurge support
  • Subgroup support for disaggregated serving
  • Example for multi-node serving of llama 70B on GPUs with vLLM
  • Add a new start policy API
  • Inject leader address environment variable to every container
  • Spec.rolloutStrategy should be a non-required field

Acknowledgments

Thanks to our contributors in this release, in alphabetic order:
@ahg-g @Edwinhr716 @googs1025 @gujingit @jjk-g @kerthcet @liurupeng @nayihz