Skip to content

Commit

Permalink
Updates
Browse files Browse the repository at this point in the history
  • Loading branch information
concretevitamin committed Jan 19, 2024
1 parent bca2a1a commit 7373639
Showing 1 changed file with 8 additions and 2 deletions.
10 changes: 8 additions & 2 deletions docs/source/serving/sky-serve.rst
Original file line number Diff line number Diff line change
Expand Up @@ -12,9 +12,9 @@ Why SkyServe?

* **Bring any serving framework** (vLLM, TGI, FastAPI, ...) and scale it across regions/clouds
* **Reduce costs and increase availability** of service replicas by leveraging multiple/cheaper locations and hardware (spot instances)
* **Out-of-the-box** load-balancing and :ref:`autoscaling <serve-autoscaling>` of service replicas
* Out-of-the-box **load-balancing** and **autoscaling** of service replicas
* **Privacy and Control**: Everything is launched inside your cloud accounts and VPCs
* Manage multi-cloud, multi-region deployments with a single control plane
* **Privacy**: Everything is launched inside your cloud accounts and VPCs

.. * Allocate scarce resources (e.g., A100) **across regions and clouds**
.. * Autoscale your endpoint deployment with load balancing
Expand Down Expand Up @@ -444,6 +444,12 @@ Terminate services:
$ sky serve down http-server # terminate the http-server service
$ sky serve down --all # terminate all services
Autoscaling
-----------

See :ref:`Autoscaling <serve-autoscaling>` for more information.


SkyServe Architecture
---------------------

Expand Down

0 comments on commit 7373639

Please sign in to comment.