-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
docs: SMMF introduction and usage (#878)
Co-authored-by: junewgl <[email protected]>
- Loading branch information
Showing
44 changed files
with
276 additions
and
1,259 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -9,7 +9,6 @@ __pycache__/ | |
message/ | ||
|
||
.env | ||
.idea | ||
.vscode | ||
.idea | ||
.chroma | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,3 @@ | ||
include README.md | ||
include LICENSE | ||
include README.md | ||
include requirements.txt |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,64 @@ | ||
# SMMF | ||
|
||
The DB-GPT project provides service-oriented multi-model management capabilities. Developer who are interested in related capabilities can read the [SMMF](/docs/modules/smmf) module part. Here we focus on how to use multi-LLMs. | ||
|
||
|
||
Here we mainly introduce the usage through the web interface. For developer interested in the command line, you can refer to the [cluster deployment](/docs/installation/model_service/cluster) model. Open the DB-GPT-Web frontend service and click on `Model Management` to enter the multi-model management interface. | ||
|
||
|
||
## List Models | ||
By opening the model management interface, we can see the list of currently deployed models. The following is the list of models. | ||
|
||
<p align="left"> | ||
<img src={'/img/module/model_list.png'} width="720px"/> | ||
</p> | ||
|
||
## Use Models | ||
Once the models are deployed, you can switch and use the corresponding model on the multi-model interface. | ||
|
||
<p align="left"> | ||
<img src={'/img/module/model_use.png'} width="720px"/> | ||
</p> | ||
|
||
## Stop Models | ||
As shown in the figure below, click Model Management to enter the model list interface. Select a specific model and click the red `Stop Model` button to stop the model. | ||
|
||
<p align="left"> | ||
<img src={'/img/module/model_stop.png'} width="720px"/> | ||
</p> | ||
|
||
After the model is stopped, the display in the upper right corner will change. | ||
|
||
<p align="left"> | ||
<img src={'/img/module/model_stopped.png'} width="720px"/> | ||
</p> | ||
|
||
## Model Deployment | ||
|
||
1. Open the web page, click the `model management` button on the left to enter the model list page, click `Create Model` in the upper left corner, and then select the name of the model you want to deploy in the pop-up dialog box. Here we choose `vicuna-7b-v1.5`, as shown in the figure. | ||
|
||
<p align="left"> | ||
<img src={'/img/module/model_vicuna-7b-1.5.png'} width="720px"/> | ||
</p> | ||
|
||
|
||
2. Select the appropriate parameters according to the actual deployed model (if you are not sure, the default is enough), then click the `Submit` button at the bottom left of the dialog box, and wait until the model is deployed successfully. | ||
|
||
3. After the new model is deployed, you can see the newly deployed model on the model page, as shown in the figure | ||
|
||
<p align="left"> | ||
<img src={'/img/module/model_vicuna_deployed.png'} width="720px"/> | ||
</p> | ||
|
||
# Operations and Observability | ||
|
||
Operations and observability are important components of a production system. In terms of operational capabilities, DB-GPT provides a command-line tool called dbgpt for operations and management, in addition to the common management functionalities available on the web interface. The dbgpt command-line tool offers the following functionalities: | ||
|
||
- Starting and stopping various services | ||
- Knowledge base management (batch import, custom import, viewing, and deleting knowledge base documents) | ||
- Model management (viewing, starting, stopping models, and conducting dialogues for debugging) | ||
Observability tools (viewing and analyzing observability logs) | ||
|
||
We won't go into detail about the usage of the command-line tool here. You can use the `dbgpt --help` command to obtain specific usage documentation. Additionally, you can check the documentation for individual subcommands. For example, you can use `dbgpt start --help` to view the documentation for starting a service. For more information, please refer to the document provided below. | ||
|
||
- [Debugging](/docs/application/advanced_tutorial/debugging) |
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,3 @@ | ||
# Documentation Description | ||
# ChangeLog | ||
|
||
Our version release information is maintained on GitHub. For more details, please visit [ReleaseNotes](https://github.com/eosphoros-ai/DB-GPT/releases) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,25 @@ | ||
# Connections | ||
The connections module supports connecting to various structured, semi-structured, and unstructured data storage engines. Bring multi-dimensional data into the framework and realize the interaction between natural language and multi-dimensional data | ||
The connections module supports connecting to various structured, semi-structured, and unstructured data storage engines. Bring multi-dimensional data into the framework and realize the interaction between natural language and multi-dimensional data | ||
|
||
The list of data sources we currently support is as follows. | ||
|
||
| DataSource | support | Notes | | ||
| ------------------------------------------------------------------------------ | ----------- | ------------------------------------------- | | ||
| [MySQL](https://www.mysql.com/) | Yes | MySQL is the world's most popular open source database. | | ||
| [PostgresSQL](https://www.postgresql.org/) | Yes | The World's Most Advanced Open Source Relational Database | | ||
| [Spark](https://github.com/apache/spark) | Yes | Unified Engine for large-scale data analytics | | ||
| [DuckDB](https://github.com/duckdb/duckdb) | Yes | DuckDB is an in-process SQL OLAP database management system | | ||
| [Sqlite](https://github.com/sqlite/sqlite) | Yes | | | ||
| [MSSQL](https://github.com/microsoft/mssql-jdbc) | Yes | | | ||
| [ClickHouse](https://github.com/ClickHouse/ClickHouse) | Yes | ClickHouse is the fastest and most resource efficient open-source database for real-time apps and analytics. | | ||
| [Oracle](https://github.com/oracle) | No | TODO | | ||
| [Redis](https://github.com/redis/redis) | No | The Multi-model NoSQL Database | | ||
| [MongoDB](https://github.com/mongodb/mongo) | No | MongoDB is a source-available cross-platform document-oriented database program | | ||
| [HBase](https://github.com/apache/hbase) | No | Open-source, distributed, versioned, column-oriented store modeled | | ||
| [Doris](https://github.com/apache/doris) | No | Apache Doris is an easy-to-use, high performance and unified analytics database. | | ||
| [DB2](https://github.com/IBM/Db2) | No | TODO | | ||
| [Couchbase](https://github.com/couchbase) | No | TODO | | ||
| [Elasticsearch](https://github.com/elastic/elasticsearch) | No | Free and Open, Distributed, RESTful Search Engine | | ||
| [OceanBase](https://github.com/OceanBase) | No | OceanBase is a distributed relational database. | | ||
| [TiDB](https://github.com/pingcap/tidb) | No | TODO | | ||
| [StarRocks](https://github.com/StarRocks/starrocks) | Yes | StarRocks is a next-gen, high-performance analytical data warehouse | |
Oops, something went wrong.