Open TC117 opened 2 weeks ago
Should not allow delete model / engine while running same action API
does the /engines/:name/ load still work ???
On CLI, we dont have engines load
but on /chat/completions
show message "Engine is not loaded yet"
QA details:
Version:
v1.0.1-227
OS (select one)
1. Manual QA (CLI)
Installation
/usr/local/bin
]Data/Folder structures
cortex.so/model_name/variants/
, with .gguf and model.yml filehuggingface.co/author/model_name
with .gguf and model.yml filemodel
,author_repo_id
,branch_name
,path_to_model_yaml
(view via SQL)Cortex Update
cortex update
should update from ~3-5 versions ago to latest (+3 to 5 bump)cortex update
should update from the previous version to latest (+1 bump)cortex update -v 1.x.x-xxx
should update from the previous version to specified versioncortex update
should update from previous stable version to latestOverall / App Shell
cortex
orcortex -h
displays help commandsEngines
Server
cortex start
should start server and output localhost URL & port numbercortex start
can be configured with parameters (port, logLevel [WIP]) https://cortex.so/docs/cli/start/cortex ps
should return server status and running models (or no model loaded)cortex stop
should stop serverModel Pulling
cortex.so
huggingface.co
Interrupted Download
Found unfinished download! Additional XGB needs to be downloaded
)Model Management
Model Running
cortex run <cortexso model>
- if no local models detected, showspull
model menucortex run
- if local model detected, runs the local modelcortex run
- if multiple local models detected, shows list of local models (from multiple model sources eg cortexso, HF authors) for users to select (via regex search)cortex run <invalid model id>
should return gracefullyModel not found!
cortex run <model>
starts interactive chat (by default)cortex run <model> -d
runs in detached modecortex models start <model>
exit()
should exit interactive chatHardware Detection / Acceleration [WIP, no need to QA]
Uninstallation / Reinstallation
--
2. API QA
Checklist for each endpoint
cortex start
, API page is displayed at localhost:port endpointEndpoints
Chat Completions
v1/chat/completions
Engines
/v1/engines
/v1/engines/{name}
/v1/engines/install/{name}
v1/engines/{name}/default
v1/engines/{name}/default
v1/engines/{name}/load
v1/engines/{name}/load
v1/engines/{name}/update
/v1/engines/install/{name}
Pulling Models
/v1/models/pull
starts download (websockets)websockets /events
emitted/v1/models/pull
(websockets)websockets /events
stoppedv1/models/import
Running Models
v1/models
/v1/models/start
/v1/models/stop
/v1/models/{id}
/v1/models/{id}
/v1/models/{model}
updates model.yaml paramsServer
/healthz
[x] terminate server: DELETE
/processManager/destroy
Test list for reference:
1357 e2e tests for APIs in CI
1147, #1225 for starting QA list