kubeagi / arcadia

A diverse, simple, and secure one-stop LLMOps platform
http://www.kubeagi.com/
Apache License 2.0
64 stars 21 forks source link

take 9seconds to search references during chat #971

Closed bjwswang closed 2 months ago

bjwswang commented 3 months ago

@ggservice007 Please provide more details about this problem

ggservice007 commented 3 months ago

how to trace

kubectl logs -n kubeagi-system arcadia-fastchat-65dbfd7dfc-457c5 -c fastchat-api-server

img_v3_029h_4dfe8291-077d-4d06-b772-6c1a4e1e9a7g

step

step 1

embedding

step 2

query the pg vector

This step takes about 7 ~ 9 seconds.

step 3

invoke llm to generate the answer