[Feature]: Loading special data(csv) from the frontend manually

SkywardAI / kirin

APIs aggregator for inference, fine-tuning and build models.

https://skywardai.github.io/skywardai.io/

Apache License 2.0


[Feature]: Loading special data(csv) from the frontend manually #172

Open Aisuko opened 2 weeks ago

Aisuko commented 2 weeks ago

Contact Details(optional)

No response

What feature are you requesting?

Currently, we add some data right after we create the vector-db collection. We do not need to do that here; we can load the data from the frontend instead. By default, we don't have to provide any special context, only the chat history.

Also, we don't need to convert the input to vectors using our vector-db.

Aisuko commented 1 week ago

As we discussed, let's see how we can use this feature to solve this issue.

Micost commented 1 week ago

https://milvus.io/docs/embeddings.md#Example-1-Use-default-embedding-function-to-generate-dense-vectors

Milvus cannot directly search strings with a complex algorithm. All it can do is use `LIKE`, as other ordinary relational databases do. However, the Milvus library includes an embedding function that generates embedding vectors. I think it would help, since we would no longer need any extra model or library for tokenizing.

Aisuko commented 6 days ago

As we discussed, we can use the embedding API of the inference engine. @Micost
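
A rough sketch of what calling the inference engine for embeddings could look like. It assumes the engine exposes an OpenAI-compatible `POST /v1/embeddings` endpoint, which this thread does not confirm. The function names, the `base_url` parameter, and the `"default"` model name are hypothetical.

```python
import json
import urllib.request
from typing import Callable, Optional


def http_post_json(url: str, body: dict) -> dict:
    """POST a JSON body and return the decoded JSON response."""
    req = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def embed_texts(
    texts: list[str],
    base_url: str,
    model: str = "default",  # hypothetical model name
    post: Optional[Callable[[str, dict], dict]] = None,
) -> list[list[float]]:
    """Return one embedding vector per input text, in input order.

    Assumes an OpenAI-compatible response: {"data": [{"index": i, "embedding": [...]}]}.
    """
    post = post or http_post_json
    url = base_url.rstrip("/") + "/v1/embeddings"
    payload = post(url, {"input": texts, "model": model})
    # Sort by index so the vectors line up with the input texts.
    items = sorted(payload["data"], key=lambda item: item["index"])
    if len(items) != len(texts):
        raise ValueError("embedding count does not match input count")
    return [item["embedding"] for item in items]
```

The `post` argument is injectable so the function can be exercised without a running engine; in the backend it would default to the real HTTP call.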

Aisuko commented 5 days ago

Note: We only consider datasets for the RAG feature, and we only use the dataset https://www.kaggle.com/datasets/aisuko/rmit-2024-postgraduate-study-areas. There is no need to support other formats (more rows and columns) at the current stage. @Micost @jinronga
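
A minimal sketch of how the backend might parse a CSV file uploaded from the frontend, using only the standard library. The function names and the `max_rows` limit are hypothetical, and nothing here depends on the dataset's actual column names, which are not listed in this thread.

```python
import csv
import io


def parse_uploaded_csv(raw: bytes, max_rows: int = 1000) -> list[dict[str, str]]:
    """Parse uploaded CSV bytes into a list of {column: value} dicts."""
    # utf-8-sig drops a byte-order mark if the file has one.
    text = raw.decode("utf-8-sig")
    reader = csv.DictReader(io.StringIO(text))
    if not reader.fieldnames:
        raise ValueError("CSV file has no header row")
    rows = []
    for i, row in enumerate(reader):
        if i >= max_rows:
            raise ValueError(f"CSV has more than {max_rows} rows")
        # Ignore overflow cells (key None) and treat missing cells as empty.
        cleaned = {
            key: (value or "").strip()
            for key, value in row.items()
            if key is not None
        }
        # Skip rows in which every cell is empty.
        if any(cleaned.values()):
            rows.append(cleaned)
    return rows


def row_to_text(row: dict[str, str]) -> str:
    """Flatten one row into a single string that can be embedded or used as context."""
    return "; ".join(f"{key}: {value}" for key, value in row.items() if value)
```

Each row can then be flattened with `row_to_text` and passed on as RAG context. The row limit is only a guard in the spirit of the note above, not a requirement stated in this issue.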