-
### Bug Description
I have the prebuild Vector Store RAG, but it doesn't upload the fille on the Datastax Astra DataBase that I have created
### Reproduction
open the prebuild Vector Store RAG, an…
-
Hallo,
I have been training model in distributed pytorch using hugging face trainer API. Now i have been training model on slrum multi node multi gpu and for every GPU, it logs in mlflow ui. Is th…
-
I'm encountering an error while trying to save the default unsloth Llama 3.1 model to GGUF format. The issue occurs when running the code on Google Colab with a T4 GPU.
**Environment:**
- Google…
-
To run LLaMA 3.1 (or similar large language models) locally, you need specific hardware requirements, especially for storage and other resources. Here's a breakdown of what you typically need:
### …
-
🤘🏼 This is awesome. Let's build a local version 😅
📺
https://www.youtube.com/watch?v=vN0t-kcPOXo
👨🏻💻
https://github.com/disler/poc-realtime-ai-assistant
-
### 🚀 The feature, motivation and pitch
Currently, some APIs already support input already fakefied (e.g. `torch.onnx.dynamo_export` and `torch._dynamo.export` as of #105477 #100017 #103865 #106515…
-
Create an API service that can be called to process the requests from the app.
We can then host this into a server.
The API shall accept the role and the token.
Instructions for deploying the …
-
### Is your feature request related to a problem? Please describe.
_No response_
### Describe the solution you'd like
Google Gemini API is generally available now. Please add support for it. Also g…
-
As a [platform engineer](https://github.com/artificialwisdomai/origin/wiki/User-Personas), I want to interact datastorage, inference, configuration, and model selection via API.
***
Acceptance Crite…
-
使ったcurlは:
```
curl -k -H "Content-Type: application/json" -X POST -d '{"events": [{"replyToken": "nHuyWiB7yP5Zw52FIkcQobQuGDXCTA","type": "message","timestamp": 1462629479859,"source": {"type": "u…