-
Dear developers,
I decided to clone the code/demo but apparently the data files are not present on the LFS servers. I get the following error during clone:
```
Downloading Inference/db/imdb_raw…
-
- 环境
- docker: registry.cn-hangzhou.aliyuncs.com/havenask/rtp_llm:0.1.13_cuda12
- cuda: 12.1
- driver: 515.105.01
- 模型:
- llama: https://huggingface.co/lmsys/vicuna-33b-v1.3
…
-
We currently use the inference client in the NL-to-ESQL task. We should fully migrate to it, which means that we replace all instances of `client.chatComplete()` and `client.chat()` with `inferenceCli…
-
**Describe the package you'd like added**
`vllm` has become a popular inference server for LLMs: https://github.com/vllm-project/vllm
**Describe how this package fits in with the project**
GenAI/…
-
**Describe the bug**
The MNIST Inference web example is not working. It appears to be trying to load `pkg/mnist_inference_web.js` but no such file exists.
**To Reproduce**
```
cd examples/mnist-…
-
### System Info
- `transformers` version: 4.46.0.dev0
- Platform: Linux-5.15.0-120-generic-x86_64-with-glibc2.35
- Python version: 3.10.15
- Huggingface_hub version: 0.25.2
- Safetensors versio…
-
Terraform apply fails.
When runnint terraform apply, it fails deploying Kubernetes.
Used Branch: release-1.1
Logs:
````
module.inference-server.kubernetes_deployment.inference_deploymen…
-
does this do lyrics like the demo app at https://app.riffusion.com/
-
请问
registry.baidubce.com/paddlepaddle/fastdeploy:llm-base-gcc12.3-cuda11.8-cudnn8-nccl2.15.5
的dockerfile方便提供一下吗?
-
### Area of Improvement
Right now if user didn't set `QueryClient` `defaultOptions.retry` to false, `trpc` will automatically fallback to this `retry` property's default value (which is `4`) and igno…