-
Hi everyone, and first of all, thanks for Wildebeest. I already appreciate it a lot.
I did run into a problem during my deploy though. All actions above this one - "download Terraform state" - are successf…
-
![image](https://github.com/user-attachments/assets/2babc2d1-75e5-4fb3-8ddc-d12ba0bf78c5)
I set up these two by mistake. How do I delete them?
-
### What happened + What you expected to happen
`ray.data.RandomAccessDataset.multiget` is expected to return `None` for missing records, but I got an unexpected value for the missing key.
I find…
-
The `getBindingsProxy` utility (introduced in https://github.com/cloudflare/workers-sdk/pull/4523) should include as part of its return object some utility methods that allow users to get hold of spec…
-
Workers KV writes cost 10x as much as KV reads. This leads to rather large bills, something I do not desire. Therefore, I intend to implement a Bloom filter that works like so:
On every request tha…
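The description of the design is cut off above, so the following is only a minimal sketch of the general technique, not the author's implementation. It shows a Bloom filter used to skip writes for keys that have probably been written already. The names `BloomFilter`, `record_once`, and the plain `dict` standing in for a KV namespace are all hypothetical, and the sizing uses the standard textbook formulas.

```python
import hashlib
import math


class BloomFilter:
    """Illustrative Bloom filter; not tied to any Workers KV API."""

    def __init__(self, expected_items: int, false_positive_rate: float = 0.01):
        # Standard sizing: m = -n * ln(p) / (ln 2)^2 bits, k = (m / n) * ln 2 hashes.
        self.num_bits = max(
            8,
            math.ceil(-expected_items * math.log(false_positive_rate) / (math.log(2) ** 2)),
        )
        self.num_hashes = max(1, round(self.num_bits / expected_items * math.log(2)))
        self.bits = bytearray((self.num_bits + 7) // 8)

    def _positions(self, key: str):
        # Double hashing: derive all bit positions from two 64-bit halves of one digest.
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so the step is never zero
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key: str) -> bool:
        # False means "definitely never added"; True means "probably added".
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key))


def record_once(seen: BloomFilter, store: dict, key: str, value: str) -> bool:
    """Write to `store` only when the filter has not seen `key`.

    `store` is a plain dict standing in for a KV namespace (an assumption).
    Returns True if a write happened.
    """
    if seen.might_contain(key):
        return False  # probably written before: skip the expensive write
    store[key] = value
    seen.add(key)
    return True
```

One trade-off to keep in mind: a false positive makes `record_once` skip a write that was actually needed, so this pattern only fits data where an occasional missed write is acceptable.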
astei updated
2 years ago
-
We need to count how many people have accessed a particular shortened URL.
There are two options:
1. Use metadata
- https://developers.cloudflare.com/workers/runtime-apis/kv/#metadata
…
-
That way the whole stack can be deployed on CF, instead of relying on other third-party services that have usage limits.
-
### Report of performance regression
Hi, I use this:
```
server_vllm.py \
--model "/data/models_temp/functionary-small-v2.4/" \
--served-model-name "functionary" \
--dtype=bfloat16 \
-…
```
rvsh2 updated
1 month ago
-
It would be very helpful if a subrequest counter value was exposed at runtime for long-running scheduled tasks performed by workers. A real-world example use case is an hourly CRON task running for up…
-
### Your current environment
Using latest docker image vllm/vllm-openai:v0.5.0.post1
### 🐛 Describe the bug
When trying to send a request to the /v1/embeddings endpoint of the deployed mode…