workers-kv Search Results

1000+ results
for workers-kv

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

evanderkoogh/otel-cf-workers #116

Instrumentation breaks new RPC style service calls

https://blog.cloudflare.com/javascript-native-rpc If you instrument your worker, and it tries to call a method on a bound service using the new RPC style, you get a `TypeError: Illegal invocation` …

johtso updated 6 months ago
12
vllm-project/vllm #9243

[Bug]: vllm0.6.2 Using FLASHINFER to start VLLM reported an…

Using FLASHINFER to start VLLM reported an error, enabling -- quantification gptq -- kv cache dtype fp8_e5m2 Start command: python3 -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 78…

Rssevenyu updated 2 days ago
3
ilyydy/cf-openai #14

cloudflare 实现限流/锁功能

KV 不同节点同步需要时间，适合常读少写的场景。 https://developers.cloudflare.com/workers/learning/how-kv-works/ >KV achieves this performance by being eventually-consistent. Changes are usually immediately visible in…

ilyydy updated 1 year ago
10
triton-inference-server/tensorrtllm_backend #573

Inference server stalling

### System Info - tensorrtllm_backend built using Dockerfile.trt_llm_backend - main branch tesnorrt llm (0.13.0.dev20240813000) - 8xH100 SXM - Driver Version: 535.129.03 - CUDA Version: 12.5 …

siddhatiwari updated 1 month ago
4
cloudflare/workers-rs #177

async initializer for durable objects

I'm not sure it is already supported in workers-rs. But hopefully there would be an equivalent binding to `blockConcurrencyWhile` in JavaScript. Something like `async fn new(...) -> Self` I use a p…

cometkim updated 10 months ago
2
NVIDIA/TensorRT-LLM #2109

Conver qwen2-57B-A14B failed

### System Info GPU Name: 8 * H20 TensorRT-LLM : 0.12.0.dev2024080600 NVIDIA-SMI 535.154.05 Driver Version: 535.154.05 CUDA Version: 12.4 ### Who can help? _No response_ ### Inform…

zymy-chen updated 2 months ago
3
vllm-project/vllm #5825

[RFC]: Classifier-Free Guidance

### Motivation. I am one of the authors of the paper Stay On Topic with Classifier-Free Guidance ( https://openreview.net/forum?id=RiM3cl9MdK&noteId=s1BXLL1YZD ) who has been nominated as ICML'24 Spo…

Vermeille updated 1 week ago
3
vllm-project/vllm #3033

Qwen 14B AWQ deploy: AttributeError: 'ndarray' object has no…

$ python -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8001 --model Qwen1.5-14B-Chat-AWQ --tensor-parallel-size 2 --quantization awq --trust-remote-code --dtype half INFO 02-26 1…

testTech92 updated 1 week ago
10
wintercg/proposal-minimum-common-api #9

CacheStorage / localStorage or something similar needs to be…

- https://www.w3.org/TR/service-workers/#cache-objects - https://html.spec.whatwg.org/multipage/webstorage.html I think those APIs should be included for supporting local KV features.

XadillaX updated 11 months ago
9
cloudflare/wildebeest #388

The GitHub action "download Terraform state" failed whilst d…

Hi everyone and firstly thanks for Wildebeest. I much appreciate it already. I did run into a problem during my deploy though. All actions above this one - "download Terraform state" - are successf…

chilts updated 1 year ago
2

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for workers-kv

1000+ results
for workers-kv