-
This issue is the "disk read" counterpart to https://github.com/cockroachdb/cockroach/issues/17500, which was addressed by https://github.com/etcd-io/raft/pull/8 and https://github.com/cockroachdb/coc…
-
Hello, I would like to know whether the inference times reported in Figure 4 are measured WITHOUT a KV cache, while the "TPS" results in Table 3 are the prefill time (first-token inference time)?
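For intuition on why the KV-cache setting matters for these numbers: without a cache, every decode step recomputes key/value projections over the whole growing sequence, so cost grows quadratically with generated length. A tiny sketch of my own (the function and its name are illustrative, not from the paper) counting per-step key/value projections:

```python
def kv_projection_count(prompt_len, new_tokens, use_kv_cache):
    """Count how many key/value projections are computed during generation.

    With a cache, each step only projects the single new token; without one,
    the whole growing sequence is reprojected at every step.
    """
    total = 0
    seq_len = prompt_len
    for _ in range(new_tokens):
        total += 1 if use_kv_cache else seq_len
        seq_len += 1
    return total

# e.g. a 100-token prompt, generating 50 tokens:
# kv_projection_count(100, 50, use_kv_cache=True)  -> 50
# kv_projection_count(100, 50, use_kv_cache=False) -> 100 + 101 + ... + 149 = 6225
```

That gap is why "with cache" and "without cache" timings are not directly comparable, and why it matters which setting Figure 4 uses.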
-
To make Restate fully highly available, we also need the metadata store to be highly available. We either need to find a KV store that provides linearizable reads and writes, or we need to build it our…
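For reference, a minimal in-memory stand-in for the kind of interface such a metadata store would expose: linearizable reads plus version-guarded (compare-and-swap) writes. All names here are hypothetical sketches, not Restate's actual API, and a real implementation would have to serve these operations through consensus (e.g. Raft) rather than from a single process:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Versioned:
    version: int
    value: bytes


class MetadataStore:
    """Hypothetical single-process stand-in for a linearizable metadata KV store."""

    def __init__(self):
        self._data = {}

    def get(self, key: str) -> Optional[Versioned]:
        # A real store must serve reads through consensus (or a leader lease)
        # to guarantee linearizability, never from a possibly stale replica.
        return self._data.get(key)

    def put_if_version(self, key: str, value: bytes, expected_version: int) -> bool:
        """Write only if the stored version matches; versions start at 0 (absent)."""
        current = self._data.get(key)
        current_version = current.version if current else 0
        if current_version != expected_version:
            return False  # lost the race; the caller re-reads and retries
        self._data[key] = Versioned(current_version + 1, value)
        return True
```

The version-guarded write is what lets multiple nodes coordinate safely on top of the store: a stale writer fails the CAS instead of silently overwriting newer metadata.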
-
This is my driver:
![image](https://github.com/user-attachments/assets/582c828f-19f6-431c-9ad1-215ea07b9cbd)
I ran the whisper.net\examples\NvidiaCuda project, and input a 00:01:48 duration with 11.7mb s…
-
### Your current environment
(latest docker image `vllm/vllm-openai:latest`)
```text
root@68ac2e4db323:/vllm-workspace# python3 collect_env.py
Collecting environment information...
PyTorch versi…
jphme updated 1 month ago
-
In the `update_kv` function of the `H2OKVCluster` class, I see this code:
```
attn_weights = torch.matmul(query_states[..., -self.window_size:, :], key_states.transpose(2, 3)) / math.sqrt(head…
-
**Is your feature request related to a problem? Please describe.**
Many students, including me, struggle to learn the roadmap for becoming an Android developer.
**Describe the solution you'd like**
I will c…
-
In https://github.com/FuelLabs/fuel-core/pull/2142 we introduced benchmarks that were not very clean, and part of the rework has been addressed in:
- https://github.com/FuelLabs/fuel-core/pull/2168…
rymnc updated 2 months ago
-
I tried two GGUF conversions on an M2 Ultra (Metal), but no luck. I converted the models myself and still got the same error.
Here is the first model I tried:
https://huggingface.co/guinmoon/MobileVLM-1.7B-GGUF…
-
### What is the issue?
When using the llm-benchmark tool (https://github.com/MinhNgyuen/llm-benchmark) with Ollama, I get around 80 t/s with Gemma 2 2B. When asking the same questions to llama.cpp in conve…
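One common source of such gaps is that runners disagree on what counts toward t/s: some include prompt-processing (prefill) time, others only the decode phase. A minimal sketch (my own helper, not part of either tool) that measures decode throughput consistently by excluding the time before the first token:

```python
import time

def tokens_per_second(token_stream):
    """Compute decode throughput from an iterable of streamed tokens.

    Time spent before the first token arrives (prefill) is excluded;
    including it makes short generations look much slower than they are.
    """
    it = iter(token_stream)
    try:
        next(it)  # first token marks the end of prefill
    except StopIteration:
        return 0.0  # nothing was generated
    start = time.perf_counter()
    count = 0
    for _ in it:
        count += 1
    elapsed = time.perf_counter() - start
    return count / elapsed if elapsed > 0 else float("inf")
```

Applying the same definition to both Ollama and llama.cpp output streams makes the two numbers directly comparable.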