kv-cache-compression Search Results

891 results for kv-cache-compression

Best match

cockroachdb/pebble #112

db: blob storage / WiscKey-style value separation

## Background The [WiscKey](https://www.usenix.org/system/files/conference/fast16/fast16-papers-lu.pdf) paper showed the value of segregating storage for large values in order to reduce write ampli…

petermattis updated 1 month ago
14
turboderp/exllama #272

YaRN Support

Any thoughts/plans about YaRN support for the positional embeddings? https://github.com/jquesnelle/yarn I don't actually see them beat regular linear scaling w/ fine-tuning in the paper, but presu…

grimulkan updated 1 year ago
8
ROCm/AMDMIGraphX #3307

[INT4] Compress model by quantizing weights to int4

### Idea Use int4 as the compression technique to fit larger models onto Navi machines or possibly MI series machines. Weights would be compressed using encoding scheme that would pack two 4 bits n…

umangyadav updated 1 week ago
27
pmwkaa/serenity #10

Buggy scenario

``` Endpoint#write *1 $4 info Redis#write $2688 # Storage info sophia.version = 1.2.3 sophia.build = 59ff278 sophia.error = sophia.path = ./serenity_db sophia.path_create = 1 memory.limit = 107374182…

FGRibreau updated 8 years ago
1
ideawu/ssdb #919

SSDB Uses too much Memory

Greetings, I am running SSDB on my server. The data directory is about 1.9GB, and all the items are simple KV records; the lengths of keys and values are about 10B and 20B. I found the ssdb occupy a l…

yushengery updated 7 years ago
5
ideawu/ssdb #994

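Hello, how should the configuration be optimized for a single ssdb instance on a machine with 8 cores, 24G of memory, and an SSD

Hello, how should the configuration be optimized for a single ssdb instance on a machine with 8 cores, 24G of memory, and an SSD? After setting compression to true, what happens if memory (cache_size + 10 \* write_buffer_size \* 66 + 32) exceeds physical memory? Thanks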

tongmeng256 updated 7 years ago
7
Zefan-Cai/PyramidKV #16

Settings and implementations of baselines

Hi, thanks for your insightful work! But it seems that some settings and implementations of baselines are not so proper and fair? 1. Window size. In `run_longbench.py: 221`, PyramidKV uses window s…

bingps updated 3 months ago
3
natverse/fafbseg #67

consider caching expensive flywire id lookups

rootid->supervoxels this can be cached essentially permanently since rootids mutate whenever the mapping changes however they are quite large, so the object store could quickly start taking a l…

jefferis updated 3 years ago
8
hao-ai-lab/LookaheadDecoding #14

How to generate the n-grams - which to keep, which to discar…

This is actually not my question, llama.cpp wants to implement this but encountered some problems. https://github.com/ggerganov/llama.cpp/pull/4207

bobqianic updated 9 months ago
27
thanos-io/thanos #5758

Document metrics exported by Thanos' components

### Is your proposal related to a problem? Yes. Figuring out which metrics are exported by the various Thanos components and the meaning behind them is often a process that requires diving deep…

douglascamata updated 3 months ago
13
