-
Congratulations on your excellent work! I attempted to run `bash scripts/run_streamingllm_lm.sh` to reproduce the results of streaming_llm, but I encountered the following error:
```
TypeError: ll…
```
-
Hey,
This is a really interesting solution to the KV cache problem for long contexts.
https://github.com/mit-han-lab/streaming-llm
I was wondering if it could be implemented here. From the looks of thing…
-
Hi,
I was reading your paper and have a question about the "critical tokens". In Quest, the criticality of the tokens can change with different query tokens, whereas in StreamingLLM the initial keys…
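The contrast the question draws can be illustrated with a toy sketch (hypothetical code, not from either repository; `quest_critical` and `streamingllm_keep` are made-up names, and the per-query dot-product scoring is a simplification of Quest's page-level criticality estimate):

```python
import numpy as np

np.random.seed(0)
keys = np.random.randn(16, 4)   # 16 cached keys, head dimension 4

def quest_critical(query, keys, k=4):
    """Quest-style: which keys are critical depends on the current query."""
    scores = keys @ query                 # affinity of each cached key to this query
    return set(np.argsort(scores)[-k:])   # top-k keys for THIS query

def streamingllm_keep(n_keys, n_sinks=2, window=2):
    """StreamingLLM-style: a fixed choice, independent of the query —
    the initial "attention sink" keys plus the most recent window."""
    return set(range(n_sinks)) | set(range(n_keys - window, n_keys))

q1, q2 = np.random.randn(4), np.random.randn(4)
print(quest_critical(q1, keys))       # may change from query to query
print(quest_critical(q2, keys))
print(streamingllm_keep(len(keys)))   # always the same set: {0, 1, 14, 15}
```

The point of the sketch is only that Quest's selection is a function of the query, while StreamingLLM's retained set is static.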
-
When I experiment with StreamingLLM on Llama following this (https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/llama#run-llama-with-streamingllm), I wonder why it always reports length-related errors. w…
-
I run your instructions on the openbookqa task and got the following results:
full cache / dense:
```
"openbookqa": {
  "acc": 0.414,
  "acc_stderr": 0.02204949796982787,
  "acc_norm": 0…
```
-
Came across [this paper](https://github.com/mit-han-lab/streaming-llm/tree/main) on streaming LLMs that may be good for improving the efficiency of streaming.
-
From the paper, the comparisons are mainly against fine-tuning methods (such as Positional Interpolation, NTK-Aware Scaled RoPE, and StreamingLLM).
Are there any accuracy evaluations against existing commercial long-context models?
I would like to know how well this approach actually performs.
-
From Section 3.2 in the paper:
```
When determining the relative distance and adding positional information to tokens, StreamingLLM focuses on positions within the cache rather than those in the …
```
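The cache-relative positioning quoted above can be sketched roughly as follows (a minimal illustration, not the actual implementation; the 4-sink / 8-token window sizes and the `evict` / `assign_positions` names are assumptions for the example):

```python
# Sketch of StreamingLLM-style eviction and cache-relative positions.
# Assumption: 4 "attention sink" tokens are always kept, plus a recent window of 8.

def evict(cache, n_sinks=4, window=8):
    """Keep the first n_sinks entries and the last `window` entries."""
    if len(cache) <= n_sinks + window:
        return cache
    return cache[:n_sinks] + cache[-window:]

def assign_positions(cache):
    """Positions are indices *within the cache*, not original text offsets."""
    return list(range(len(cache)))

# Simulate a stream of 20 tokens (original text positions 0..19).
cache = []
for t in range(20):
    cache.append(t)
    cache = evict(cache)

print(cache)                    # original text positions that survived eviction
print(assign_positions(cache))  # positions actually used for encoding: 0..11
```

Note the gap in the surviving text positions (3 jumps to 12), while the positions fed to the positional encoding stay contiguous, which is the behavior the quoted sentence describes.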
-
- [ ] [RichardAragon/MultiAgentLLM](https://github.com/richardaragon/multiagentllm)
# RichardAragon/MultiAgentLLM
**DESCRIPTION:** "Multi Agent Language Learning Machine (Multi Agent LLM)
(Update)…