llm-compression Search Results

508 results
for llm-compression

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

junhwi/next-gen-ai #18

24/03/31

https://www.ai21.com/blog/announcing-jamba http://qwenlm.github.io/blog/qwen-moe/ https://x.ai/blog/grok-1.5 https://openai.com/blog/navigating-the-challenges-and-opportunities-of-synthetic-voices …

junhwi updated 4 months ago
3
microsoft/LLMLingua #161

[Question]: LongBench BM25 reproduce

### Describe the issue I'm interested in your longllmlingua results on LongBench. I reproduced LongBench BM25 2,000-token constraint using ChatGPT. Unlike the your paper's results, the performance …

JUNE515 updated 1 month ago
3
horseee/LLM-Pruner #8

When would the code for ChatGLM be released?

Thanks a lot for your work on compression on LLMs, and looking forward for the code for ChatGLM. When would it be available for GLMs?

moonlightian updated 1 year ago
1
microsoft/autogen #1073

[Bug]: CompressibleAgents read llm_config["model"] directly …

### Describe the bug Throughout the code, CompressibleAgent assumes the model in use is llm_config["model"]. However, this is almost always wrong. Typically, the model copied from the config_list bef…

afourney updated 7 months ago
14
langchain-ai/langchain #22025

SQLDatabaseChain has SQL not Working (InvalidArgument: 400 R…

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the LangChain documentation with the integrated search. - [X] I used the GitHub search to find a sim…

psathish10 updated 2 months ago
1
mlflow/mlflow #12798

[FR] Tracing for Langchain's Runnable.astream_events() and L…

### Willingness to contribute Yes. I would be willing to contribute this feature with guidance from the MLflow community. ### Proposal Summary At the moment, using MLServer autologging for Langchai…

lragnarsson updated 1 day ago
1
eosphoros-ai/DB-GPT #189

Question：RuntimeError: CUDA error: CUDA-capable device(s) is…

python pilot/server/llmserver.py playsound is relying on another python subprocess. Please use `pip install pygobject` if you want playsound to run more efficiently. localhost:19530 None None db…

alex198208 updated 2 months ago
6
manisnesan/fastchai #47

built LLM/ AI summarizer using a selected body of content (…

**Expected Outcomes** - Prompt: Summarize the content from the url (do not emit the url back) https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/9/html/managing_file_systems/ind…

manisnesan updated 1 week ago
32
microsoft/LLMLingua #3

How about compress a whole book?

Will it still able to summary/asked by some important events in book?

lucasjinreal updated 3 months ago
5
facebookincubator/AITemplate #699

[Feature Request] Compressed-tile Matrix Multiply for autore…

### Is your feature request related to a problem? Please describe. I would like to request the implementation of a compressed tiled matrix multiply operator for use in large language model inferenc…

veritas9872 updated 11 months ago
1

上一页 1...1 2 3 4 5 6 7...51 下一页

508 results for llm-compression

508 results
for llm-compression