llm-compression Search Results

615 results
for llm-compression

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

getnamo/SocketIOClient-Unreal #352

Assertion failed: (Index >= 0) & (Index < ArrayNum) [File:R…

![图片](https://user-images.githubusercontent.com/13312038/202338124-c4ff5c1e-5f54-46ef-8c2b-01923a20ee7b.png) Hi,getnamo: I try to use ToBytes(SoundWave) function to convert soundwave to bytes.…

pythonclound updated 1 year ago
5
openvinotoolkit/openvino_notebooks #1723

(llm-chatbot notebook) Weight compression using Optimum Inte…

In the llm-chatbot notebook - https://github.com/openvinotoolkit/openvino_notebooks/blob/main/notebooks/254-llm-chatbot/254-llm-chatbot.ipynb, within the section "Weight Compression using Optimum Inte…

HLneoh updated 7 months ago
2
GoogleCloudPlatform/generative-ai #571

[Bug]: InactiveRPCError when sending base64 encoded mp3 data

### File Name gemini/getting-started/intro_gemini_1_5_pro.ipynb ### What happened? The below is mentioned in the notebook for Gemini 1.5 Pro, but unfortunately all examples are for Google Cloud hos…

bent-verbiage updated 6 months ago
2
run-llama/llama_index #10389

[Question]: How to do Multi-Document RAG with Document Summa…

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question Using open source LLMs `zephyr-7b-alpha`, how can I do multiple document RAG especially …

GxTeo updated 5 months ago
3
NVIDIA/TensorRT-LLM #1106

AWQ config parameter.

[AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration](https://arxiv.org/pdf/2306.00978.pdf) The original study compares performance against the number of 'fraction of weigh…

matichon-vultureprime updated 7 months ago
3
microsoft/LLMLingua #137

[Bug]: AssertionError when executing Code.ipynb

### Describe the bug When I executed code.ipynb, AssertionError happened after waiting for 2-3 mins: ``` from llmlingua import PromptCompressor import os os.environ['CUDA_VISIBLE_DEVICES'] = "1…

maxcccc updated 5 months ago
3
zju-pi/Knowledge-Distillation-Paper #1

Missing paper

Great work. I would like to introduce two papers: Name: Weight-Inherited Distillation for Task-Agnostic BERT Compression paper: code: https://github.com/wutaiqiang/WID-NAACL2024 Blog: https:/…

wutaiqiang updated 6 months ago
8
langflow-ai/langflow #2735

problems with Vertex AI credentials on windows 11 pro

### Bug Description Hello, I do have strange error, I am using Windows 11 Pro, And while trying to use Credentials file on VertexAI, I am always getting 'Error Building Component Error build…

severfire updated 2 months ago
13
rnadigital/agentcloud #166

Retrieval strategies

Currently we use agents to craft the query, we would either like more standard retrieval strategies that can be added out of the box in our RAG connector (TS, Python, Mongo). Using langchain Name | …

anada10 updated 6 months ago
1
hkust-nlp/llm-compression-intelligence #5

diff between code and paper

![image](https://github.com/hkust-nlp/llm-compression-intelligence/assets/24559732/0c1e36a4-52e2-49d6-a388-ef0618d1aa2f) ![image](https://github.com/hkust-nlp/llm-compression-intelligence/assets/2455…

jiangh0 updated 5 months ago
14

上一页 1...33 34 35 36 37 38 39...62 下一页

615 results for llm-compression

615 results
for llm-compression