int4 Search Results - Githubissues

1000+ results
for int4

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vllm-project/vllm #1100

gptq Qwen-7B-Chat-Int4 load_error

qwen.py", line 303, in load_weights param = state_dict[name] KeyError: 'transformer.h.0.attn.c_attn.g_idx'

Cloopen-ReLiNK updated 2 days ago
15
pytorch/ao #1117

int4wo can't use same packed weight for cpu and cuda

This is mostly to keep track of this problem which has been around for a while if you ever do something like 1)quantize cpu model with int4, 2)move it to cuda then the output of the model will be …

HDCharles updated 1 day ago
14
NetEase-FuXi/EETQ #32

aarch64/ arm64 support

Is there a plan to support arm? I have a gh200 and would like to use EETQ for quantization. Bitsandbytes supports Int4 but int8 on GH200 is not available

khayamgondal updated 1 week ago
1
microsoft/onnxruntime-genai #986

Extra Options use_qdq flags work incorrectly

**Describe the bug** When building a model, `--extra_options use_qdq=1` and `--extra_options use_qdq=0` yield the same binary model.onnx (compared using `diff`) which differs from the `model.onnx` wh…

aendk updated 2 weeks ago
1
cdiddy77/react-native-llm-mediapipe #13

Using expo file system to download and cache model, using fi…

The app crashes when using modelPath after downloading the file from network. ``` const llmInference = useLlmInference({ storageType: 'file', modelPath: '/data/user/0/com.offlinellmpoc/fi…

nadeem-portico updated 3 weeks ago
7
dbeaver/dbeaver #35597

Grouping feature doesn't allow to rearrange double quoted co…

### Description It works correctly for the second table, but not the first. use code: ``` drop table if exists hello1; drop table if exists hello2; CREATE TABLE hello1 ( "Hello" int4 …

ask9 updated 1 month ago
1
meta-llama/llama-stack #347

Model ids that contains a colon throws error when trying to …

### System Info Windows 11 Python 3.12.7 (and 3.12.5 apparently depending on running py --version or python --version from PowerShell) ### Information - [X] The official example scripts …

Sandstedt updated 3 days ago
1
vllm-project/vllm #5663

[Bug]: Qwen2-72B-Instruct-gptq-int4 Repetitive issues

### Your current environment ```text The output of `python collect_env.py` ``` ### 🐛 Describe the bug Machine A800, VLLM 0.5.0, PROMPT=开始, output max tokens = 2048, Temperature sets 0.7 VLLM…

Storm0921 updated 1 week ago
2
elastic/elasticsearch #115475

[CI] RcsCcsCommonYamlTestSuiteIT test {p0=search.vectors/42_…

**Build Scans:** - [elasticsearch-periodic-platform-support #4521 / oraclelinux-8_platform-support-unix](https://gradle-enterprise.elastic.co/s/mhjo3pknmbdc2) - [elasticsearch-pull-request #37673 / pa…

elasticsearchmachine updated 1 week ago
4
langgenius/dify #10200

{"arxiv_search": "tool invoke error: Model models/Qwen/Qwen2…

### Self Checks - [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [X] I have s…

yyf0910 updated 5 hours ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for int4

1000+ results
for int4