-
### Context and Issue
I'm attempting to quantize the model [alpindale/goliath-120b](https://huggingface.co/alpindale/goliath-120b) using [royallab/PIPPA-cleaned](https://huggingface.co/datasets/royal…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
I'm using exllamav2 for the first time. I built from source today (minus the ~3 commits that break Windows builds). I saw some strange behavior and bisected it to the commit in the title.
Expected:…
-
Hi, I tried the new quant method (`master` branch) with goliath 120b using the built-in calibration dataset (not specifying -c parameter).
`-b 3.0 -hb 8 -rs 1.0`
`# Module quantized, calibration pe…
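For reference, the invocation described above can be sketched as follows. This is a hedged sketch: the `-i`/`-o` input and output flags and the placeholder paths are assumptions, while `-b 3.0 -hb 8 -rs 1.0` are taken from the report; `-c` is deliberately omitted so the built-in calibration dataset is used.

```shell
# Hedged sketch of the quantization run (paths are placeholders):
python convert.py \
    -i /path/to/goliath-120b \   # input FP16 model directory (assumed flag)
    -o /path/to/work-dir \       # working/output directory (assumed flag)
    -b 3.0 \                     # target bits per weight, as reported
    -hb 8 \                      # head bits, as reported
    -rs 1.0                      # as reported; no -c, so built-in calibration data is used
```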
-
Important note: it works only with Vulkan, not with ROCm, even though I have installed ROCm 5.5, which does support the RX 7900 XTX: https://rocm.docs.amd.com/en/docs-5.5.1/release/windows_support.…
-
**Purpose**: This issue compiles meeting notes for the Gno Core Staff's recurring meetings.
**Process**:
1. **Drafting**: Notes are initially taken in Hackmd or Google Docs during meetings.
2. *…
-
I am hitting the failure below very frequently, roughly 1 out of 20 times:
```
ws = connect(
/usr/local/python/python-3.10/std/lib64/python3.10/site-packages/websockets/sync/client.py:289: in connect
conn…
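A common mitigation for intermittent connection failures like this is a retry with exponential backoff around the connect call. A minimal sketch of the pattern, using a stub `flaky_connect` standing in for `websockets.sync.client.connect` (the helper name and parameters are hypothetical, not part of the websockets API):

```python
import time

def connect_with_retry(connect, attempts=3, delay=0.01):
    """Call a flaky connect() callable, retrying with exponential backoff."""
    last_exc = None
    for i in range(attempts):
        try:
            return connect()
        except OSError as exc:  # connection failures are OSError subclasses
            last_exc = exc
            time.sleep(delay * (2 ** i))  # back off: delay, 2*delay, 4*delay, ...
    raise last_exc

# Stub standing in for the real connect(): fails twice, then succeeds.
calls = {"n": 0}
def flaky_connect():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionRefusedError("simulated transient failure")
    return "ws-connection"

print(connect_with_retry(flaky_connect))  # -> ws-connection
```

The same wrapper can be applied to the real `connect(...)` call; tune `attempts` and `delay` to the observed failure rate.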
-
When attempting to split the model on multiple GPUs, I get the following error:
```
> python test_chatbot.py -d /home/john/Projects/Python/text-models/text-generation-webui/models/TheBloke_guanaco…
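For context, a hedged sketch of what a multi-GPU invocation typically looks like here, assuming exllama's `-gs` (GPU split) flag, which takes a comma-separated list of per-device VRAM allocations in GB; the flag semantics and the placeholder path are assumptions, not taken from the report:

```shell
# Hedged sketch (flag semantics assumed): ~16 GB on GPU 0, ~22 GB on GPU 1
python test_chatbot.py -d /path/to/model -gs 16,22
```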
-
Hi there, thanks for all the hard work.
My system has 2x4090.
First, the FP16 model works when loaded with bitsandbytes 4-bit, at decent speeds.
`Output generated in 81.62 seconds (4.47 tokens/s, 36…