-
What can I do about this issue? I'm using:
MODEL_ID = "TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ"
MODEL_BASENAME = "model.safetensors"
The model 'LlamaGPTQForCausalLM' is not supported for text-gener…
-
Right now the spline code has sharp breaks where it meets the tails. This isn't always wrong, but I think it happens more often than it should. Perhaps there is a way to resolve this issue via penaliz…
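One common way to tame sharp breaks at the tails is to add a curvature (roughness) penalty, so the fit is pulled toward smoothness instead of chasing every point. A minimal numpy sketch of that idea — the function name, the `lam` value, and the toy data are illustrative, not taken from the original spline code:

```python
import numpy as np

def penalized_fit(y, lam=5.0):
    """Fit smoothed values f at the sample points by minimizing
    ||f - y||^2 + lam * ||D2 f||^2, where D2 is the second-difference
    operator (assumes roughly uniform spacing). Larger lam flattens
    the curve near the tails instead of letting it break sharply."""
    n = len(y)
    # Second-difference matrix: each row is [1, -2, 1]
    D2 = np.zeros((n - 2, n))
    for i in range(n - 2):
        D2[i, i:i + 3] = [1.0, -2.0, 1.0]
    # Normal equations of the penalized least-squares problem:
    # (I + lam * D2^T D2) f = y
    A = np.eye(n) + lam * D2.T @ D2
    return np.linalg.solve(A, y)

# Toy noisy data standing in for the real tail samples.
x = np.linspace(0, 1, 50)
y = np.sin(2 * np.pi * x) + 0.1 * np.random.default_rng(0).normal(size=50)
f = penalized_fit(y, lam=5.0)
```

Tuning `lam` trades fidelity for smoothness; the same effect is available off the shelf via the smoothing parameter `s` of `scipy.interpolate.UnivariateSpline`.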
-
## Bug report
When running large workflows, the main process gets interrupted with the error `UNKNOWN: channel closed`.
### Expected behavior and actual behavior
Expected: Normal execution…
-
I am getting poor-quality results with prompts longer than 2048 tokens when using a LoRA trained with alpaca_lora_4bit.
These are the settings I am using:
```
config = ExLlamaConfig(model_config_path) …
-
I have `INT8` quantized a `BERT` model for binary text classification and am only getting a marginal improvement in speed over `FP16`.
I am using the `transformer-deploy` library that utilizes Tens…
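Before attributing the small gap to INT8 itself, it can help to measure latency with a clean harness (warmup iterations, median over many runs); with GPU inference, forgetting to synchronize the device around the timed call is a classic source of misleading INT8-vs-FP16 numbers. A generic sketch — the two lambdas are placeholder workloads, not the actual BERT engines:

```python
import time
import statistics

def benchmark(fn, warmup=5, iters=50):
    """Return the median latency of fn() in milliseconds."""
    for _ in range(warmup):          # warm caches / lazy init before timing
        fn()
    times = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn()                         # for CUDA, synchronize before reading the clock
        times.append((time.perf_counter() - t0) * 1e3)
    return statistics.median(times)

# Placeholder workloads standing in for the FP16 / INT8 inference calls.
fp16_ms = benchmark(lambda: sum(i * i for i in range(20_000)))
int8_ms = benchmark(lambda: sum(i * i for i in range(10_000)))
print(f"fp16: {fp16_ms:.3f} ms, int8: {int8_ms:.3f} ms, "
      f"speedup: {fp16_ms / int8_ms:.2f}x")
```

If the measured kernels are already dominated by memory movement or host-side overhead rather than matmul compute, INT8 will show little benefit over FP16 regardless of the quantization quality.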
-
Hi,
I'm trying to create dense representations from my corpus and search paragraphs/phrases by keywords or a question. I don't have labeled questions and answers, and for now I don't need to get a…
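For unlabeled retrieval, a common pattern is: encode the corpus passages once with a pretrained encoder (e.g. a sentence-transformers model), encode the query the same way, then rank by cosine similarity. A minimal numpy sketch of the ranking step — the 4-dimensional toy vectors stand in for real embeddings, which would come from the encoder:

```python
import numpy as np

def search(query_vec, corpus_vecs, top_k=3):
    """Rank corpus vectors by cosine similarity to the query vector,
    returning (index, score) pairs, best first."""
    q = query_vec / np.linalg.norm(query_vec)
    c = corpus_vecs / np.linalg.norm(corpus_vecs, axis=1, keepdims=True)
    scores = c @ q                       # cosine similarity per passage
    idx = np.argsort(-scores)[:top_k]    # highest scores first
    return list(zip(idx.tolist(), scores[idx].tolist()))

# Toy "embeddings"; in practice each row is the encoding of a passage.
corpus = np.array([[1.0, 0.0, 0.0, 0.0],
                   [0.0, 1.0, 0.0, 0.0],
                   [0.7, 0.7, 0.0, 0.0]])
query = np.array([1.0, 0.05, 0.0, 0.0])
print(search(query, corpus))
```

Since no labels are needed for this, an off-the-shelf encoder is enough to start; at larger corpus sizes the brute-force dot product is usually replaced by an approximate nearest-neighbor index.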
-
I am loading the quantized TheBloke/Llama-2-70B-chat-GPTQ or TheBloke/Llama-2-70B-GPTQ model across multiple GPUs. The model loads, but the query throws an error:
```
ValueError: not en…
-
Hi, I get the error shown below when running:
```
quant_mat
```
-
My config looks like this:
```
base:
  seed: &seed 42
model:
  type: Qwen2
  path: /home/LLMCompression/model/Qwen2-7B # model path
  tokenizer_mode: slow
  torch_dtype: auto
calib:
  nam…
```
-
### System Info
Hello TensorRT-LLM team! 👋 I'm facing an issue where the inference output does not contain the expected "Singapore" text. Below are the details of my setup and steps to reproduce the …