-
Currently, multi-LoRA supports only Llama and Mistral architectures. We should extend this functionality to all architectures.
Yi, Qwen, Phi and Mixtral architectures seem to be the most demanded r…
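For reference, the LoRA computation itself is architecture-agnostic; per-architecture support is mostly about wiring adapters into each model's projection layers (whose module names differ by architecture). A minimal NumPy sketch of the underlying math, illustrative only and not any framework's implementation:

```python
import numpy as np

class LoRALinear:
    """Minimal LoRA sketch: y = x W^T + (alpha/r) * x A^T B^T.
    Only A and B would be trained; the base weight W stays frozen."""
    def __init__(self, in_features, out_features, r=4, alpha=8, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.standard_normal((out_features, in_features)) * 0.02  # frozen base weight
        self.A = rng.standard_normal((r, in_features)) * 0.01             # low-rank factor A
        self.B = np.zeros((out_features, r))                              # B starts at zero, so the
        self.scaling = alpha / r                                          # adapter is a no-op initially

    def __call__(self, x):
        return x @ self.W.T + self.scaling * (x @ self.A.T) @ self.B.T
```

Because the adapter path is just an additive low-rank delta on a linear layer, extending multi-LoRA to a new architecture is chiefly a matter of registering that architecture's linear-projection module names.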
-
Hi all - I'm working on an issue where users on Apple M1 silicon get `ggml_new_tensor_impl: not enough space in the context's memory pool` when they try to use starcoder or gptneox models from turbop…
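That error comes from ggml's fixed-size context allocator: every tensor is carved out of one buffer whose size is fixed at `ggml_init`, and the message means that buffer was sized too small for the model being loaded. A rough, hedged sketch of the sizing arithmetic (the per-tensor overhead constant here is an assumption, not ggml's actual value):

```python
TENSOR_OVERHEAD = 256  # bytes of bookkeeping per tensor object; illustrative assumption

def pool_size_needed(shapes, dtype_size=2):
    """Estimate the context pool size for a list of tensor shapes,
    assuming fp16 (2 bytes) elements by default."""
    total = 0
    for shape in shapes:
        n = 1
        for dim in shape:
            n *= dim
        total += n * dtype_size + TENSOR_OVERHEAD
    return total
```

If the pool the loader allocates is smaller than this kind of estimate for the model's tensors, `ggml_new_tensor_impl` fails with exactly that message.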
-
```python
'''
Goose AI
pip install openai
Uses GPT-NeoX 20B to generate text.
input_query - A string, the input query (e.g. "what is a dog?")
output - A string, the generated text
ope…
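# The truncated wrapper above presumably finishes its docstring and makes a
# completion call. Below is a hedged sketch of the pieces it describes:
# the "gpt-neo-20b" engine id and the api_base are assumptions about
# Goose AI's OpenAI-compatible API, so verify both against their docs.
def build_request(input_query, max_tokens=64):
    """Build the parameters for a Goose AI completion request."""
    return {
        "engine": "gpt-neo-20b",   # assumed Goose AI engine id for GPT-NeoX 20B
        "prompt": input_query,
        "max_tokens": max_tokens,
    }

# With the openai package this would be sent roughly as:
#   import openai
#   openai.api_base = "https://api.goose.ai/v1"  # assumed endpoint
#   openai.api_key = "YOUR_KEY"
#   completion = openai.Completion.create(**build_request("what is a dog?"))
#   output = completion.choices[0].text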
-
Hi, I want to fine-tune the 7b model. Am I supposed to download the provided checkpoint and fine-tune it as shown in this repo: https://github.com/EleutherAI/gpt-neox#using-custom-data ? Would they be…
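Roughly, yes: per the linked instructions, custom data goes in as JSONL with one JSON object per line holding a "text" field, which `tools/preprocess_data.py` then tokenizes into the binary format training reads. A small sketch of producing that file (the "text" field name follows the repo's documented format):

```python
import json

def write_training_jsonl(docs, path):
    """Write documents in the one-JSON-object-per-line format that
    gpt-neox's tools/preprocess_data.py consumes."""
    with open(path, "w", encoding="utf-8") as f:
        for doc in docs:
            f.write(json.dumps({"text": doc}, ensure_ascii=False) + "\n")

def read_training_jsonl(path):
    """Read the documents back (round-trip check)."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line)["text"] for line in f]
```

After writing the JSONL, the repo's preprocessing script is pointed at it to produce the tokenized dataset the training config references.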
-
# Overview
- Implement code applying 8-bit quantization to the GPT2/GPTJ models using BitsAndBytes
- Based on https://huggingface.co/hivemind/gpt-j-6B-8bit
- Replace the GPTJ model with the kogpt 6B model
- For the GPT2 model, the attention and fc layers are not nn.Linear but …
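The core of the 8-bit scheme in the referenced gpt-j-6B-8bit approach is row-wise absmax quantization, which can be sketched in NumPy; this is illustrative, not the BitsAndBytes implementation:

```python
import numpy as np

def quantize_absmax(w):
    """Row-wise absmax int8 quantization: scale each row so its
    largest magnitude maps to 127."""
    scale = np.abs(w).max(axis=1, keepdims=True) / 127.0
    scale[scale == 0] = 1.0  # avoid division by zero on all-zero rows
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximate float weight from int8 values and per-row scales."""
    return q.astype(np.float32) * scale
```

For GPT2 this has to be applied to its Conv1D-style layers as well, which is why the nn.Linear-only assumption in the reference code needs adjusting.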
-
If you are submitting a bug report, please fill in the following details and use the tag [bug].
**Describe the bug**
Gemma-2-{size} is not loadable using from_pretrained. I checked OFFICIAL_MODEL_…
-
I'm getting a `RuntimeError: CUDA error: an illegal memory access was encountered`
using FlashAttention with a GPT-NeoX-esque model. I
```python
from transformers import AutoConfig
import torch
from…
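# A common culprit for illegal memory accesses in FlashAttention kernels is
# an unsupported input: typical published constraints are fp16/bf16 dtypes
# and head dims up to 128, often in multiples of 8. Exact limits vary by
# flash-attn version, so treat the numbers below as assumptions. A hedged
# pre-flight check:
def check_flash_attn_inputs(head_dim, dtype, max_head_dim=128):
    """Return a list of likely compatibility problems (empty if none found)."""
    problems = []
    if dtype not in ("float16", "bfloat16"):
        problems.append(f"unsupported dtype {dtype}; use fp16/bf16")
    if head_dim % 8 != 0:
        problems.append(f"head_dim {head_dim} is not a multiple of 8")
    if head_dim > max_head_dim:
        problems.append(f"head_dim {head_dim} exceeds {max_head_dim}")
    return problems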
-
### Describe the bug
```shell
MODEL_ID="/models/models--EleutherAI--gpt-neox-20b"
mkdir saved_results_gpt_neox
python run_gpt-neox_int8.py --ipex-weight-only-quantization --output-dir "saved_results_gpt_neo…
```
-
1) Convert the Robin model to an HF checkpoint. For this you need to extend the GPT-NeoX class in HF, add a CLIP encoder and an adapter to it, and adapt the conversion script to include the clip and adapter weig…
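The conversion-script change in step 1 amounts to merging the extra weights into the converted state dict. A hedged sketch, where the `clip_encoder` and `adapter` prefixes are hypothetical and must match whatever attribute names the extended GPT-NeoX class actually uses:

```python
def merge_state_dicts(neox_sd, clip_sd, adapter_sd):
    """Combine a converted GPT-NeoX state dict with CLIP-encoder and
    adapter weights under hypothetical module prefixes."""
    merged = dict(neox_sd)
    merged.update({f"clip_encoder.{k}": v for k, v in clip_sd.items()})
    merged.update({f"adapter.{k}": v for k, v in adapter_sd.items()})
    return merged
```

The merged dict can then be loaded into the extended class with `load_state_dict`, provided the prefixes line up with its submodule names.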
-
How would I convert this into the ggml format?
https://huggingface.co/andreaskoepf/pythia-2.8b-gpt4all-pretrain/tree/main
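The ggml repo ships per-architecture conversion scripts (including a gpt-neox example) that load the HF checkpoint and emit ggml's binary format: a magic number, hyperparameters, the vocab, then the tensors. A hedged sketch of the header-writing step; the field set and order here are illustrative, so check the actual script before relying on this layout:

```python
import struct

GGML_MAGIC = 0x67676d6c  # the "ggml" magic that ggml-format files begin with

def write_ggml_header(fout, n_vocab, n_ctx, n_embd, n_head, n_layer, ftype):
    """Write a ggml-style header: magic, then int32 hyperparameters.
    Field order is an assumption modeled on the conversion scripts."""
    fout.write(struct.pack("i", GGML_MAGIC))
    for value in (n_vocab, n_ctx, n_embd, n_head, n_layer, ftype):
        fout.write(struct.pack("i", value))
```

For a Pythia/GPT-NeoX checkpoint like the one linked, the gpt-neox example script in the ggml repo is the usual starting point rather than writing the format by hand.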