-
**Describe the bug**
I am using version 0.9.4, which includes the two PRs below, but CPU inference is not working:
CUDA optional deepspeed ops https://github.com/microsoft/DeepSpeed/pull/2507
Enabl…
-
Dear EleutherAI team,
I've noticed that the weights associated with the recently added "step0" and "step1" checkpoints are identical across all Pythia models:
```
def main():
    print(f"========…
```
-
**Describe the bug**
RuntimeError: The expanded size of the tensor (1) must match the existing size (10) at non-singleton dimension 2. Target sizes: [1, 4, 1, 10]. Tensor sizes: [1, 1, 10, 10]
F…
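For context, this error comes from broadcasting via `Tensor.expand`: only size-1 (singleton) dimensions can be expanded, and here dimension 2 already has size 10 while the target asks for 1. A minimal, model-independent reproduction with the exact shapes from the message:

```python
import torch

# Attention-mask-like tensor with the shape from the error message.
mask = torch.ones(1, 1, 10, 10)

# Expanding to [1, 4, 1, 10] fails: dimension 2 has size 10 (non-singleton),
# so it cannot be reshaped to 1 by expand; only size-1 dims may be broadcast.
try:
    mask.expand(1, 4, 1, 10)
except RuntimeError as e:
    print(e)
```

This usually points at an attention mask being built with its sequence dimensions in the wrong positions relative to what the model expects.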
-
First of all, congratulations on this project.
I am on the Ubuntu 22.04 platform.
I wanted to try https://huggingface.co/cakewalk/ggml-q4_0-stablelm-tuned-alpha-7b/blob/main/ggml-model-stablelm-tune…
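For background (not from the linked repo): `q4_0` in the filename refers to ggml's simplest 4-bit block quantization, where weights are stored in blocks of 32 values sharing one float scale and dequantized as scale · (nibble − 8). A rough sketch assuming that layout; `dequantize_q4_0` is a hypothetical helper, not ggml's actual C implementation:

```python
def dequantize_q4_0(scale, nibbles):
    """Dequantize one ggml q4_0 block.

    A block is 32 4-bit values (0..15) sharing a single float scale;
    values are centred by subtracting 8, so each weight is scale * (q - 8).
    """
    assert len(nibbles) == 32, "q4_0 blocks hold exactly 32 quantized values"
    return [scale * (q - 8) for q in nibbles]
```

The upshot is roughly 4.5 bits per weight (32 nibbles plus one scale per block), which is why the 7B file is a few GB.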
-
```
LlamaTokenizer.from_pretrained('KRAFTON/KORani-v1-13B')

309 def LoadFromFile(self, arg):
--> 310     return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg)

TypeError: not a s…
```
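For what it's worth, this `TypeError` from `SentencePieceProcessor_LoadFromFile` usually means the tokenizer file path resolved to something other than a `str` — for example `None` because `tokenizer.model` is missing from the repo, or a `pathlib.Path` object. A small defensive check one could run before loading; `check_sp_model_path` is a hypothetical helper, not part of sentencepiece:

```python
import os

def check_sp_model_path(model_path):
    """Validate the argument before handing it to SentencePieceProcessor.

    LoadFromFile raises 'TypeError: not a string' for non-str inputs
    (e.g. None or pathlib.Path), which hides the real problem: the
    tokenizer model file was never found or downloaded.
    """
    if not isinstance(model_path, str):
        raise TypeError(f"expected str path, got {type(model_path).__name__}")
    if not os.path.isfile(model_path):
        raise FileNotFoundError(f"tokenizer model not found: {model_path}")
    return model_path
```

In practice, checking whether the Hugging Face repo actually contains a `tokenizer.model` file is the first thing to verify.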
-
### Describe the bug
I am trying to fine-tune Llama-2 with raw text-file data.
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Reproduction
My llama file is t…
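As an aside, fine-tuning on a raw text file usually involves tokenizing the whole file and grouping the flat token stream into fixed-length blocks. A minimal sketch of just that grouping step; the hypothetical `chunk_tokens` below mirrors the usual `group_texts` pattern and is independent of any particular trainer:

```python
def chunk_tokens(token_ids, block_size):
    """Group a flat token stream into fixed-length training blocks.

    Drops the ragged remainder at the end, which is the conventional
    behaviour when preparing causal-LM training examples.
    """
    total = (len(token_ids) // block_size) * block_size
    return [token_ids[i:i + block_size] for i in range(0, total, block_size)]
```

Each resulting block then becomes one training example, with labels equal to the inputs for causal language modeling.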
-
Hi!
I am trying to fine-tune MPT-7B with LoRA configurations.

```
# Model
model_name = "mosaicml/mpt-7b"
config = transformers.AutoConfig.from_pretrained(
    model_name,
    trust_remote_code=…
```
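For context on what LoRA actually trains: the base weight W stays frozen and only a low-rank update is learned, computing y = xW + (α/r)·xAB. A minimal NumPy sketch of that forward pass; `lora_forward` is illustrative only, not the peft implementation:

```python
import numpy as np

def lora_forward(x, W, A, B, alpha):
    """Low-rank adapted linear layer: y = x W + (alpha / r) * x A B.

    x: (batch, d_in); W: (d_in, d_out), the frozen base weight;
    A: (d_in, r) and B: (r, d_out), the trainable low-rank update.
    B is initialized to zero, so training starts from the base model.
    """
    r = A.shape[1]
    return x @ W + (alpha / r) * (x @ A @ B)
```

With B initialized to zero the adapted layer is exactly the base layer at step 0, which is why LoRA fine-tuning is stable to start.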
-
Hi guys.
If I translate the datasets, will they work with Pygmalion? I want to translate the datasets into Portuguese.
-
Hi, I've tested mpt-7b-instruct and it does understand Chinese, but what confuses me is that the tokenizer mpt-7b uses does not support Chinese; it's English-only. So how should I understand this? Is …
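One likely explanation: MPT-7B reuses the GPT-NeoX byte-level BPE tokenizer, and byte-level tokenizers have no out-of-vocabulary symbols — any string falls back to its raw UTF-8 bytes, all of which are in the base vocabulary. Chinese text is therefore fully representable, just at more tokens per character than English. A sketch of the fallback idea; `byte_fallback_tokens` is illustrative, not the real tokenizer:

```python
def byte_fallback_tokens(text):
    """Worst-case byte-level tokenization of a string.

    A byte-level BPE vocabulary contains all 256 byte values, so even
    text with no learned merges (e.g. Chinese for an English-trained
    tokenizer) decomposes into UTF-8 bytes rather than <unk>.
    """
    return list(text.encode("utf-8"))
```

So "does not support Chinese" really means "was not trained to merge Chinese bytes efficiently", not "cannot encode Chinese".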
-
```
python3 Andromeda/build_dataset.py --seed 42 --seq_len 8192 --hf_account "" --tokenizer "EleutherAI/gpt-neox-20b" --dataset_name "EleutherAI/the_pile_deduplicated"
Traceback (most recent call las…
```