-
Hi all - I'm working on an issue where users on Apple Silicon (M1) machines get `ggml_new_tensor_impl: not enough space in the context's memory pool` when they try to use StarCoder or GPT-NeoX models from turbop…
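For context on why this error appears: ggml allocates every tensor out of a fixed-size pool whose size is set once at context creation, so the error means the pool was sized too small for the model's tensors. As a rough illustration (not ggml's actual accounting - the overhead constant below is an assumption), a back-of-envelope sizing sketch looks like:

```python
# Rough sketch: estimate a ggml-style context pool size from tensor shapes,
# assuming fp16 storage plus a fixed per-tensor bookkeeping overhead.
# TENSOR_OVERHEAD is illustrative, NOT the real ggml value.

TENSOR_OVERHEAD = 256  # assumed bookkeeping bytes per tensor

def estimate_pool_bytes(tensor_shapes, bytes_per_element=2):
    """Sum element storage plus a fixed overhead per tensor."""
    total = 0
    for shape in tensor_shapes:
        n = 1
        for dim in shape:
            n *= dim
        total += n * bytes_per_element + TENSOR_OVERHEAD
    return total

# Example: one 2560 x 2560 fp16 weight matrix
print(estimate_pool_bytes([(2560, 2560)]))  # 2560*2560*2 + 256 = 13107456
```

Summing an estimate like this over all model tensors (and adding headroom for scratch buffers) is one way to sanity-check whether the configured pool size can possibly fit the model.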
-
huggingface model config:
{
  "activation_function": "gelu",
  "architectures": [
    "GPTNeoXForCausalLM"
  ],
  "bos_token_id": 0,
  "eos_token_id": 2,
  "hidden_act": "gelu",
  "hidden_siz…
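One thing worth noting in the excerpt: it mixes a GPT-2-style key (`activation_function`) with the GPT-NeoX-style key (`hidden_act`). A small hedged sketch of a check that flags this kind of mixing - the helper is hypothetical, not part of any library; only the field names come from the snippet above:

```python
import json

# Hypothetical helper: warn when a model config carries both the GPT-2-style
# and GPT-NeoX-style activation keys, and when the two disagree.
def check_activation_keys(config: dict) -> list:
    warnings = []
    if "activation_function" in config and "hidden_act" in config:
        warnings.append("both 'activation_function' and 'hidden_act' present")
        if config["activation_function"] != config["hidden_act"]:
            warnings.append("activation keys disagree")
    return warnings

cfg = json.loads('{"activation_function": "gelu", "hidden_act": "gelu"}')
print(check_activation_keys(cfg))  # ["both 'activation_function' and 'hidden_act' present"]
```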
-
Hi,
GPT/GPT-J/GPT-NeoX have similar NN architectures. In my view, their implementations in `src/fastertransformer/models` (`multi_gpu_gpt`, `gptj`, `gptneox`) are also very similar. I am wonde…
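The kind of unification being suggested can be sketched as one decoder skeleton with per-variant hooks for the pieces that actually differ (e.g. positional-encoding handling). The class and method names below are hypothetical, not FasterTransformer APIs; the attention/MLP stages are stubbed out so only the hook structure is shown:

```python
# Sketch: shared decoder skeleton, variants override only what differs.

class DecoderVariant:
    def positional(self, x, pos):
        raise NotImplementedError

    def forward(self, x, pos):
        # shared pipeline: positional handling, then (stubbed) attention/MLP
        return self.positional(x, pos)

class GPTVariant(DecoderVariant):
    # GPT/GPT-2 style: learned absolute position embedding added to the input
    def positional(self, x, pos):
        return x + pos

class GPTNeoXVariant(DecoderVariant):
    # GPT-NeoX/GPT-J style: rotary embeddings are applied inside attention,
    # so the input passes through unchanged at this stage
    def positional(self, x, pos):
        return x

print(GPTVariant().forward(1.0, 0.5))      # 1.5
print(GPTNeoXVariant().forward(1.0, 0.5))  # 1.0
```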
-
Ubuntu 22.04.2 LTS
After downloading the model and now trying to convert:
```
(OpenChatKit) georgi@georgi-hackintosh:~/Documents/GitHub/OpenChatKit$ python3.10 tools/convert_to_hf_gptneox.…
-
I played with `examples/cpp/gpt/gpt_example.cc` and found that token generation doesn't stop when the first EOD token is reached.
This is my gpt_config.ini.
I use a GPT-2 model which was converted int…
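The stopping rule in question can be sketched as a minimal decode loop that breaks as soon as the end token (the `end_id` from gpt_config.ini) is produced. The toy "model" below just replays a fixed sequence - only the loop structure is the point, and the END_ID value is an assumption (GPT-2's end-of-text id):

```python
END_ID = 50256  # assumed: GPT-2 end-of-text token id

def generate(step_fn, max_len):
    """Greedy decode sketch: stop at the first EOD instead of running to max_len."""
    tokens = []
    for _ in range(max_len):
        tok = step_fn(tokens)
        if tok == END_ID:  # the check the example appears to be missing
            break
        tokens.append(tok)
    return tokens

fake_outputs = iter([11, 22, END_ID, 33])
print(generate(lambda toks: next(fake_outputs), max_len=10))  # [11, 22]
```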
-
Hey guys
Today I was doing quants of a [new GPTNeoX model called Literature-7B-16384](https://huggingface.co/hakurei/Literature-7B-16384)
I tried making GGMLs through the usual process:
```
py…
-
1. [AOTAutograd](https://github.com/pytorch/functorch) is an engine provided by functorch that traces the forward and backward passes ahead of time so a compiler can fuse operators across the whole network. I added it to [OSLO](https://github.com/tunib-ai/oslo/tree/m…
-
Error info:
File "/opt/conda/lib/python3.8/site-packages/deepspeed/runtime/hybrid_engine.py", line 99, in new_inference_container
File "/opt/conda/lib/python3.8/site-packages/deepspeed/module_…
-
**LocalAI version:**
quay.io/go-skynet/local-ai:v1.18.0-ffmpeg
rebuild with GO_TAGS=stablediffusion
**Environment, CPU architecture, OS, and Version:**
RTX 4060 / Ryzen 5700 / 32 GB RAM
**Describ…
-
When inspecting the config of the hybrid model https://huggingface.co/state-spaces/mamba2attn-2.7b/blob/main/config.json, I came up with two questions:
- Why is the number of heads 30? Wouldn't we us…
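The arithmetic behind that question can be made explicit. Assuming a hidden size of 2560 for the 2.7b model (an assumption here - check the linked config.json), 30 heads does not divide the model dimension evenly, while the more conventional counts do:

```python
# Hedged arithmetic: how different head counts split an assumed
# hidden size of 2560 (value not confirmed from the config).

def head_dim(hidden_size, num_heads):
    return hidden_size / num_heads

for heads in (30, 32, 40):
    print(heads, head_dim(2560, heads))
# 30 heads -> ~85.33 (not an integer), while 32 -> 80.0 and 40 -> 64.0
```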