-
### System Info
Transformers = 4.31
Torch = 2.0.1
CUDA = 11.8
Python = 3.10
A100 GPU (80 GB)
### Who can help?
@ArthurZucker , @younesbelkada , @gante
### Information
- [ ] The official example script…
-
**Describe the bug**
Following #2547, I tried to run the model gpt-neoxt-chat-base-20b, which I believe is a GPT-NeoX-20B derivative, so it should work.
Inference works if the model is loaded the n…
-
Many Chinese users are now building their own LangChain applications on top of the ChatGLM model series, and the chatglm.cpp project already provides GGML support. This should arguably fall under the native_int4 category as well, so could bigdl.llm.langchain support the BigdlNativeEmbeddings API for this model family?
-
import os
import pickle
from typing import List
from dataclasses import field, dataclass
from utils import set_default_to_empty_string
FOLDER_ROOT = (
os.path.abspath(os.path.dirname(os.pa…
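The `FOLDER_ROOT` expression above is cut off. For illustration only, a common pattern for this kind of expression (an assumption, not the original code) resolves the directory containing the current file as the project root:

```python
import os

# Hypothetical reconstruction for illustration -- the original expression is
# truncated, so this is only an assumed common pattern, not the actual code:
# take the absolute path of the directory containing this file.
FOLDER_ROOT = os.path.abspath(os.path.dirname(__file__))

print(FOLDER_ROOT)  # an absolute directory path
```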
-
**LocalAI version:**
According to git, the last commit is from Sun Sep 3 02:38:52 2023 -0700 with the message "added Linux Mint".
**Environment, CPU architecture, OS, and Version:**
Linux instance-7 6.…
-
I have 4 GPUs and 3 models, called small, medium, and large. I want to deploy the small model on GPU 0, the medium model on GPU 1, and the large model on GPUs 2 and 3 with tensor_para_size=2, because the large model is…
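One way to sketch the intended placement (the deployment table and helper below are assumptions for illustration, not any serving framework's actual API) is to derive a per-model `CUDA_VISIBLE_DEVICES` string and launch one process per model:

```python
# Hypothetical sketch of the desired GPU placement; the dict layout and the
# helper name are assumptions, not part of any real serving framework.
deployments = {
    "small":  {"gpus": [0],    "tensor_para_size": 1},
    "medium": {"gpus": [1],    "tensor_para_size": 1},
    "large":  {"gpus": [2, 3], "tensor_para_size": 2},
}

def visible_devices(model_name: str) -> str:
    """Build the CUDA_VISIBLE_DEVICES value for one model's process."""
    return ",".join(str(g) for g in deployments[model_name]["gpus"])

# Each model would then run in its own process, e.g. for "large":
#   CUDA_VISIBLE_DEVICES=2,3 <launch command> --tensor_para_size 2
```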
-
## Description
Finetune T5 and GPTNeoX using HateCheck data.
## Steps
- [x] Experiment with fine-tuning an already pre-trained T5 on tasks such as "summarize:", but using hateful data.
- [x] Try other p…
-
When inspecting the config of the hybrid model https://huggingface.co/state-spaces/mamba2attn-2.7b/blob/main/config.json, two questions came to mind:
- Why is the number of heads 30? Wouldn't we us…
-
**LocalAI version:**
#895
**Environment, CPU architecture, OS, and Version:**
sh-5.2$ uname -a
MSYS_NT-10.0-19045 DESKTOP-S7HQITA 3.4.7-ea781829.x86_64 2023-07-05 12:05 UTC x86_64 Msys
…
-
I followed the settings to set the value display limit to `1024`, but the values actually displayed do not match the size of the weights (`arg0_1`):
This is from the quick_start example Colab.