-
The TinyLlama project aims to pretrain a 1.1B-parameter Llama model on 3T tokens, which should make it an ideal draft model for speculative inference.
https://github.com/jzhang38/TinyLlama
https://huggingfac…
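For context, transformers exposes this pattern as assisted generation. A minimal sketch, assuming a recent transformers release that supports the `assistant_model` argument; both checkpoint names are placeholders, and the draft must share the target's tokenizer, which holds within the Llama family:
``` python
# Minimal sketch of speculative ("assisted") decoding in transformers.
# Checkpoint names are placeholders, not a recommendation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

target_name = "meta-llama/Llama-2-7b-hf"            # placeholder target
draft_name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"   # placeholder draft

tokenizer = AutoTokenizer.from_pretrained(target_name)
target = AutoModelForCausalLM.from_pretrained(target_name, torch_dtype=torch.float16)
draft = AutoModelForCausalLM.from_pretrained(draft_name, torch_dtype=torch.float16)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
# The draft proposes several tokens per step; the target verifies them in a
# single forward pass, preserving the target's output distribution.
out = target.generate(**inputs, assistant_model=draft, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```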
-
### 🐛 Describe the bug
``` python
%env PYTORCH_ENABLE_MPS_FALLBACK=1
import transformers
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
assert torch.backends.mps.is_available()
```
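The repro is cut off above; a hypothetical continuation under the same imports might look like the following, with a deliberately small placeholder checkpoint:
``` python
# Hypothetical continuation of the truncated repro; "gpt2" is a placeholder.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).to("mps")

inputs = tokenizer("Hello, my name is", return_tensors="pt").to("mps")
# With PYTORCH_ENABLE_MPS_FALLBACK=1, ops unsupported on MPS fall back to CPU.
out = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```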
-
## Background
Replicate and visualize the results of https://arxiv.org/abs/2104.07143.
## What to Replicate?
## Modifications
## Related Papers/Frameworks
-
### Describe the bug
After I type the first prompt, both my own prompt and the assistant's response (the screen shows *Typing...*) vanish.
Traceback:
```
Loading checkpoint shards: 100%|████████████…
```
-
Environment
```
gpu: 4*A100 80G
pytorch: 1.13.1
cuda version: 11.7
deepspeed: 0.9.0
transformers: 4.28.0.dev
```
Run script
```bash
OUTPUT=$1
ZERO_STAGE=3
if [ "$OUTPUT" == "" ]; then
OUTPUT=./…
```
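For reference, a minimal sketch of what a ZeRO stage-3 setup like this boils down to in code; the config values here are illustrative assumptions, not the truncated script's actual settings:
``` python
# Illustrative ZeRO stage-3 initialization; all values are assumptions.
import torch
import deepspeed

ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-5}},
    "zero_optimization": {"stage": 3},  # partitions params, grads, optimizer state
}

model = torch.nn.Linear(512, 512)  # stand-in for the real model
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```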
-
Add option to load multiple datasets
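One way such an option could work, sketched against the Hugging Face `datasets` API; the dataset names are placeholder examples, not the project's actual implementation:
``` python
# Sketch: loading several datasets and concatenating them into one corpus.
# The (name, config) pairs are placeholders; concatenate_datasets requires
# that all parts share the same features (here, a single "text" column).
from datasets import load_dataset, concatenate_datasets

specs = [("wikitext", "wikitext-2-raw-v1"), ("wikitext", "wikitext-103-raw-v1")]
parts = [load_dataset(name, config, split="train") for name, config in specs]
combined = concatenate_datasets(parts)
print(combined.num_rows)
```
`interleave_datasets` would be the alternative if sampling proportions matter more than plain concatenation.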
-
Our Github source code dataset is based on [the deduplicated stack](https://huggingface.co/datasets/bigcode/the-stack-dedup) filtered down to only include numerical computing, computer algebra, and fo…
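For anyone reproducing that filtering step, per-language slices of the deduplicated Stack can be loaded like this; the language directory is just an example, and the dataset is gated, so its terms must be accepted on the Hub first:
``` python
# Sketch: pulling one language slice of the deduplicated Stack.
# "data/fortran" is an example directory; the dataset is gated, so accept
# its terms on the Hub and log in (huggingface-cli login) beforehand.
from datasets import load_dataset

stack = load_dataset(
    "bigcode/the-stack-dedup",
    data_dir="data/fortran",
    split="train",
)
print(stack.num_rows)
```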
-
See https://www.lesswrong.com/posts/EHbJ69JDs4suovpLw/testing-palm-prompts-on-gpt3.
Try 2-, 3-, or 4-shot inference on something like JT, NeoX-20B, or Galactica.
After we find a promising …
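A minimal sketch of what k-shot prompting looks like with transformers, using a deliberately small placeholder checkpoint in place of the 20B-scale models named above:
``` python
# Sketch of 3-shot prompting; the checkpoint is a small placeholder that
# loads the same way the larger models would, just with far less memory.
from transformers import pipeline

generator = pipeline("text-generation", model="EleutherAI/pythia-160m")

shots = [
    ("Translate to French: cat", "chat"),
    ("Translate to French: dog", "chien"),
    ("Translate to French: bird", "oiseau"),
]
prompt = "".join(f"Q: {q}\nA: {a}\n\n" for q, a in shots)
prompt += "Q: Translate to French: horse\nA:"
print(generator(prompt, max_new_tokens=5)[0]["generated_text"])
```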
-
I want to try gpt-neox to build a tiny GPT model.
I would like to train it on some plaintext files.
I converted them into a JSONL file and... I'm stuck at tokenization.
I don't know how to do it. I hav…
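A sketch of the usual path, assuming gpt-neox's standard JSONL layout of one `{"text": ...}` object per line; the directory and file names are placeholders, and the exact preprocessing flags should be verified against `tools/preprocess_data.py --help` in your checkout:
``` python
# Sketch: converting plaintext files to the JSONL layout gpt-neox expects,
# one {"text": ...} object per line. Paths are placeholders.
import json
from pathlib import Path

with open("train.jsonl", "w", encoding="utf-8") as out:
    for path in sorted(Path("plaintext").glob("*.txt")):
        out.write(json.dumps({"text": path.read_text(encoding="utf-8")}) + "\n")

# Tokenization into the binary training format is then done by the repo's
# preprocessing script, roughly (check --help in your checkout for flags):
#   python tools/preprocess_data.py \
#       --input train.jsonl --output-prefix train \
#       --vocab-file gpt2-vocab.json --merge-file gpt2-merges.txt \
#       --tokenizer-type GPT2BPETokenizer --append-eod
```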
-
Hi. PEFT is amazing. Thank you for sharing this package with us.
However, when I use the fp16 training option with Accelerate's DeepSpeed ZeRO-3 integration and PEFT LoRA, an error occurs.
How can I handle t…
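For reference, a minimal sketch of the PEFT LoRA setup being described; the base model and LoRA hyperparameters are placeholder assumptions, not the reporter's actual configuration:
``` python
# Sketch of a PEFT LoRA setup; base model and hyperparameters are placeholders.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2", torch_dtype=torch.float16)
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
# Training then runs under `accelerate launch` with a DeepSpeed ZeRO-3 config;
# fp16 is enabled in that config rather than here.
```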