-
Thank you for contributing such excellent work.
I notice that bloomz-* models outperform bloom-* via instruction tuning. I want to build a new bloomz-* model on top of a bloom model (e.g. bloom-1b7 -> bloomz-1b7-…
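For anyone looking for a starting point, below is a minimal sketch of supervised instruction tuning with `transformers`. The dataset id `bigscience/xP3`, its `en` config, the `inputs`/`targets` column names, and all hyperparameters are assumptions for illustration, not a verified bloomz recipe:
```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "bigscience/bloom-1b7"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Take a small slice of a prompted instruction dataset for illustration
# (xP3 is what the bloomz models were tuned on; config/column names assumed).
dataset = load_dataset("bigscience/xP3", "en", split="train[:1%]")

def tokenize(example):
    # Concatenate prompt and target so the causal-LM loss covers the answer.
    text = example["inputs"] + example["targets"] + tokenizer.eos_token
    return tokenizer(text, truncation=True, max_length=512)

tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

# For causal-LM fine-tuning, labels are just the input ids (shifted internally).
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="bloomz-1b7-custom",
        per_device_train_batch_size=4,
        num_train_epochs=1,
        learning_rate=2e-5,
        fp16=True,
    ),
    train_dataset=tokenized,
    data_collator=collator,
)
trainer.train()
```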
-
### System Info
- `transformers` version: 4.27.1
- Platform: Linux-4.18.0-240.el8.x86_64-x86_64-with-glibc2.2.5
- Python version: 3.8.12
- Huggingface_hub version: 0.13.3
- PyTorch version (GPU…
-
### 🐛 Describe the bug
Using the default rm_static dataset:
train_data: 75000
pretrain_model: bloomz-1b1
batch_size: 8
max_epochs: 4
max_len: 256
machine: 2× V100 32GB
loss_fn: log_sig (see the loss sketch below)
after…
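For context, `log_sig` is typically the pairwise ranking loss used for reward-model training. A minimal sketch of that common formulation (not necessarily the exact implementation in this repo):
```python
import torch
import torch.nn.functional as F

def log_sig_loss(chosen_rewards: torch.Tensor, rejected_rewards: torch.Tensor) -> torch.Tensor:
    """Pairwise ranking loss: -log(sigmoid(r_chosen - r_rejected))."""
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Example: scalar rewards produced by the reward model for a batch of pairs.
chosen = torch.tensor([1.2, 0.3, 0.8])
rejected = torch.tensor([0.5, 0.7, -0.1])
print(log_sig_loss(chosen, rejected))
```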
-
### System Info
Hi,
We observe inconsistent results when running the 'bloom' model under different dtypes (float16, float32).
Is that a bug? A minimal repro sketch follows the environment info below.
Environment:
- `transformers` version: 4.29.1
- Platform: …
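A hedged sketch of how such a comparison can be reproduced (the checkpoint, prompt, and generation settings are placeholders, not the reporter's exact setup):
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloom-560m"  # placeholder; any bloom checkpoint
device = "cuda"  # fp16 matmuls generally need a GPU
tokenizer = AutoTokenizer.from_pretrained(model_name)
inputs = tokenizer("The capital of France is", return_tensors="pt").to(device)

for dtype in (torch.float32, torch.float16):
    model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=dtype).to(device)
    with torch.no_grad():
        out = model.generate(**inputs, max_new_tokens=20, do_sample=False)
    # Small numerical drift between fp16 and fp32 is expected; large divergence
    # in the generated text is what this report is about.
    print(dtype, tokenizer.decode(out[0], skip_special_tokens=True))
```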
-
int4 error: RuntimeError: self and mat2 must have the same dtype
Training parameters:
CUDA_VISIBLE_DEVICES=0 python src/train_sft.py \
--model_name_or_path /models/bloomz-7b1-mt \
--do_train \
--dataset…
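This dtype mismatch often comes from mixing half-precision activations with full-precision weights somewhere in the quantized path. A hedged sketch of loading the model in 4-bit with a consistent compute dtype via `BitsAndBytesConfig` (whether `train_sft.py` exposes these exact options is an assumption):
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Keep the 4-bit compute dtype consistent with the rest of the model;
# mixing float16 compute with float32 adapter weights is a common source of
# "self and mat2 must have the same dtype".
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
    bnb_4bit_quant_type="nf4",
)

model = AutoModelForCausalLM.from_pretrained(
    "/models/bloomz-7b1-mt",
    quantization_config=bnb_config,
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("/models/bloomz-7b1-mt")
```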
-
A large part of building the assistant is teaching it to follow instructions. While training with RLHF seems like the main ingredient, there are already prepared supervised instruction-following datase…
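For example, pulling one of those prepared datasets takes a couple of lines with `datasets` (the dataset id below is just one illustration):
```python
from datasets import load_dataset

# databricks-dolly-15k is one example of a prepared instruction-following
# dataset; any similar dataset on the Hub can be swapped in.
dataset = load_dataset("databricks/databricks-dolly-15k", split="train")
print(dataset[0]["instruction"], dataset[0]["response"])
```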
-
**Is your feature request related to a problem? Please describe.**
I often want to see the model size (the number of parameters). Often this is documented on the model card (or even in the model name), …
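In the meantime, a quick way to get this number locally (the checkpoint name is just an example):
```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")
# Count every trainable and non-trainable parameter tensor in the model.
num_params = sum(p.numel() for p in model.parameters())
print(f"{num_params / 1e6:.1f}M parameters")
```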
-
Hi, I tried to serve GPT-J with a Hugging Face repo id, and it works as follows:
```
-
Training parameters:
CUDA_VISIBLE_DEVICES=0 python src/train_sft.py \
    --model_name_or_path ./Bloom/ \
    --do_train \
    --dataset alpaca_gpt4_en \
    --finetuning_type lora \
    --checkpoint_dir path_to_pt_checkpoint…