bloomz Search Results - Githubissues

396 results
for bloomz

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

OpenVoiceOS/ovos-persona #4

The Persona Sprint

we hit our stretch goal for persona! this issue will document the progress - [Framework](#framework) - [Session](#session) - [Pipeline](#pipeline) - [Solvers](#solvers) - [Server](#server) …

JarbasAl updated 1 year ago
1
mudler/LocalAI #720

Getting started example not working

This project got my interest and wanted to give it a shot. Was attempting the getting started docker example and ran into issues: **LocalAI version:** Latest image **Environment, CPU architectu…

wouterverduin updated 10 months ago
15
Felixgithub2017/MMCU #2

如何处理多选题？

在使用MMCU的数据集的时候，发现有很多题是多选题，请问这种情况下是选对一个就算对，还是需要全选对？当前代码里使用 ``` if label in pred： ``` 来判断是否正确，会不会对多选题造成误判。参考HELM中对MMLU的处理，只需要选对一个即可。感谢！

dongZheX updated 1 year ago
4
huggingface/transformers-bloom-inference #68

"bloom-ds-zero-inference.py" works but "inference_server.cli…

`deepspeed --num_gpus 4 bloom-inference-scripts/bloom-ds-zero-inference.py --name /raid/data/richardwang/bloomz --cpu_offload` worked and gave me inference output. `/raid/data/richardwang/bloomz` is a…

richarddwang updated 5 months ago
4
huggingface/peft #1469

Using LoRA consumes high memory

### System Info transformers==4.34.0 torch ==1.13.1 peft==0.5.0 accelerate==0.23.0 ### Who can help? @pacman100 @younesbelkada @sayakpaul @stevhliu @MKhalusova ### Information - [X…

WenxiongLiao updated 1 month ago
21
hipudding/llama.cpp #6

模型支持情况

hipudding updated 3 months ago
5
ninehills/blog #92

大语言模型（LLM）微调技术笔记

> 注：本文大段摘抄自 [^2] **图1：大模型进化树**[^1] ## 0x00 大模型微调在预训练后，大模型可以获得解决各种任务的通用能力。然而，越来越多的研究表明，大语言模型的能力可以根据特定目标进一步调整。这就是微调技术，目前主要有两种微调大模型的方法[^2]： 1. 指令微调，目标是增强（或解锁）大语言模型的能力。 2. 对齐微调，目标是将大…

ninehills updated 4 weeks ago
22
langchain4j/langchain4j #1156

Error

QianfanChatModel.builder().modelName("ERNIE-Speed-8K").temperature(0.7).topP(1.0).maxRetries(1) .apiKey(apiKey) .secretKey(secretKey) .build(); --------- Err…

WuJingLearn updated 6 months ago
2
microsoft/DeepSpeed #4264

[BUG] exit with code -11

Running LLaMA Efficient Tuning PPO scripts to train a only 560M llm with deepspeed on A100*1(Only for testing the pipeline). Without deepspeed, the code runs fine, while getting unexpected error with …

Anonymousplendid updated 4 months ago
8
hiyouga/LLaMA-Factory #1347

DPO 训练后输出重复问题

v100 qwen模型 dpo训练后模型输出一直重复，还出各种乱码及其他语种的东西数据使用的comparison_gpt4和oaast_rm

Cloopen-ReLiNK updated 4 months ago
15

上一页 1...9 10 11 12 13 14 15...40 下一页

396 results for bloomz

396 results
for bloomz