-
What are some of the intended use cases for the 0.5B model?
There are not many other similarly sized models, nor is there much hype around them, though the general audience seems to love th…
-
Hi, congratulations on the great work, and thanks for open-sourcing it!
I am running step 3.2 with pair-preference-model-LLaMA3-8B. However, I encountered the warning "Some weights of LlamaForSequenceCl…
-
Excellent work! If you migrate it to 50-step SD-2-1, does it still work well?
-
First, thank you for your efforts in helping to bring accurate and performant RLHF techniques to the open-source community.
I'm raising this issue hoping to get some clarification on a couple implem…
-
Hi team, I am getting the following error while enabling 4-bit and LoRA:
```
  File "/root/miniconda3/envs/open/lib/python3.11/site-packages/deepspeed/runtime/engine.py", line 262, in __init__
    self._c…
```
-
Declaration of Webkul\Rewards\Models\Cart::items() must be compatible with Webkul\Checkout\Models\Cart::items(): Illuminate\Database\Eloquent\Relations\HasMany in /var/www/vhosts/hqol.store/httpdocs/v…
-
Thank you for your open-source materials. I successfully ran the run_mappo and run_maacktr models, but encountered an error while running the run_madqn model: **self.memory.push(…
-
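Thanks to the authors for generously open-sourcing this. The official README says the Chinese reward model is based on open-chinese-llama-7b, but the later step-by-step instructions say: python merge_weight_zh.py recover --path_raw decapoda-research/llama-7b-hf --path_diff ./models/moss-rlhf-reward-model-7B-z…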
-
I am trying to run the model that was downloaded from [huggingface](https://huggingface.co/nicklashansen/tdmpc2/tree/main/dmcontrol) using the following command:
```
python evaluate.py task=humanoid…
```
-
**Describe the bug**
When I run the `examples/raft_align.py` script with the fine-tuned LLAMA3 model, I encounter the following error:
```
Traceback (most recent call last):
  File "/home/work…
```