-
### Question
I'm trying to implement sampling and training asynchronously using the SAC algorithm. I made the attempt shown in the code below. But I always get an error because there seems to be a …
-
Hello, I tried to make checkpoint to save the model.
So I tried to open session with import tensorflow and saver = tf.train.Saver().
But I've got an error 'ValueError' : No variables to save.
I th…
-
With the release of Detectron2 it would be awesome to see inferencing in the wild be reworked to integrate with Detectron2. Is there any plans for this?
-
I successfully implemented PPO2 with MlpPolicy with two different custom environments I built. Now I want to extend to MlpLstmPolicy in one of my games.
I tried to understand the MlpLstmPolicy by r…
-
Where do you want to train the dataset or where to change the path?
I may have missed some instructions, thank you.
-
- [ ] [S-LoRA: Serving Thousands of Models From One GPU for Fun and Profit - OpenPipe](https://openpipe.ai/blog/s-lora)
# S-LoRA: Serving Thousands of Models From One GPU for Fun and Profit - OpenPi…
-
How do you use --env-kwargs correctly?
I have this code in a custom environment.
```python
custom_env = gym.make('forex-v0',
df = FOREX_EURUSD_1H_ASK,
window_size …
-
- [ ] [blog/starcoder2.md at main · huggingface/blog](https://github.com/huggingface/blog/blob/main/starcoder2.md?plain=1)
# blog/starcoder2.md at main · huggingface/blog
---
## StarCoder…
-
- [ ] [LoRA Land: Fine-Tuned Open-Source LLMs that Outperform GPT-4 - Predibase - Predibase](https://predibase.com/blog/lora-land-fine-tuned-open-source-llms-that-outperform-gpt-4)
# LoRA Land: Fine…
-
- [ ] [RichardAragon/MultiAgentLLM](https://github.com/richardaragon/multiagentllm)
# RichardAragon/MultiAgentLLM
**DESCRIPTION:** "Multi Agent Language Learning Machine (Multi Agent LLM)
(Update)…