-
I use Hugging Face TRL's SFTTrainer with PEFT and DeepSpeed to train a 6B model.
I have four 12GB GPUs.
When I use `AutoModel.from_pretrained(device_map="auto")`, it works, but the training process is so…
-
### Validations
- [X] I believe this is a way to improve. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find an [open issue](https://gi…
-
### Description
I have this replacement:
```yaml
- trigger: "doesnt"
  replace: "doesn't"
  propagate_case: true
```
But when I type `doesnt ` (fast typing with a space at the end), it re…
-
(llm) test@test-Z590-VISION-D:~/ipexllm_whowhat/ipex-llm/python/llm/dev/benchmark/harness$ python run_llb.py --model ipex-llm --pretrained /home/test/models/LLM/haiyan-scp/chatglm3-6b/pytorch --precis…
-
In a Flutter app that relies on
- auto_route: 8.2.0
with nested routes, the back gesture pops all the added routes up to the parent route.
Expected behaviour: on back gesture, we reach Screen A
…
-
**Describe the bug**
When fine-tuning my model using deepspeed==0.13.5 and the Hugging Face Trainer, loss and grad_norm become NaN at step 2.
![image](https://github.com/microsoft/DeepSpeed/assets/29994…
-
After successfully installing docker-wazuh, I want to connect the first agent:
```
sudo systemctl start wazuh-agent
Job for wazuh-agent.service failed because the control process exited with error co…
-
## Bug Report
Code - OSS UI is unusable and blood red. Spotify theme sync errors.
### Steps to Reproduce
1. Ml4w dotfiles installation.
2. Deleting ~/.config/hypr , ~/.config/Code - OSS, ~/…
-
### What happened?
We are attempting to use the Beam Java SDK to consume from a Kafka topic which contains Avro messages of varying schemas that nevertheless need to be correctly ordered. However, as you can…
-
The provided config.guess is from 2011, and newer versions allow it to compile under aarch64.
Available at `stevenlafl/dante` for expediency.
Only changed lines:
```
ARG GUESS_URL=http://git.savannah.…