-
This is in response to a query from Steven regarding my issue related to Tetra3: Tetra3: https://github.com/esa/tetra3/issues/30
I had questioned the performance I was seeing in terms of solution t…
-
### System Info
As per the documentation https://github.com/huggingface/blog/blob/main/4bit-transformers-bitsandbytes.md#can-we-train-4bit8bit-models it is not possible
```
Can we train 4bit/8bit…
-
I think it would be interesting and useful to implement GLoRA at least as a parameterization of the procedure, even without the evolutionary search piece. It's a generalization over LoRA and several o…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
CUDA_VISIBLE_DEVICES=0,1 python src/train_bash.py can run sucessfully.
deepspeed --num_gpus 2…
-
Assume we are trying to learn a sequence to sequence map. For this we can use Recurrent and TimeDistributedDense layers. Now assume that the sequences have different lengths. We should pad both input …
-
Hi,
Thank you for the amazing work! May I ask about why are the models different during pre training and finetune? I noticed that the strategy used during the finetune stage to document the pre-tra…
-
Original Author: @shelby3
Original URL: https://github.com/keean/zenscript/issues/35#issue-243345358
Original Date: July 17, 2017
---
|
----- | -----
![](https://upload.wikimedia.org/wik…
-
Hi!
I use this command line:
`-y -rtbufsize 300M -f gdigrab -framerate 60 -offset_x $area_x$ -offset_y $area_y$ -video_size $area_width$x$area_height$ -draw_mouse $cursor$ -i desktop -an -pix_fmt yu…
mzso updated
8 months ago
-
### System Info
### Describe the bug
I want to use LoRa tuning to fine-tune Whisper, but after I installed peft (parameter-efficient finetuning), it comes to a `segmentation fault` for each speech…
-
I am trying to use the model but it gave me this error
![Screenshot 2023-10-01 at 7 00 26 PM](https://github.com/kongds/scaling_sentemb/assets/145554661/2165d4a6-e4f1-4198-bdb7-664f74a20e3a)
![Scre…