-
I really like the idea of having a basic driver that just sequentially executes modules. The problem I am having currently is that sometimes I may want to aggregate outputs from multiple modules or sp…
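
To make the request concrete, here is a minimal sketch of a sequential driver that keeps every module's output by name, so a later module can aggregate results from several earlier ones. All names here (`run_sequential`, the `(name, module)` pairs, the `"input"` key) are hypothetical and not taken from the project's code.

```python
from typing import Any, Callable, Dict, List, Tuple

# Hypothetical module signature: a module receives the dict of all
# outputs produced so far and returns its own output.
Module = Callable[[Dict[str, Any]], Any]


def run_sequential(modules: List[Tuple[str, Module]], initial: Any) -> Dict[str, Any]:
    """Run modules one after another, storing each output under its name."""
    outputs: Dict[str, Any] = {"input": initial}
    for name, module in modules:
        # Each module can read any earlier output, not only the previous one.
        outputs[name] = module(outputs)
    return outputs


modules = [
    ("double", lambda out: out["input"] * 2),
    ("square", lambda out: out["input"] ** 2),
    # Aggregates the outputs of two earlier modules.
    ("total", lambda out: out["double"] + out["square"]),
]
results = run_sequential(modules, 3)
```

Execution stays strictly sequential; only the data flow changes, because outputs are looked up by name instead of being piped from one module straight into the next.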
-
The paper does not mention a specific location encoding method. I took a brief look at the code; may I ask whether a learnable encoding method is being used?
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
When I add just one line to `examples/extras/adam_mini/qwen2_full_sft.yaml`, I get the error below.
```…
-
Hi, thank you for your great work!
I tried to train an IP-Adapter on top of my own Stable-Diffusion-like backbone model (for my backbone model: I slightly expand the model size of SDXL and then I well p…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
Fine-tuning with data-parallel LoRA fails with the following error:
CUDA SETUP: CUDA runtime path found: /usr/local/cuda/lib64/libcudart.so.11…
-
Dear all,
I was trying to run a MOFA model in Python, but I hit the following error during training:
Module 'scipy' has no attribute 'where'
My data consists of two views with different …
-
I ran a few test jobs based on the recent [llama2-7B fine-tuning blog](https://www.philschmid.de/fine-tune-llama-7b-trainium#3-fine-tune-llama-on-aws-trainium-using-the-neurontrainer) using the latest…
-
# Description
Defining a function that takes no arguments under `@rpccall` throws an error.
# To reproduce
Try to run the following file:
```python
from p4p…
-
Kohya has added preliminary support for Flux.1 LoRA to his SD3 branch. I have created a `sd3-flux.1` branch and updated to the latest sd-scripts sd3 branch code... No GUI integration yet... I will sta…
-
I was able to run the LSTM-based pointer generator successfully. When running the transformer_encoder branch with LSTM=false, I encounter this error:
```
File "training_ptr_gen/train.py", line 400,…