-
NotImplementedError Traceback (most recent call last)
Cell In[56], line 1
----> 1 trainer = Trainer(
2 model=model,
3 args=args,
4 train_datase…
-
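Hello, I have a question. With the # Single GPU setting in
train_text_to_video_sft.sh
ACCELERATE_CONFIG_FILE="accelerate_configs/uncompiled_1.yaml"
training runs without problems. When I replace uncompiled_1.yaml with deepspeed.yaml to use DeepSpeed, I get an error:
企业微信截图_17325422283473
May I ask…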
-
Hello,
I have a task with an unknown global optimum, and since optimizers can get stuck in local optima, I want to make sure the achieved optimum is reached from various random starting points. Therefore I wo…
-
I maintain the humpday package that compares global derivative-free optimizers. I'm curious to know how L2o approaches stack up against others, so just saying hi. I think it is a nice benchmark for mo…
-
initializing deepspeed distributed: GLOBAL_RANK: 0, MEMBER: 1/1
Enabling DeepSpeed BF16.
LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0,1,2,3]
Traceback (most recent call last):
File "./train.py", lin…
-
### Is your feature request related to a problem? Please describe.
The task of finding the MLE for a three-parameter (generalized) Pareto distribution boils down to finding _all_ roots of the (univ…
-
Hi there,
I have tried running this code but cannot get past the create model step; I've pasted the error below. Please let me know if you need more information.
Thanks!
```
------------…
-
**System information**.
- Have I written custom code (as opposed to using a stock example script provided in Keras): yes
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): google colab
- T…
-
Following the recent exchange https://github.com/JuliaDiffEq/DiffEqParamEstim.jl/issues/60#issuecomment-356646400, I have uploaded here https://github.com/finmod/DiffEqParamEstim.jl/blob/master/test…
-
By "thread-safe" here I mean "being able to call geogram functions concurrently". E.g. being able to run Delaunay triangulation on two meshes in parallel (which can happen when integrating Geogram fun…