-
Hi, I have a script that runs with the DataParallel trainer on a machine with 8 H100 GPUs (an AWS p5 instance) using DeepSpeed. When we run the script, it randomly gets stuck forever at some iteration r…
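Not a fix, just a hedged first debugging step for a hang like this: have every rank periodically dump its Python stacks so you can see which call (usually a collective) each process is blocked in. faulthandler is in the standard library, and setting NCCL_DEBUG=INFO in the environment gives the matching NCCL-side logs.

```
import faulthandler
import sys

# Dump all thread stacks of this process to stderr every 300 s until cancelled.
# Dropping this near the top of the training script is enough; each rank writes
# its own dump, so the rank stuck in a collective stands out.
faulthandler.dump_traceback_later(300, repeat=True, file=sys.stderr)
```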
-
In our previous meeting, Jeremy mentioned that emll now automatically handles missing data.
I'm using emll right now, and it's complaining about array shape mismatches, but I have no idea where thes…
-
Hello, thanks for your great work. While exploring the code, I found something confusing and want to make sure whether it is a bug.
In https://github.com/FoundationVision/OmniTokenizer/blob/main/OmniToke…
-
### Description
When trying to use optax.MultiSteps on a data-parallel setup with shard_map, I am getting the following error:
```
NotImplementedError: No replication rule for cond. As a workar…
```
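For context, here is a minimal repro-style sketch of the kind of setup I believe is involved (the toy model, shapes, and partition specs are my assumptions, not the actual training code). optax.MultiSteps uses lax.cond internally to choose between accumulating and applying the update, and that cond is what the replication checker rejects:

```
import numpy as np
import jax
import jax.numpy as jnp
import optax
from jax.sharding import Mesh, PartitionSpec as P
from jax.experimental.shard_map import shard_map

mesh = Mesh(np.array(jax.devices()), axis_names=("data",))
opt = optax.MultiSteps(optax.sgd(1e-2), every_k_schedule=4)

params = jnp.zeros((4,))
opt_state = opt.init(params)

def step(params, opt_state, batch):
    # Per-shard gradient, averaged over the "data" mesh axis.
    grads = jax.grad(lambda p: jnp.mean((batch @ p) ** 2))(params)
    grads = jax.lax.pmean(grads, axis_name="data")
    # MultiSteps decides between "accumulate" and "apply" with lax.cond,
    # which is where the replication check fails.
    updates, opt_state = opt.update(grads, opt_state, params)
    return optax.apply_updates(params, updates), opt_state

step_sharded = shard_map(
    step, mesh,
    in_specs=(P(), P(), P("data")),  # params/opt_state replicated, batch sharded
    out_specs=(P(), P()),
)

batch = jnp.ones((8, 4))  # leading dim must divide evenly over the "data" axis
step_sharded(params, opt_state, batch)  # raises the NotImplementedError above
```

(shard_map also accepts a check_rep argument that disables this replication check entirely; whether relying on that is acceptable here is presumably part of the question.)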
-
```
if training_args.do_train:
    model.gradient_checkpointing_enable()
    model.enable_input_require_grads()
```
Hello author, what is the purpose of making the inputs also compute gradients here? Is this line of code redundant, or is it …
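For context (my own toy sketch, not the authors' answer): enable_input_require_grads matters mainly in combination with gradient_checkpointing_enable when the layers producing the checkpoint inputs are frozen. With reentrant checkpointing, if nothing entering a checkpointed block requires grad, the block is never re-run in backward and the trainable parameters inside it receive no gradients. As far as I understand, enable_input_require_grads registers a forward hook on the input embeddings that forces their output to require grad; the toy emb/block modules below only illustrate that mechanism.

```
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint

# Toy stand-in: a frozen embedding feeding a checkpointed, trainable block.
emb = nn.Embedding(100, 16)
emb.weight.requires_grad = False
block = nn.Linear(16, 16)  # trainable layer inside the checkpointed segment

x = torch.randint(0, 100, (4, 8))
out = checkpoint(block, emb(x), use_reentrant=True)
print(out.requires_grad)  # False: nothing entering the checkpoint required grad

# Roughly the trick enable_input_require_grads relies on: force the embedding
# output to require grad so the checkpointed block is recomputed in backward.
emb.register_forward_hook(lambda m, i, o: o.requires_grad_(True))
out = checkpoint(block, emb(x), use_reentrant=True)
out.sum().backward()
print(block.weight.grad is not None)  # True: the block now receives gradients
```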
-
While debugging your code, I could not find the implementation of your clone and split algorithm. Where is it?
In /scene/gaussian_model.py, line 492 is the function
def densify_and_split(self, grads, grad_thres…
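Not the repository's code, just a hedged paraphrase of what clone/split densification means in the 3DGS paper, so it is clearer what densify_and_split (and the companion densify_and_clone) are expected to do; all names, shapes, and thresholds below are my own.

```
import torch

def densify_sketch(xyz, scales, grads, grad_threshold, percent_dense, scene_extent):
    # Points with a large view-space gradient are densified: small Gaussians are
    # cloned (under-reconstruction), large Gaussians are split (over-reconstruction).
    high_grad = grads.norm(dim=-1) > grad_threshold
    small = scales.max(dim=-1).values <= percent_dense * scene_extent
    cloned = xyz[high_grad & small]
    split = xyz[high_grad & ~small].repeat(2, 1)  # real method also offsets and shrinks these
    return torch.cat([xyz, cloned, split], dim=0)

# Example: 10 Gaussians with random positions, scales, and accumulated gradients.
xyz, scales, grads = torch.randn(10, 3), torch.rand(10, 3), torch.randn(10, 3)
print(densify_sketch(xyz, scales, grads, 0.5, 0.01, 4.0).shape)
```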
-
On Linux on Peregrine, to get things to compile, I needed to add the flag -fPIC:
```
gfortran -c -fPIC adBuffer.f
gcc -c -fPIC adStack.c
```
I am not sure exactly what it means; maybe it is something to ad…
-
I first freeze the model parameters:
```
for param in self.parameters():
    param.requires_grad = False
```
Then, I unfreeze the parameters of the new layer that I want to train:
`for param in ne…
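To keep this self-contained, here is a minimal runnable sketch of the freeze-then-unfreeze pattern described above (the toy module and attribute names are mine, not from the original code), including the usual companion step of handing only the trainable parameters to the optimizer.

```
import torch
from torch import nn

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = nn.Linear(8, 8)   # pretrained part, to be frozen
        self.new_layer = nn.Linear(8, 2)  # newly added part, to be trained

    def forward(self, x):
        return self.new_layer(self.backbone(x))

model = Net()
for param in model.parameters():            # freeze everything
    param.requires_grad = False
for param in model.new_layer.parameters():  # unfreeze only the new layer
    param.requires_grad = True

# Only the trainable parameters go to the optimizer.
optimizer = torch.optim.Adam(p for p in model.parameters() if p.requires_grad)
loss = model(torch.randn(4, 8)).sum()
loss.backward()
optimizer.step()
```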
-
Thanks so much for the great code!
I was checking the reptile code, and it appears I need to set track_higher_grads=True in the context for this to run.
Is there something I am missing here? Th…
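For reference, a minimal sketch of where track_higher_grads enters the higher API (the toy model and data are mine, not the repository's Reptile code): it is a keyword argument of higher.innerloop_ctx, where True keeps the full graph through the inner-loop updates and False gives a cheaper first-order variant.

```
import torch
from torch import nn
import higher

model = nn.Linear(4, 1)
inner_opt = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.randn(8, 4), torch.randn(8, 1)

# Inner loop on a functional copy of the model; flipping track_higher_grads
# controls whether the inner updates stay differentiable.
with higher.innerloop_ctx(model, inner_opt, track_higher_grads=True) as (fmodel, diffopt):
    for _ in range(3):
        diffopt.step(((fmodel(x) - y) ** 2).mean())
    adapted = [p.detach() for p in fmodel.parameters()]
```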
-
### What you have to share
Adding a field to each hackathon to denote participant eligibility (active undergrad/grad only, 12 months after graduation, etc.) might help new grads or those who have alr…