-
### Description
Hi, we encounter an `XlaRuntimeError: INTERNAL: unsupported operands type` that is hard to understand and debug.
This started to happen on some unit tests that we have on our CI:
…
-
chatglm2_loar运行错误,提示glm2没有实现这个方法
model.enable_input_require_grads() NotImplementedError
-
For gaussian-opacity-fields it appears that grads and grads_abs may not be getting cleared correctly between calls of densify_and_prune(). They seem to double in size each time, eventually leading to …
-
WHO: Cecelia
Enable GRADS or LAS endpoints w/o having to republish their data.
-
I see that layernorm grads etc need to be synced in sequence parallel but I think DeepSpeed bypasses this logic since it doesn't use `MegatronOptimizer` class.
How does it work with sequence parallel…
-
https://github.com/tensorflow/benchmarks/blob/5d03cf8e356d2ae17df440cdb612c378cbacf5ef/scripts/tf_cnn_benchmarks/variable_mgr_util.py#L575
@reedwm
In parameter server mode, I managed to replace `…
-
aistudio上跑的paddlex。版本paddlex-2.1.0
```
model.train(
num_epochs=300,#训练轮数
train_dataset=train_dataset, #训练数据
eval_dataset=eval_dataset, #验证数据
train_batch_size=128, #batchsize
…
-
I have been trying to get `Scan` to work within a gradient expression without success. I wouldn't be surprised if I am using `Scan` incorrectly, so let me know :)
```python
def add_five(a):
…
-
in saliency functions we have
`utils.normalize(grads)[0]`
i want an option to get the raw gradients for thresholding or comparing between filters
-
model.enable_input_require_grads() 这块报错。如何解决呀