-
### Describe the feature request
The `DepthToSpace` and `SpaceToDepth` ops support integer types:
https://github.com/onnx/onnx/blob/38afbd31ac9a585abb7463dcaaae121651e0a2d7/docs/Operators.md#Depth…
-
```
It will be great if all the information parsed from the incoming json
request is made available to the robots. This will give robots access to
any new fields without the need for additional API.…
-
(.venv) mldl@ub1604:~/ub16_prj/NARRE/model$ python train.py
I tensorflow/stream_executor/dso_loader.cc:128] successfully opened CUDA library libcublas.so locally
I tensorflow/stream_executor/dso_lo…
-
When using summary alerts, it is useful to not emit actions if there are only ongoing alerts. Ongoing alerts can be filtered out using the `If alert matches a query` feature, but doing so will lead to…
-
Traceback (most recent call last):
File "/home/tanglin/data/Code/DCNets/dcnet_cifar100/linear_cos/train_resnet.py", line 161, in
train(args.base_lr, args.batch_size)
File "/home/tanglin/da…
-
I posted this as an issue before ( https://github.com/openai/gradient-checkpointing/issues/4 ), however, neither of the suggestions appear to work. I get the same error with both methods suggested:
…
-
### 🐛 Describe the bug
```python
import torch
from torch.nn.attention.flex_attention import create_block_mask, flex_attention
torch.set_default_device("cuda")
@torch.compile(dynamic=True)
de…
-
Hi,
When running the following code, I get an illegal memory access error with the following graph. I am not sure why and do not understand the algorithm or C++ well enough to track it down. I do …
-
**Describe the bug**
When comparing zero-1 and zero-2, I noticed discrepancies between the results in the DeepSpeed Flops Profiler and the training speed metrics in transformers, and the conclusions …
-
```
============================================================
Deno has panicked. This is a bug in Deno. Please report this
at https://github.com/denoland/deno/issues/new.
If you can reliably re…