-
### 🐛 Describe the bug
```
File "/mnt/clusterstorage/workspace/kevin/ml-monorepo/chadfusion/train_fsdp.py", line 363, in fsdp_train
scaler.step(opt)
File "/usr/local/lib/python3.10/dist-…
-
Setting the seed and repeating the fused neighborhood sampling for a source code does not reproduce the same subgraph. I have identified a fix that is slower but makes the subgraphs reproducible.
## …
-
## Current State of OSS FP8 Operators
So far, all examples of fp8 ops (compute in fp8) are scaled matmuls that accumulate in a higher precision type. In fact, there are really only 2 classes of in…
-
### Preliminary Checks
- [X] This issue is not a duplicate. Before opening a new issue, please search existing issues.
- [X] This issue is not a question, feature request, or anything other than a…
-
According to the original paper, it seems that line 367 in TDANet should be changed as follows:
```
# expanded = self.last_layer[i](x_fused[i], x_fused[i - 1])
expanded = self.last_layer[i](x_fused[i], x_…
-
(marconet) C:\Users\L\Pictures\MARCONet>python test_sr.py -i "C:\Users\L\Downloads\bsrgan\inputs" --real_ocr
################################################################
Input …
-
### Describe your issue
The last example of https://cython.readthedocs.io/en/latest/src/userguide/fusedtypes.html#indexing looks like this:
```python
ctypedef fused my_fused_type:
int[:,…
-
When I used this command to install hart_backend, I encountered this error. How can I solve it?
`cd hart/kernels && python setup.py install`
`The above exception was the direct cause of the follow…
-
### 🚀 The feature, motivation and pitch
For transformer architectures (for example https://github.com/pytorch-labs/gpt-fast/blob/main/model.py#L195-L211), it tends to be most performant to merge the qk…
-
**Is your feature request related to a problem? Please describe.**
The composite version makes many calls to the slow binary ops:
**Describe the solution you'd like**
Have one op call for this op…