hxdtest opened this issue 2 months ago (status: Open)
We simply haven't implemented it. We welcome contributions in this space.
Hi @jackkosaian, could I take care of it? Could you assign it to me?
Yes, please feel free to submit a PR supporting this.
Hi, I have opened PR https://github.com/NVIDIA/cutlass/pull/1843. I added it because:
However, I have some doubts:
I was not able to run 02_pytorch_extension_grouped_gemm.ipynb with my changes, so I have no proof that they work, at least for this example. Any advice on how to import pytorch from a local repo would be welcome.
What is your question? In python/cutlass/emit/pytorch.py, bfloat16 is not supported?