-
### 🐛 Describe the bug
## Description
Before https://github.com/pytorch/pytorch/pull/123732, when running with AOTI, the SDPA pattern can be hit and `torch.ops.aten._scaled_dot_product_flash_atten…
-
**Brief intro**
I apologize if this is not the correct forum for this question, but any help would be greatly appreciated! I already asked [here](https://community.rstudio.com/t/gradienttape-attribut…
-
您好!我之前看了您的代码实现和PCGrad原作者的代码,原作者代码如下[PCGrad](https://github.com/tianheyu927/PCGrad/blob/master/PCGrad_tf.py),对代码实现有一些疑问。
- 1、您这个算例[PINN_for_medium_homo_CTMC_reliability_analysis.ipynb](https://githu…
-
Providing dense matrix support for VexCL would probably be of use for many people, but currently it is unclear what is the best approach to do this.
Options are:
1. Provide `vex::matrix` and make `ve…
-
Hi, I have several questions to ask:).
1. After I ran the project with one session, I deleted the model.pth and ran another session(other files kept) which I had run before(running from scratch, no d…
-
Currently, we have `helper/cvx_kron.py` which computes the tensor product between two `cvxpy` `Expression` objects. We should instead remove this function and deal with this logic explicitly within th…
-
> BiocManager::install("beachmat")
/home/lenovo/R/x86_64-redhat-linux-gnu-library/4.4/assorthead/include/tatami/isometric/binary/DelayedBinaryIsometricOperation.hpp:608:13: 警告:right operand of comma …
-
I have some not-huge molecules (a little bigger than the ones in this demo), and `mcismol`/`mcesmol` are taking more runtime than I care to allow to finish. Here's a demo that illustrates the problem:…
-
I am trying to enable cpu activation offload when training my custom LLAMA model. However, an error occur:
![image](https://github.com/microsoft/DeepSpeed/assets/104556055/4a500207-faa4-4414-968f-a9f…
-
I am using TensorFlow 0.12.1 and Python 2.7.12 (as mentioned) but I am still running into the following issue:
:~/Downloads/NARRE-master/model$ python train.py
Parameters:
ALLOW_SOFT_PLACEMENT…