-
**Is your feature request related to a problem? Please describe.**
Models whose topmost layers have intricate details and reduced cross-section, rely on layer-time minimums in order to maintain str…
-
when I train the model, there is a bug in layers/modules/multibox_loss.py
In line 97, loss_c[pos]=0, the shape of mask [32,8732] ar index 0 not match the shape of the indexed tensor [270424,1] at ind…
-
My success/failure cases here have been a bit inconsistent, so I'll try to be explicit as possible.
@daveliddell and I are trying to get SDXL VAE on f32 working with AIE offload, and first we need …
-
### Description
In some cases (see e.g. #7734) derivatives are involved in nonlinear strong components. If not solved with the appropriate precision, they may fool the ODE solvers's error control m…
-
Thank you for your great work! I was just wondering how I would modify the code to control the output length, as running the SkipLSTM model produces a Tensor of dimensions [386, 6165] (no pooling) whe…
-
We are having a xgboost model (0.6a) and we are trying to do explain_prediction on eli5 (0.8) and we are seeing significant response time issue (of order of 4 secs for a single row). Our model is of a…
-
### 🚀 The feature, motivation and pitch
**Context**:
While using [float8 training](https://fburl.com/4mblb81i), the operators of `fp8 = cast_to_fp8(input_tensor); fp8_t = fp8.t().contiguous().t()`…
-
## Your Daily Accomplishment 🚀:
No coding today, but I came across this interesting blog post on KDnuggets about Zillow's [failed home flipping business](https://www.kdnuggets.com/2021/12/9b-ai-failu…
-
## Description
This is a meta issue made from three encountered during Sage Days 123. It was noticed that there was ambiguous representation and failing group laws in certain scenarios. Issue #3709…
-
### 🐛 Describe the bug
For mean_out op, the print value of y1 suppose to be "tensor([[[2., 2., 2., 2.]]], dtype=torch.float16)" but it will be randn value which is same as y1 create. This means it no…