-
**Describe the bug**
during the following code block in the [basic tutorial](https://starry.readthedocs.io/en/latest/notebooks/Basics/), the error below happened. the entire text of the error is past…
-
### 🐛 Describe the bug
I previously attempted to submit a similar issue on #3383, but some of my imprecise expressions may cause unnecessary misunderstandings, which could increase the cost of unders…
-
### 🐛 Describe the bug
Hi~ We tried to use pipeline parallel + gemini to train a model.But it seems that there was a deadlock during communation.The following is a simple reproduction based on the [o…
-
LeCun, Yann, Yoshua Bengio & Geoffrey Hinton. 2015. “[Deep Learning](https://www.nature.com/articles/nature14539).” Nature 521: 436-444.
Karpathy, Andrej. 2015. “[The Unreasonable Effectiveness of …
-
> Benchmarl automatically makes a video.
>
> In particular you might want to set these parameters
>
> https://github.com/facebookresearch/BenchMARL/blob/a9309159d6d46d099bd3d395ef1…
-
Thank you for the implementation for the paper. This is the first time I'm dealing with transformer model, I tried to train over Kinetics700 dataset using this model. and I just want to share some of …
-
### What is the problem?
SAC calculates the gaussian log probability based on clamped values, which can result in very large values if the tanh saturates and as a consequence result in explodin…
-
**Describe the bug**
I try to finetune `llama3-8B` model with multi nodes but get an AtrributeError when finishing loading mcore format checkpoint and starting to build datasets, the error is below:
…
-
Click to expand!
### Issue Type
xla
### Source
source
### Tensorflow Version
tf.__git_version__ = v2.10.0-rc3-6-g359c3cdfc5f
### Custom Code
No
### OS Platform and Distri…
-
## Keyword: efficient
### End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
- **Authors:** Javier Campos, Zhen Dong, Javier Duarte, Amir Gholami, Michael W. Mahoney,…