-
I am trying to follow https://github.com/mlcommons/training/blob/master/large_language_model/megatron-lm/README.md#data-download to download the data on gs://mlperf-llm-public2 as follows:
gsutil cp -r…
-
### 🐛 Describe the bug
torchbench_amp_bf16_inference
- [ ] `detectron2_fasterrcnn_r_101_c4`
- [ ] `detectron2_fasterrcnn_r_101_dc5`
- [ ] `detectron2_fasterrcnn_r_101_fpn`
- [ ] `detectron2_fas…
-
C4 modeling notation and shapes are verbose and text-heavy, which makes them a good use case for this Python library. A package of C4 icon shapes may be a good start.
Separating the textual content from the Py…
-
### Model
Antimalarial activity (ABS and sexual stages) - eos80ch
### Molecules
N[C@H]([C@H]([C@@H]([C@@H](CO)O)O)O)C=O
CCC/C(O)=N/C(C(O)=O)CCO
O=CC(O)C(O)C(O)C(O)COC(O1)C(O)C(O)C(O)C1COC(O2)C(…
-
pytm is really helpful for saving threat modeling time. However, we have a lot of C4 models in c4plantuml format, and it would be great to convert them into pytm format. It is reasonable to do threat modeling …
-
### Model
Antimalarial activity (ABS and sexual stages) - eos80ch
### Molecules
O=CC(O)C(O)C(O)C(O)COC(O1)C(O)C(O)C(O)C1COC(O2)C(O)C(O)C(O)C2CO
O=CC(O)C(O)C(O)C(O)COC(O1)C(O)C(O)C(O)C1COC(O2)C(O)C…
-
Hello, thank you for providing the community with a great framework. I did some experiments with the T5-small model on the C4 Vietnamese dataset, and I would like to get some feedback from you:
1. …
-
Hello,
first and foremost, I want to thank you for your incredible work!
I'd like further information on how to reproduce your code. I followed the code instructions in your README, but I am unabl…
-
**Describe the bug**
I've benchmarked both settings of `model.mcore_gpt` in an FSDP setting on the two most recent NVIDIA GPU architectures and found `model.mcore_gpt=False` to be consistently fast…