-
OS version: macOS 10.13.6
Cuda version: 10.1
GPU: GTX 1060
`[1/893] Linking CXX shared library lib/libtorch_cpu.dylib
FAILED: lib/libtorch_cpu.dylib
: && /Library/Developer/CommandLineTools…
-
Hi,
We are experiencing unexpected `sccache` server shutdowns when building our C++ project ([OpenVINO](https://github.com/openvinotoolkit/openvino)) for RISC-V with [Conan](https://conan.io/) in G…
-
| ckpt_id | batch_size | fuse | compile | quantization | sparsify | memory | time |
|:--------------------------------------:|-------------:|:------:|:---…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
There are numerous errors in the instructions for Linux i…
n4mwd updated
9 months ago
-
Hi,
I changed gap_sdk version into 4.22.0. (why I must use gap_sdkv4.22.0,pls see here[gap_sdk/issues/370](https://github.com/GreenWaves-Technologies/gap_sdk/issues/370))
But when I tried to run`mak…
-
Hello, thank you for your great work!
We are currently exploring the utilization of radio as a vision encoder for vision language models. In our specific setup, we employ [SigClip](https://huggingfac…
-
### 🚀 The feature, motivation and pitch
All T5 models and their derivatives (t5, mt5, t0, etc.) use `RMSNorm`, instead of `LayerNorm`. The former is a subset of the latter, it only scales and doesn…
-
**Describe the bug**
Context parallel does not work in some cases, such as pretrain llama-34b with 64 A800 GPUs and seqlen>=32768. **But using megatron-lm directly has no problem with the same conf…
XLzed updated
2 months ago
-
Hello,
Thank you for your contribution!
I am trying to prune a YOLOv5 Nano model by modifying the script of YOLOv7 pruning on this repository. The code is executed without any errors but the MACs…
-
We are using `examples/nlp/language_modeling/megatron_gpt_pretraining.py` with a small GPT model (1.35 billion parameters) on 4 H100 DGX nodes (8 GPUs each). Our DGX nodes are connected using Infini…