-
### Description
We have a large-scale C++ HPC astronomy application and want to call Python actors from within C++. From the codebase it looks like it's possible to call C++ actors from Pytho…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
This issue aims to implement a sequence-to-sequence model with an attention mechanism for …
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### Model Input Dumps
_No response_
### 🐛 Describe t…
-
Research task on architecture and tech stack in preparation for the project
-
How do I install vLLM on CPU? @RepoPilotAssistant
-
### Which jobs are flaking?
https://testgrid.k8s.io/sig-network-kind#sig-network-kind,%20ipvs,%20master
https://testgrid.k8s.io/sig-network-kind#sig-network-kind,%20ipvs,%20dual,%20master
https://…
aojea updated
2 weeks ago
-
### Your current environment
Collecting environment information...
PyTorch version: 2.1.2+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubunt…
-
### Background
During model initialization, each PP stage initializes its own portion of the model. This is preferable to initializing the model all in one place and then transferring it to e…
-
Why do we need to communicate like this?
Pytest reports tests as it discovers/runs them. We moved to this model so the user has feedback on what is going on. For repos with > 10k tests, sometimes the disco…
-
We have observed an NCCL initialization error on the PyTorch NGC 23.12 (with NCCL 2.19.3 + CUDA 12.3) Docker image on AWS P4DE (A100). The error surfaces during NCCL initialization and happens intermitte…