-
> Trying to resize storage that is not resizable at /opt/conda/conda-bld/pytorch_1550802451070/work/aten/src/TH/THStorageFunctions.cpp:70
> File "/home/chen/CLOCs/second/pytorch/train.py", line 51,…
-
Hi Dr.Yue,
I am new in this area. And I have complied the previous steps. When trying your 'train_gaussianhead.py', error occurs: "CUDA error: an illegal memory access was encountered". Then I locali…
-
**Describe the bug**
The Python pytorch emitter does not output functioning code when compiling `Gemm` with an `EVT`.
**Steps/Code to reproduce bug**
The script below reproduces the bug.
Sw…
-
ModuleNotFoundError: No module named 'torchvision.transforms.functional_tensor'
-
Unable to move large tensors to device. In other cases if the large tensors get created from operations we are unable to move them to host.
```
def test_large_slicing(device):
torch_a = torch.ra…
-
- [x] LeviCivitaTensor — totally antisymmetric tensor
- [ ] Band — specify banded structure in a sparse array
- [ ] Trace -- generalized trace
- [ ] ArrayReduce
Replaces https://github.com/mathi…
rocky updated
6 months ago
-
Hello,
I noticed that my process hangs at [`results = ray.get(object_refs)`](https://github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/models/vllm_causallms.py#L220) when I use `data_pa…
-
### Bug description
I am trying to run a very simple training script for 2 nodes and I always get this error:
Output:
```
(ve) root@442a8ba5c0c6:~/ptl# . wr.sh
Start fitting...
Initializing…
-
i build my model with --tp_size 2 --world_size 2, and put two generated model files into the backend directory and use the default config.pbtxt.
then i run the script/launch_triton_server.py --model_…
-
Thanks for perfect work inspired me a lot.
Here is the story.
Im currently working with tensorflow.datasets.imdb dataset. i decided to use Glove word embeddings(300d) with my toy project but i…