-
When I run mva, the following error occurs :(
EOFError: Ran out of input
TypeError: can't pickle vtkmodules.vtkCommonDataModel.vtkPolyData objects
How can I fix it?
Thanks!
-
Hi there. I'm trying to run Alpa PipeShardParallel training for a ResNet50 model (CIFAR10 dataset) on a distributed Ray cluster of two servers, each with four 1080 Ti GPUs.
- The **parallel method** i…
-
Hi @Serafadam, I am using Ubuntu 22.04 with ROS 2 Humble. I installed depthai-ros from source and got an error when running `ros2 launch depthai_ros_driver camera.launch.py`.
…
-
## Motivation
This RFC discusses mechanisms for interacting with tools or data outside the Typst document and its direct file system environment. Possible use cases:
- Use external tooling, e.g. to ge…
-
**Please describe the bug**
I'm trying to use `PipeshardParallel` for the GPT2 example in `examples/gpt2` (20debbe5f0ed4047d82ae615cb2c07b059498032) with Alpa v0.2.2 inside a Docker container. I'm on…
-
## Description
I am loading a Dictionary of key-value pairs with textures from a collection of .png files. This runs inside a `Task.Run`. Inside the task, the code paginates the textures into blocks of files with the id…
-
[Tech Lead DevOps with English] - Remote - Only for Latin America Residents.
Need to have: Terraform and Ansible. Kubernetes environment and CI/CD pipeline with multiple microservices. CAD 4,500.00 t…
-
Hi, thanks for your nice work!
The loss is NaN when I set the batch size to 16 and train the model on 8 RTX 2080 Ti GPUs with the command `python train.py`.
How can I solve this problem?
![image](https://user-im…
-
The idea here is to bring together several relevant signals for troubleshooting / analysis in the same dashboard.
For all signals of a given service/workload/app, a "score" for being potentially re…
-
As a user, I want to start a bare-metal instance (no virtualisation) in my Public Cloud tenant, with a choice between monthly billing and a pay-as-you-go model (hourly billing).