-
I am getting an `pickle.UnpicklingError` when trying to train again on a previously trained checkpoint with open_clip `v2.27.0+`.
This is similar to https://github.com/mlfoundations/open_clip/issue…
-
### Describe the bug
`com.amazonaws..ecr.dkr is not registered` in EC2 VPC Endpoint Service and it fails deployment in some of the isolated regions
### Regression Issue
- [ ] Select this opti…
-
### Description
**Observed Behavior**:
Pod fails to start - panic: AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist.
**Expected Behavior**:
Pod runs
**Reproduc…
-
Platforms: mac, macos
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_full_dtype&suite=TestFull&limit=100) and the most recent tr…
-
# EC2のエフェクト処理について
## 複数のEC2を組み合わせる場合の重複
EC2のプライマリ・セカンダリは一緒に組み合わされているEC2のエフェクトについては重複して発動しない認識で相違ないでしょうか?
### 正解例
- EC2 + EC2 = プライマリ1つのみ発動で クレジット +3
- EC2 + EC2 + EC2 = プライマリ・セカンダリが発動で クレジット +7…
-
### Which jobs are flaking?
- ci-kubernetes-ec2-conformance-latest
- e2e-ci-kubernetes-e2e-al2023-aws-conformance-cilium-canary
- ci-kubernetes-e2e-ec2-eks-al2023-serial
- ci-kubernetes-ec2-conf…
-
Hello!
From time to time (10-15% of cases) I see a situation when EC2 starts in `consume mode`, job submitted on the work queue, but the worker does not see this job.
Here is an example of logs of…
-
### What Happened?
Error starting cluster: cmd failed: sudo env PATH=/var/lib/minikube/binaries/v1.16.0:$PATH kubeadm init --config /var/tmp/minikube/kubeadm.yaml --ignore-preflight-errors=DirAvaila…
-
Hi, thank you for the package and great published papers on how the toolkit works. I'm having issues with arm64 mac wheels as installed from pypi. Doing `import CDPL.ConfGen as ConfGen` causes an imme…
-
**What happened**:
I have deployed kube-state-metrics on eks cluster, on which multiple pods are deployed across multiple namespaces. Now what we are seeing few metrics are not scrapped for all nam…