-
### What happened + What you expected to happen
Ray train official docs [script](https://docs.ray.io/en/latest/train/getting-started-transformers.html) fails due to NCCL error.
Proxy Call to rank 1 …
-
Continuation from #141
We need scripts for examples using comet to track and log metrics
hbja updated
3 months ago
-
re-creating a venv at every deployment (single run) is painful, up to 5min wait, there are a few way to deal with it like:
https://stackoverflow.com/questions/11021130/parallel-pip-install
or ha…
-
Hi all,
I am trying to use `metaflow resume` to resume my flow from the last failed step. Unfortunately, it seems to resume at the step prior to the failed step. Why is this happening? Is this a bu…
-
## Describe the Bug
No module named 'imp'
## Expected behavior
Just run it.
## Where is the issue?
- [x] Comet Python SDK
- [ ] Comet UI
- [ ] Third Party Integrations (Huggingface, Tensorb…
-
I feel sorry to bother you again. I just finish the first phase of training with your sincerely help and move on to the next command which is `python run_nerf.py --config=configs/pour_baseline_flow.tx…
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is.
An attempt to make further enhancements in Kotsu Benchmarking seems to go…
-
### Building with GCC on XSEDE Stampede2
```
# I have already installed eccodes at ~/packages/release
# You can use this script to install eccodes locally
#
# https://github.com/Weiming-Hu/Anal…
-
## Describe the Bug
After importing comet_ml a scikit-learn-based training script fails during sklearn grid search cross-validation: "broken pipe" exception in joblib. Works fine without import of co…
-
I tried to pretrain Longformer using transformers and datasets. But I got OOM issues with loading a large text file. My script is almost like this:
```python
from datasets import load_dataset
@…