NVIDIA/NeMo-Run
A tool to configure, launch and manage your machine learning experiments.
Apache License 2.0
78 stars
20 forks
issues
#118 Support Annotated in CLI (marcromeyn, opened 2 days ago, 0 comments)
#117 Add multiple include paths option to packagers (Kipok, closed 3 days ago, 0 comments)
#116 Support ContainerPackager that can accept a list of different packagers (hemildesai, opened 4 days ago, 0 comments)
#115 Remove sending packager to fdl_runner (hemildesai, closed 4 days ago, 0 comments)
#114 ci: Add `build-test-publish-wheel` workflow (ko3n1g, opened 1 week ago, 0 comments)
#113 Remove tar -Af to support Mac based systems (hemildesai, closed 4 days ago, 0 comments)
#112 GitPackager with include_pattern fails when run on macos (Kipok, closed 4 days ago, 1 comment)
#111 Upgrade skypilot to 0.7.0 (hemildesai, opened 1 week ago, 0 comments)
#110 Add ability to customize job details between subtasks inside of a task group for slurm (hemildesai, closed 1 week ago, 0 comments)
#109 Fix git packager for git repo with submodules (hemildesai, closed 2 weeks ago, 0 comments)
#108 Add ability to package a list of paths instead of single one (titu1994, closed 3 days ago, 1 comment)
#107 Add --dump option to dump fiddle configurations (TaekyungHeo, closed 4 days ago, 0 comments)
#106 Return correct exit code in slurm job groups (hemildesai, closed 2 weeks ago, 0 comments)
#105 Add a helper function get_executor for retrieving executors from NEMORUN_HOME (hemildesai, opened 2 weeks ago, 4 comments)
#104 AssertionError from nemo_run/core/runners/fdl_runner.py when using PreTrainingDataModule (RachitBansal, opened 2 weeks ago, 0 comments)
#103 Add validation for run.Partial based tasks in Experiment (hemildesai, closed 3 weeks ago, 0 comments)
#102 Update management guide with more details (hemildesai, closed 3 weeks ago, 0 comments)
#101 Make GitArchivePackager behavior consistent with subpath and include_pattern (hemildesai, closed 3 weeks ago, 0 comments)
#100 Fix multi gpu torchrun in Skypilot (hemildesai, closed 1 week ago, 0 comments)
#99 Handle constants in CLI (marcromeyn, closed 4 weeks ago, 0 comments)
#98 Can't pass in dtype via CLI (sweiss-google, closed 4 weeks ago, 2 comments)
#97 zlib.error: Error -3 while decompressing data: incorrect header check (RachitBansal, opened 1 month ago, 1 comment)
#96 Remove mkpath (replace with explicit mkdir) since its not supported everywhere (hemildesai, closed 1 month ago, 0 comments)
#95 Handle missing experiments (amaslenn, closed 1 month ago, 0 comments)
#94 Move to uv (hemildesai, opened 1 month ago, 0 comments)
#93 Fix dependencies in Slurm Executor and local inline scripts (hemildesai, closed 1 month ago, 0 comments)
#92 Shorten job name when viewing logs (hemildesai, closed 1 month ago, 0 comments)
#91 Add option to return dict in Experiment.status (hemildesai, closed 1 month ago, 2 comments)
#90 Add option to cancel all jobs + cancel all dependencies of a job in experiment CLI (hemildesai, closed 1 month ago, 0 comments)
#89 Fix slurm environment variables for non-het task groups (Kipok, closed 1 month ago, 0 comments)
#88 Add nightly workflow (hemildesai, closed 1 month ago, 0 comments)
#87 Use --mkpath by default in rsync (hemildesai, closed 1 month ago, 0 comments)
#86 Fix get type namespace (marcromeyn, closed 1 month ago, 0 comments)
#85 Fix get_type_namespace to work with buildable (marcromeyn, closed 1 month ago, 0 comments)
#84 Possibility to close a group of jobs belonging to a specific experiment name (titu1994, closed 1 month ago, 0 comments)
#83 Use rsync to sync experiment directory for slurm (hemildesai, closed 1 month ago, 0 comments)
#82 use --diff instead of --check in rye (akoumpa, closed 1 month ago, 0 comments)
#81 Disable pretty exceptions in Typer (hemildesai, closed 1 month ago, 0 comments)
#80 Mute Deprecation/Future warnings (akoumpa, closed 1 month ago, 0 comments)
#79 Convert arg to str for template substitution (hemildesai, opened 1 month ago, 0 comments)
#78 Add python API for getting dictionary with job status instead of only printing it (Kipok, closed 1 month ago, 0 comments)
#77 Add arbitrary DAG execution support to Experiment (hemildesai, closed 1 month ago, 1 comment)
#76 Add setup lines option to SlurmExecutor (hemildesai, closed 1 month ago, 0 comments)
#75 Fix skypilot executor and make it consistent with others (hemildesai, closed 1 month ago, 0 comments)
#74 Do not deserialize task cfg in _load_jobs (hemildesai, closed 1 month ago, 0 comments)
#73 Move loading external main to experiment reset (hemildesai, closed 1 month ago, 0 comments)
#72 Shorten the task name in logs (Kipok, closed 1 month ago, 0 comments)
#71 When running with detach=False on slurm, add an option to cancel experiment when ctrl+C is pressed (Kipok, opened 2 months ago, 0 comments)
#70 Don't die if running git fails (akoumpa, closed 2 months ago, 1 comment)
#69 Fix errors with fetching slurm logs (hemildesai, closed 1 month ago, 0 comments)