microsoft / nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
https://nni.readthedocs.io
MIT License
14.02k stars 1.81k forks source link

NNI v2.8 Iteration Plan #4735

Closed scarlett2018 closed 2 years ago

scarlett2018 commented 2 years ago

Release Plan for vNext release

Planned Items

NAS

owners: @ultmaster / @QuanluZhang / @JiahangXu / @hzhua

Pipeline

HPO

owner: @liuzhe-lz

Experiment

owner: @liuzhe-lz

WebUI

owner: @Lijiaoa

Compression

owners: @J-shang /@zheng-ningxin

Deferred

matluster commented 2 years ago

NAS Candidates in the scrum

Done and in progress will be move to the root comment

Doc

Pipelines

liuzhe-lz commented 2 years ago

HPO Candidates in the scrum

Done and in progress will be move to the root comment

Experiment

Document

Lijiaoa commented 2 years ago

WebUI Candidates in the scrum

Done and in progress will be move to the root comment

Feature:

Refactor and bug fixed:

J-shang commented 2 years ago

Compression Candidates in the scrum

Done and in progress will be move to the root comment

QuanluZhang commented 2 years ago

bugbash:

NAS

HPO

owner: @liuzhe-lz tester: @J-shang

Experiment

owner: @liuzhe-lz

WebUI

owner: @Lijiaoa tester: @liuzhe-lz

Compression

owners: @J-shang /@zheng-ningxin

More prs:

Pipelines: @ultmaster @J-shang @liuzhe-lz

matluster commented 2 years ago

Logging:

image

NAS experiment: hang in this state. Experiment status is DONE.

image

When I press Ctrl-C, the traceback is:

^CException ignored in: <module 'threading' from '/home/azureuser/miniconda3/envs/nni/lib/python3.8/threading.py'>
Traceback (most recent call last):
  File "/home/azureuser/miniconda3/envs/nni/lib/python3.8/threading.py", line 1388, in _shutdown
    lock.acquire()
KeyboardInterrupt: 
[2022-06-01 00:09:19] Stopping experiment, please wait...
INFO:nni.retiarii.experiment.pytorch:Stopping experiment, please wait...
[2022-06-01 00:09:19] Dispatcher exiting...
INFO:nni.runtime.msg_dispatcher_base:Dispatcher exiting...
[2022-06-01 00:09:20] Dispatcher terminiated
INFO:nni.runtime.msg_dispatcher_base:Dispatcher terminiated
[2022-06-01 00:09:20] Experiment stopped
INFO:nni.retiarii.experiment.pytorch:Experiment stopped
J-shang commented 2 years ago

WebUI (from Zhe's comments):

Experiment Tuner Config need to report right error message.