An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.
[x] Weight sharing, 10 days @QuanluZhang TODO: before 11/17, Quanlu drives the possible implementation plan; after 11/17, Yuge takes over design and implementation. ETA: 11/27
[ ] P2 - (blocked by NAS search space structure) support NAS Once-for-all algorithm on NNI, 5 days, @colorjam #2955
[x] Support heterogeneous training service @SparkSnail PR #3097 (review: @squirrelsc @liuzhe-lz)
[x] Support remote
[x] Support azureml
[x] Support local
[x] Support PAI
[x] doc 11.25
[ ] refactor code for review 11.26
[ ] IT pipeline (ongoing)
[ ] Release materials @scarlett2018 /@liuzhe-lz (after feature freeze 11/27, target end date 12/11)
[ ] release process, 1-2 weeks @liuzhe-lz
[ ] doc, article, and demo, 1 week @scarlett2018
[ ] Release press (demo new features) @scarlett2018
[ ] Upload video
Features
[x] Reorganize NNI Python package, package hierarchy, source code directory hierarchy - dev owner: @liuzhe-lz
[x] P0 - able to install and run, 3 days, ETA: 10/15
[x] Experiment duration under TUNER_NO_MORE_TRIAL @98may
[x] HyperBand refactor @ ayan
[x] ADL @SparkSnail
[x] Quantization export @QuanluZhang
[x] NNI manager warning @QuanluZhang
[x] Check other PRs not listed in tasks @98may
[x] Kill trial @liuzhe-lz
[x] Logging (check if NNI manager, dispatcher, trial log contains logging and printing output, including multiprocess scenario) @ultmaster
[x] nnictl save , load , log commands @98may
[x] quantization bug (PR #3160) @J-shang
[x] Cream @ultmaster
Documentation
[x] Retiarii @QuanluZhang
[ ] ETA for v1 @QuanluZhang 12/30; Reviewers: @colorjam @ultmaster @kvartet @scarlett2018
[x] Heterogeneous
[ ] [heterogeneous --> hybrid] ETA: 12/31 @kvartet @scarlett2018
[x] Launch from Python PR ready @J-shang #3217 ; Reviewers: @liuzhe-lz @QuanluZhang @kvartet; ETA for review: 12/31;
[x] New experiment config @liuzhe-lz #3222; Reviewer: @SparkSnail; TODO: @liuzhe-lz fix reviewer comments; + @kvartet (comments: there are lots of TODOs in the doc); ETA for doc update: 12/31;
This is the draft plan for the upcoming 4-week iteration in Oct ~ Nov.
Release Plan for v2.0
release: 1.11
Let's try to adopt the legend of annotations:
Epic
[ ] Improved NAS framework (Retiarii) on NNI, 40 days
[x] Support heterogeneous training service @SparkSnail PR #3097 (review: @squirrelsc @liuzhe-lz)
[ ] Release materials @scarlett2018 /@liuzhe-lz (after feature freeze 11/27, target end date 12/11)
Features
Tasks
[x] Port doc to rst and fix links, 2 day evaluation, @ultmaster
[x] Unify the names of trial job ID; @J-shang 9/16, PR ~#2896~ #3053 (review: @chicm-ms @QuanluZhang)
[x] Use NNI's FLOPs counter (refactor the counter first) @colorjam PR: #3048 review: @chicm-ms @QuanluZhang @ultmaster
[x] Bug fix of experiment resume, @J-shang 1 day (evaluated by quanlu)
[x] check experiment duration under the tuner_no_more_trial state (Issue #2758 ), @J-shang 1 day PR: #3043 @liuzhe-lz @QuanluZhang
[ ] AML scheduling (Issue #3014) @SparkSnail 1 day ~PR #3034 review: @QuanluZhang @chicm-ms ~
[x] HyperBand refactor @J-shang 3 days PR: #3040 review: @QuanluZhang @colorjam
[x] Add algorithm import shortcut @liuzhe-lz 3 days draft PR #3139
[x] AdaptDL integration test @SparkSnail
[x] Quantization export @linbinskn
[ ] update API reference for model compression in doc
[x] Import shortcut @ultmaster
[x] Pipeline (fix ut numpy dependency, check it) @ultmaster
[x] Fix NNI manager warning @J-shang
Stretch Goals & Deferred Items
[ ] TensorBoard backend support, 5 days @J-shang (@SparkSnail help)
[ ] multi-objective optimization algorithm, #2525 , 5 days - @ultmaster
[ ] Cream2NNI @hwpengms , pr #2705 , under refactoring (review @QuanluZhang )
[ ] Update TF model compression: support checkpoint, 3 days @liuzhe-lz
[ ] P2, 2.0-candidates, under design discussion - support post-training quantization speedup, 14 days - dev owner: @linbinskn @J-shang
[ ] P2 Update node/python JSON handling, 3 days - @liuzhe-lz
[ ] FBNetV3 tuning algorithm, #2591, #2357, 7 days - dev owner: @J-shang
[ ] HyperBand further refactor @J-shang
[ ] Merge .config and db.experimentProfile.param @J-shang
[ ] Support code coverage check, add this check to IT, 7 days @chicm-ms
[ ] Test classic NAS tuners on NNI NAS benchmark and report evaluation results, ? days, @98may
[ ] Search trial by parameter (issue #3024) @sharonxingyufan @Lijiaoa
[ ] Bump dependency version, @liuzhe-lz 5 days
Backlog