mlcommons / ck

Collective Knowledge (CK) is an educational community project to learn how to run AI, ML and other emerging workloads in the most efficient and cost-effective way across diverse models, data sets, software and hardware using MLCommons CM (Collective Mind workflow automation framework)
https://cKnowledge.org
Apache License 2.0
605 stars 113 forks source link

Roadmap for CM, MLPerf and ML/SW/HW DSE: 20221116 #536

Closed gfursin closed 6 months ago

gfursin commented 1 year ago

Following the successful testing of CM end-to-end benchmarking and submission workflow for modular MLPerf benchmarks at the Student Cluster Competition at SuperComputing'22, we have prepared a new list of pending tasks for the MLCommons taskforce on education and reproducibility. The goal is to help the community automate their MLPerf submissions for MLPerf v3.0 and continue modularizing ML Systems and automating their benchmarking, optimization and design space exploration:

Community discussions (see the notes from weekly conf-calls)

Finish testing our end-to-end CM MLPerf submission workflow (small dataset)

RetinaNet

ResNet50

Compare C++ implementation with best performance (need to validate):

BERT

All other reference MLPerf implementations

Test and document how to run and tune other MLPerf scenarios

Add Power measurements to the CM MLPerf workflow

Finish testing our end-to-end MLPerf submission workflow (full dataset)

Design Space Exploration and testing

Misc

Documentation

Add non-reference (optimized) implementations

Improve testing and documentation of individual CM scripts:

Add support for Android

Enhancement projects (ideas)

Upcoming presentations

ctuning-admin commented 6 months ago

Will prepare a new plan based on our resources and bandwidth.