intel / dffml

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
https://intel.github.io/dffml/main/
MIT License

webui: Data Flow creation #363

Open pdxjohnny opened 4 years ago

pdxjohnny commented 4 years ago

DFFML has started its web UI.

Using something like https://github.com/alyssaxuu/flowy, we need to enable users to visually create dataflows.

flowy Demo

This is similar to but instead of creating them via the command line and manually editing YAML/JSON files, the user will use the web UI to specify the config properties for operations and link the inputs and outputs of various operations.
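As a rough illustration of what such a UI might hand back to the application, here is a minimal Python sketch of a flow description: operations with config properties, plus links from one operation's output to another's input. The `Operation`, `Link`, and `Flow` classes, their field names, and the example operation names are all hypothetical and are not DFFML's actual API or file format.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class Operation:
    """A node in the flow (hypothetical structure, not DFFML's)."""
    name: str
    inputs: list
    outputs: list
    config: dict = field(default_factory=dict)


@dataclass
class Link:
    """A connection from one operation's output to another's input."""
    source_op: str
    source_output: str
    target_op: str
    target_input: str


@dataclass
class Flow:
    operations: dict = field(default_factory=dict)
    links: list = field(default_factory=list)

    def add_operation(self, op):
        self.operations[op.name] = op

    def link(self, source_op, source_output, target_op, target_input):
        # Reject links that refer to unknown operations or ports, which is
        # the kind of check a visual editor would do when wires are drawn.
        src = self.operations.get(source_op)
        dst = self.operations.get(target_op)
        if src is None or dst is None:
            raise ValueError("unknown operation in link")
        if source_output not in src.outputs:
            raise ValueError(f"{source_op} has no output {source_output!r}")
        if target_input not in dst.inputs:
            raise ValueError(f"{target_op} has no input {target_input!r}")
        self.links.append(Link(source_op, source_output, target_op, target_input))

    def to_json(self):
        # asdict() recurses into the nested dataclasses, dicts, and lists.
        return json.dumps(asdict(self), indent=2, sort_keys=True)


# Example: two made-up operations wired together.
flow = Flow()
flow.add_operation(
    Operation("fetch_url", inputs=["url"], outputs=["body"], config={"timeout": 10})
)
flow.add_operation(Operation("count_words", inputs=["text"], outputs=["count"]))
flow.link("fetch_url", "body", "count_words", "text")
print(flow.to_json())
```

The JSON output stands in for the file a user would otherwise edit by hand; a real implementation would emit whatever format DFFML's dataflow loader expects.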

Notes

AutoML, or Automated Machine Learning, as the name suggests, automates the process of solving problems with Machine Learning. AutoML is generally helpful for people who aren't familiar with either Machine Learning or the programming involved. AutoML aims to improve the efficiency of any task involving Machine Learning.

The primary objective we are trying to achieve is to create a model that takes, as a property of its config, a set of models to be used for hyperparameter tuning. Another property of its config is the set of models which we should attempt to tune (via the first set). The default values for these properties result in using all installed models to try to tune all installed model plugins.
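The two config properties and their defaults could be sketched roughly as follows. This is only an illustration under stated assumptions: `AutoMLConfig`, `tuning_plan`, the field names `tuners` and `targets`, and the `INSTALLED_MODELS` list are hypothetical stand-ins, not DFFML's model or plugin API.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical stand-in for "all installed model plugins".
INSTALLED_MODELS = ["model_a", "model_b", "model_c"]


@dataclass
class AutoMLConfig:
    # Models used to do the hyperparameter tuning (None means all installed).
    tuners: Optional[List[str]] = None
    # Models we attempt to tune via the tuners (None means all installed).
    targets: Optional[List[str]] = None

    def resolved(self, installed: List[str]) -> Tuple[List[str], List[str]]:
        """Fill in the defaults from the list of installed models."""
        tuners = list(installed) if self.tuners is None else list(self.tuners)
        targets = list(installed) if self.targets is None else list(self.targets)
        return tuners, targets


def tuning_plan(config: AutoMLConfig, installed: List[str]) -> List[Tuple[str, str]]:
    """Return every (tuner, target) pair that would be tried."""
    tuners, targets = config.resolved(installed)
    return [(tuner, target) for tuner in tuners for target in targets]


# With defaults, every installed model is tried as a tuner for every model.
default_plan = tuning_plan(AutoMLConfig(), INSTALLED_MODELS)

# Restricting both sets narrows the plan to the requested pairs.
narrow_plan = tuning_plan(
    AutoMLConfig(tuners=["model_a"], targets=["model_b"]), INSTALLED_MODELS
)
```

Whether a model should be allowed to tune itself is left open here; the sketch simply takes the full cross product, matching the "all installed models tune all installed models" default described above.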

Due to the shortened GSoC cycle, we may end up not doing all of these phases. Which phase we reach will be decided as we approach the selection process.

Skills

Difficulty

Intermediate/Hard

Related Readings

Getting Started

Potential Mentors

programmer290399 commented 2 years ago

PerceptiLabs can be a very good inspiration as well.

Tagging: @pdxjohnny @mHash1m @sk-ip @sakshamarora1 @yashlamba

programmer290399 commented 2 years ago

Orchest has objectives very similar to the web UI part of our project. We can take some inspiration from them as well.

Adding another drop to the ocean of your notifications: @pdxjohnny 😅

programmer290399 commented 2 years ago

Another nice UI: https://noisecraft.app/
GitHub Repo: https://github.com/maximecb/noisecraft

pdxjohnny commented 2 years ago

Most flow-based tools are developer-facing. We're talking about enabling arbitrary software to let its users define flows that solve domain-specific problems.

pdxjohnny commented 2 years ago

We want to do this project because we want to enable embeddability, with the ability to tailor the dataflow definition UI components to the use case of the application that depends on them.

We should look at the work we do here as if we are working towards introducing a new input type to HTML.

We're going for <input type="dataflow"> to be ubiquitous, because it is useful for applications to let users define domain-specific flows.

Part of this is the embeddability, with the ability to tailor for the use case, too.

pdxjohnny commented 1 year ago

https://github.com/logspace-ai/langflow