Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
https://unity.com/products/machine-learning-agents

Supervised and Unsupervised Learning Support #4257

Closed: donamin closed this issue 4 years ago

donamin commented 4 years ago

I am developing an ML project using ml-agents. One issue I'm struggling with right now is that the framework has been developed solely for reinforcement learning. But what about other types of learning, i.e., supervised and unsupervised learning? For example, part of my project requires an out-of-the-box neural network trained on offline data. I believe that, with all the cool features in ml-agents, it would be an easy task to add support for other types of learning.

vincentpierre commented 4 years ago

Hi, you are correct: ML-Agents is built around reinforcement learning. We are working on other tools for supervised learning, such as the Unity Perception SDK. If you want to use ML-Agents for supervised learning, you can collect data with a reinforcement learning loop and then save and reuse that data to train a supervised learning model. I will mention your interest in supervised and unsupervised learning to the team. Do you have specific use cases in mind?
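A rough sketch of that collect, save, and reuse workflow follows. It deliberately avoids any ML-Agents API: `ToyEnv`, `collect_transitions`, `fit_action_gain`, and the file name `toy_transitions.json` are hypothetical names made up for illustration, and the environment is a one-dimensional stand-in for a real simulation. The supervised step here is an ordinary least-squares fit of how an action changes the observation.

```python
import json
import random
import tempfile
from pathlib import Path


class ToyEnv:
    """Hypothetical stand-in for a simulation; not an ML-Agents class."""

    def reset(self):
        self.position = 0.0
        return self.position

    def step(self, action):
        # True dynamics that the supervised model will try to recover.
        self.position += 0.5 * action
        return self.position


def collect_transitions(env, num_steps, seed=0):
    """Run a random policy and record (obs, action, next_obs) rows."""
    rng = random.Random(seed)
    obs = env.reset()
    rows = []
    for _ in range(num_steps):
        action = rng.uniform(-1.0, 1.0)
        next_obs = env.step(action)
        rows.append({"obs": obs, "action": action, "next_obs": next_obs})
        obs = next_obs
    return rows


def fit_action_gain(rows):
    """Least-squares fit of w in: next_obs - obs = w * action."""
    numerator = sum(r["action"] * (r["next_obs"] - r["obs"]) for r in rows)
    denominator = sum(r["action"] ** 2 for r in rows)
    return numerator / denominator


# Collect once, save to disk, then reload the file for offline training.
data_path = Path(tempfile.gettempdir()) / "toy_transitions.json"
data_path.write_text(json.dumps(collect_transitions(ToyEnv(), 200)))
dataset = json.loads(data_path.read_text())
learned_gain = fit_action_gain(dataset)
```

The point of the split is that the saved file is decoupled from the simulation: once it exists, any supervised learning library can train on it without the environment running.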

donamin commented 4 years ago

Thanks for the reply.

Well, the use cases can be very generic. For example, RL systems (such as this one) can have several supervised or unsupervised components that help the decision-making process. I'm particularly interested in using generative models in Unity. That requires a library that lets you design the neural network architecture and choose the loss function yourself.
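To make the request concrete, here is a minimal sketch of what "design the architecture and choose the loss yourself" could look like. It is plain Python with no ML library, and every name in it (`predict`, `squared_error`, `train`, and so on) is hypothetical. Gradients are estimated by finite differences purely to keep the example short; a real library would use automatic differentiation.

```python
import math
import random

HIDDEN = 3  # number of hidden units; an arbitrary choice for this sketch


def init_params(seed=0):
    """Flat parameter list: w1[HIDDEN], b1[HIDDEN], w2[HIDDEN], b2."""
    rng = random.Random(seed)
    return [rng.uniform(-0.5, 0.5) for _ in range(3 * HIDDEN + 1)]


def predict(params, x):
    """User-defined architecture: scalar input, one tanh layer, scalar output."""
    w1 = params[0:HIDDEN]
    b1 = params[HIDDEN:2 * HIDDEN]
    w2 = params[2 * HIDDEN:3 * HIDDEN]
    b2 = params[3 * HIDDEN]
    hidden = [math.tanh(w * x + b) for w, b in zip(w1, b1)]
    return sum(w * h for w, h in zip(w2, hidden)) + b2


def squared_error(pred, target):
    """User-chosen loss; any function of (pred, target) can be swapped in."""
    return (pred - target) ** 2


def mean_loss(params, data, loss_fn):
    return sum(loss_fn(predict(params, x), y) for x, y in data) / len(data)


def numerical_grad(params, data, loss_fn, eps=1e-5):
    """Central finite-difference gradient of the mean loss."""
    grad = []
    for i in range(len(params)):
        up = list(params)
        down = list(params)
        up[i] += eps
        down[i] -= eps
        diff = mean_loss(up, data, loss_fn) - mean_loss(down, data, loss_fn)
        grad.append(diff / (2 * eps))
    return grad


def train(params, data, loss_fn, steps=300, lr=0.05):
    """Plain gradient descent on whichever loss the caller passes in."""
    for _ in range(steps):
        grad = numerical_grad(params, data, loss_fn)
        params = [p - lr * g for p, g in zip(params, grad)]
    return params


# Offline data: samples of sin(x) on [-2, 2].
data = [(x / 10, math.sin(x / 10)) for x in range(-20, 21)]
initial_params = init_params()
initial_loss = mean_loss(initial_params, data, squared_error)
trained_params = train(initial_params, data, squared_error)
final_loss = mean_loss(trained_params, data, squared_error)
```

Because `predict` and the loss are ordinary functions passed into `train`, replacing either one does not touch the training loop, which is the kind of flexibility described above.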

vincentpierre commented 4 years ago

Ha, I thought you wanted to skip RL and do only supervised learning. In our trainers, we use supervised learning in the behavioral cloning module and GANs in the GAIL reward module. If you think of other generic RL components that can help learning, let us know. We currently do not have a solution for animations, as we are focused on generating behaviors.
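For readers unfamiliar with the term, behavioral cloning treats expert demonstrations as a labeled dataset and fits a policy to them with a supervised loss. The sketch below shows the general idea only and is not the ML-Agents implementation: `expert_policy`, `clone_policy`, and `cloned_policy` are hypothetical names, the observation is a single number, and the policy is a logistic regression trained with cross-entropy.

```python
import math
import random


def expert_policy(obs):
    """Hypothetical expert: take action 1 when the observation exceeds 0.3."""
    return 1 if obs > 0.3 else 0


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def clone_policy(demos, steps=2000, lr=0.5):
    """Fit P(action = 1 | obs) = sigmoid(w * obs + b) by gradient descent
    on the mean cross-entropy between predictions and expert actions."""
    w, b = 0.0, 0.0
    n = len(demos)
    for _ in range(steps):
        errors = [(sigmoid(w * obs + b) - action, obs) for obs, action in demos]
        grad_w = sum(err * obs for err, obs in errors) / n
        grad_b = sum(err for err, _ in errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def cloned_policy(w, b, obs):
    """Act greedily with respect to the learned action probability."""
    return 1 if sigmoid(w * obs + b) > 0.5 else 0


# Demonstrations: (observation, expert action) pairs.
rng = random.Random(0)
observations = [rng.uniform(-1.0, 1.0) for _ in range(200)]
demos = [(obs, expert_policy(obs)) for obs in observations]

w, b = clone_policy(demos)
accuracy = sum(cloned_policy(w, b, obs) == action for obs, action in demos) / len(demos)
```

No reward signal appears anywhere in this loop; the only training signal is agreement with the expert, which is what makes it supervised rather than reinforcement learning.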

github-actions[bot] commented 3 years ago

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.