Welcome to the Tau LLM Unity ML Agents project! This project aims to build a Language Model (LLM) from scratch using Unity, ML-Agents, and Sentence Transformers.
The Tau LLM Unity ML Agents project is an innovative initiative to create an intelligent chatbot using reinforcement learning and natural language processing techniques. By leveraging Unity's ML-Agents toolkit and Sentence Transformers, we aim to develop a robust and efficient language model.
all-MiniLM-L6-v2
model from Hugging Face.Clone the repository:
git clone https://github.com/yourusername/tau-llm-unity-ml-agents.git
cd Tau\MLAgentsProject
Install Python dependencies:
.\Scripts\setup.bat
Open the project in Unity:
Set up ML-Agents:
Clone ML-Agents Repository:
git clone --branch develop https://github.com/Unity-Technologies/ml-agents.git
Add ML-Agents Packages Locally:
ml-agents
repository and select the com.unity.ml-agents
directory.com.unity.ml-agents.extensions
directory.Configure the training environment:
config.yaml
file to set up your training parameters.Start training:
mlagents-learn .\config\tau_agent_sac.yaml --run-id=tau_agent_sac_A1 --env .\Build\ --torch-device cuda --timeout-wait 300 --force
.\config\tau_agent_sac.yaml
: Path to the configuration file.--run-id=tau_agent_sac_A1
: Unique identifier for the training run.--env .\Build\
: Path to the build directory.--torch-device cuda
: Specifies the use of a CUDA-enabled GPU for training.--timeout-wait 300
: Sets the timeout wait time.--force
: Forces the training to start even if there are warnings.Monitor training:
tensorboard --logdir results
Load the trained model in Unity:
Test the agent:
Assets/
: Unity project assets.Build/
: The directory where we export our builds to.configs/
: Configuration files for training.Data/
: Directory used to store the model's runtime data and training data for the database and other purposes.Logs/
: Where log files generated by Tau and Python packages are located.ml-agents/
: The directory where ML Agents repo is cloned intoModels/
: Trained models and configurations.results/
: Training results and logs.Scripts/
: Custom scripts for agents and environment setup.venv/
: Python virtual environment directory where our packages are installed intoWe welcome contributions from the community! Please read our contributing guidelines for more information.
This project is licensed under the MIT License. See the LICENSE file for details.