p3nGu1nZz / Tau

Tau LLM made with Unity 6 ML Agents
MIT License
11 stars 4 forks source link

Dev dataset oproof #3

Closed p3nGu1nZz closed 2 months ago

p3nGu1nZz commented 2 months ago

Description

This PR integrates the oproof Python package into the Tau project, enhancing the validation of prompt-response pairs using Ollama and Python. The integration includes updates to the BaseTask class and the implementation of iterative proofing to refine error handling. This change aims to improve the accuracy and robustness of the dataset processing pipeline.

Type of Change

Please delete options that are not relevant.

How Has This Been Tested?

The integration has been tested with both small and large datasets to ensure the oproof process works correctly and efficiently. The following tests were conducted:

Test Configuration:

Checklist:

Additional Notes

The iterative proofing process has been designed to continue until the number of errors stabilizes, ensuring a high-quality dataset. Future improvements may include further optimization of the proofing algorithm and additional validation steps.