Bukit-Vista / roadmap

0 stars 0 forks source link

Ability to understand the usage and success rate for each category and tools #12

Open bstiawan opened 3 weeks ago

bstiawan commented 3 weeks ago

problem:

  1. Reliability for categorization
  2. Reliability for selecting tools inside a category
  3. Which tools resulting into an error or no return

user: Product Manager

resource:

  1. GAIA execution log
  2. Agent response log

solution:

bstiawan commented 3 weeks ago

@krisnaBukitVista shouldn't this falls under reliability? Each supervisor should measure the performance of their product.