aihackerleague / management

Project Management
MIT License

Implement AI Judging System #20

Open CraftsMan-Labs opened 2 months ago

CraftsMan-Labs commented 2 months ago

Objective

Develop a robust AI Judging System to evaluate code submissions quickly and objectively in the AI Hacking League.

Tasks

  1. API Integration:
    • Integrate OpenAI's GPT-4 model through their API.
    • Ensure secure and efficient API communication.

  2. Custom Prompts Development:
    • Design prompts that focus on functionality, innovation, efficiency, and code quality.
    • Test and refine prompts to achieve accurate evaluations.

  3. Scoring Algorithm Design:
    • Construct a weighted scoring system based on evaluation criteria.
    • Validate the scoring methodology with test datasets.

  4. Feedback Generation:
    • Develop a mechanism to provide detailed feedback on submissions.
    • Ensure feedback includes strengths, weaknesses, improvement suggestions, and comparative analysis.
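
As a starting point for task 3, here is a minimal sketch of a weighted scoring function over the four criteria named in task 2. The weights, the 0-10 scale, and the names `WEIGHTS` and `weighted_score` are assumptions for illustration only; the issue does not specify any of them.

```python
# Hypothetical weights: the issue lists the criteria but not how to weight them.
WEIGHTS = {
    "functionality": 0.4,
    "innovation": 0.2,
    "efficiency": 0.2,
    "code_quality": 0.2,
}


def weighted_score(scores, weights=WEIGHTS):
    """Combine per-criterion scores (assumed 0-10 each) into one 0-10 score."""
    missing = set(weights) - set(scores)
    if missing:
        raise ValueError(f"missing criteria: {sorted(missing)}")
    for name in weights:
        if not 0 <= scores[name] <= 10:
            raise ValueError(f"score for {name!r} must be between 0 and 10")
    # Normalise by the weight total so the weights need not sum to exactly 1.
    total_weight = sum(weights.values())
    return sum(scores[name] * w for name, w in weights.items()) / total_weight


if __name__ == "__main__":
    # Example per-criterion scores, e.g. as parsed from a model's evaluation.
    example = {"functionality": 8, "innovation": 6, "efficiency": 7, "code_quality": 9}
    print(round(weighted_score(example), 2))
```

Dividing by the weight total keeps the result on the same scale as the inputs even if the weights are later retuned against test datasets.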

Technical Requirements

wubbyweb commented 1 month ago

@ohdearquant is working on this.