wwzeng1 / gpt-prompt-engineer

MIT License
0 stars 0 forks source link

gpt-prompt-engineer

Twitter Follow Open Main Version In Colab Open Classification Version In Colab

Overview

Prompt engineering is kind of like alchemy. There's no clear way to predict what will work best. It's all about experimenting until you find the right prompt. gpt-prompt-engineer is a tool that takes this experimentation to a whole new level.

Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best.

Features

Setup

  1. Open the notebook in Google Colab or in a local Jupyter notebook. For classification, use this one.

  2. Add your OpenAI API key to the line openai.api_key = "ADD YOUR KEY HERE".

  3. If you have GPT-4 access, you're ready to move on. If not, change CANDIDATE_MODEL='gpt-4' to CANDIDATE_MODEL='gpt-3.5-turbo'. If you're using the classification version, and don't have GPT-4 access, change model='gpt-4' in the second cell to model='gpt-3.5-turbo'.

How to Use

  1. Define your use-case and test cases. The use-case is a description of what you want the AI to do. Test cases are specific prompts that you would like the AI to respond to. For example:
description = "Given a prompt, generate a landing page headline." # this style of description tends to work well

test_cases = [
    {
        'prompt': 'Promoting an innovative new fitness app, Smartly',
    },
    {
        'prompt': 'Why a vegan diet is beneficial for your health',
    },
    {
        'prompt': 'Introducing a new online course on digital marketing',
    },
    {
        'prompt': 'Launching a new line of eco-friendly clothing',
    },
    {
        'prompt': 'Promoting a new travel blog focusing on budget travel',
    },
    {
        'prompt': 'Advertising a new software for efficient project management',
    },
    {
        'prompt': 'Introducing a new book on mastering Python programming',
    },
    {
        'prompt': 'Promoting a new online platform for learning languages',
    },
    {
        'prompt': 'Advertising a new service for personalized meal plans',
    },
    {
        'prompt': 'Launching a new app for mental health and mindfulness',
    }
]

For the classification version, your test cases should be in the format:

test_cases = [
    {
        'prompt': 'I had a great day!',
        'output': 'true'
    },
    {
        'prompt': 'I am feeling gloomy.',
        'output': 'false'
    },
    // add more test cases here
]
  1. Choose how many prompts to generate. Keep in mind, this can get expensive if you generate many prompts. 10 is a good starting point.

  2. Call generate_optimal_prompt(description, test_cases, number_of_prompts) to generate a list of potential prompts, and test and rate their performance. For the classification version, just run the last cell.

  3. The final ELO ratings will be printed in a table, sorted in descending order. The higher the rating, the better the prompt.

    Screen Shot 2023-07-04 at 11 48 45 AM

For the classification version, the scores for each prompt will be printed in a table (see the image above).

Contributions are welcome! Some ideas:

License

This project is MIT licensed.

Contact

Matt Shumer - @mattshumer_

Project Link: https://github.com/mshumer/gpt-prompt-engineer

Lastly, if you want to try something even cooler than this, sign up for Personal Assistant (most of my time is spent on this). It's basically an AI that can operate your web browser to complete tasks for you.