I also created two yaml files for the LLM to determine whether the question from the test set and the candidate question are rephrased versions of each other.
nemo_skills/inference/prompt/openai/math-detect-few-shot.yaml
nemo_skills/inference/prompt/openai/math-detect-zero-shot.yaml
We can leverage the generate_solutions.py code if we host our Llama-405B model as outlined below:
I developed a pipeline to extract similar questions from the training data corresponding to the test data.
We can use the following command to retrieve similar questions, which will then be saved back to the original_test.jsonl file.
I also created two yaml files for the LLM to determine whether the question from the test set and the candidate question are rephrased versions of each other. nemo_skills/inference/prompt/openai/math-detect-few-shot.yaml nemo_skills/inference/prompt/openai/math-detect-zero-shot.yaml
We can leverage the generate_solutions.py code if we host our Llama-405B model as outlined below:
I also created a contaminator type similar to our math-grader if we want to use GPT-4 directly. We can also use GPT-4 using the following command: