Closed saum7800 closed 8 months ago
Hi @saum7800 I can take another look when you've had a moment to make the above revisions!
Hey @neubig , I have resolved the comments that seemed like easy fixes, and left comments for a couple of them we can discuss. Please re-review whenever you get a chance. Thanks!
Description
We are adding a new component
DatasetTransformer
. The code currently contains one version ofDatasetTransformer
:PromptBasedDatasetTransformer
. It is used with the Dataset Retrieval step in order to transform retrieved data to a format that is more directly relevant to the task given by a user. Here is the broad flow:PromptBasedDatasetTransformer
object, and call thetransform_data
function with thePromptSpec
and the loaded dataset.transform_data
creates a prompt and calls the APIAgent which returns a "plan" for carrying out the transformation. This prompt uses the prompt_spec and example rows of the retrieved dataset.PromptSpec
. It then requests a batch complete from the APIAgent for all the transform_prompts.