neulab / prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions
Apache License 2.0
1.96k stars 177 forks source link

High quality few-shots from retrieved dataset #327

Open rayendito opened 1 year ago

rayendito commented 1 year ago

Instead of only using examples given by the users in the user prompt, what if we try to use retrieved datasets (if applicable) as high-quality shots for the dataset generator?

zhaochenyang20 commented 1 year ago

Perfect idea. After we implement the autopilot dataset retriever, we can add your idea!

neubig commented 1 year ago

Actually I think that our existing dataset retriever is probably fine, I don't think this issue is blocked by anything. @rayendito , if you'd like to take a stab at it I could assign the issue to you!

rayendito commented 1 year ago

sure! i'll be happy to play around with it

bilal-aamer commented 10 months ago

@neubig Sir, Is this still being done by @rayendito, would love to collaborate on this.

neubig commented 10 months ago

Hi @bilal-aamer , definitely go ahead and play around with this unless @rayendito has already finished!

rayendito commented 10 months ago

hi @bilal-aamer I did a somewhat MVP implementation for this some months ago in https://github.com/rayendito/prompt2model/tree/sample-from-dataset but I haven't made any PRs yet since I figured that we need to figure out if this method actually yields better results. I've been paying attention to the discussion on the #multilingual channel on Discord and chimed in a few times. I thought maybe a core team member was already working on this and I sort of spectated because I didn't want to overstep any territory😅 but if @neubig says go ahead then go for it! I'm actually particularly interested in MT and am currently exploring maybe a more specific version of this issue (MT only)

neubig commented 10 months ago

Oh, go ahead! I think this is distinct methodologically from what @VanyaBK is looking at, so please go ahead and run whatever you want to.

And @bilal-aamer maybe you could discuss with @rayendito and see if there's anything he could use help with on the discord. I'm happy to pitch in to the discussion.