PygmalionAI / data-toolbox

Our data munging code.
GNU Affero General Public License v3.0
34 stars 9 forks source link

Publish/Share the share_gpt.json file #13

Closed manyoso closed 1 year ago

manyoso commented 1 year ago

Hi, based on your toolbox it seems pygmallion has the sharegpt dataset. Since the get operation for this is now closed can you share the data you gathered for other model generation research purposes? Thanks!

0x000011b commented 1 year ago

Hello! I'm unsure what the license says regarding redistribution so I'd rather not link it directly to avoid possible problems, but if you go on HuggingFace datasets and search for vicuna you'll find several ShareGPT scrapes.