LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
https://open-assistant.io
Apache License 2.0

Add: Korean QA dataset #3551

Open CertifiedJoon opened 1 year ago

CertifiedJoon commented 1 year ago

For: #1157

link to dataset

This repository contains the Python code used to generate the Korean QA dataset. Korean QA is a dataset designed to evaluate how well models answer questions posed in natural-language Korean.

The dataset contains 1.74k instruction-answer pairs, all of which come from Naver Kin, the number one Q&A website in Korea.

Structure: [Instruction, Response, Source, Metadata]
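
As a rough illustration of that four-field structure, a single row could be modelled as below. This is only a sketch: the `QARecord` class, the placeholder text, the `"Naver Kin"` default, and the choice to serialize the metadata as a JSON string are assumptions for the example, not taken from the dataset's generation code.

```python
import json
from dataclasses import dataclass, field

# Column names as listed in the issue; exact casing in the real dataset is an assumption.
COLUMNS = ["Instruction", "Response", "Source", "Metadata"]


@dataclass
class QARecord:
    """Hypothetical container for one question/answer pair."""
    instruction: str                      # the question text
    response: str                         # the answer text
    source: str = "Naver Kin"             # assumed default, based on the stated origin
    metadata: dict = field(default_factory=dict)  # free-form extra fields (assumed)

    def to_row(self) -> dict:
        """Flatten the record into a dict keyed by the four column names."""
        return {
            "Instruction": self.instruction,
            "Response": self.response,
            "Source": self.source,
            # ensure_ascii=False keeps Korean text readable in the JSON string
            "Metadata": json.dumps(self.metadata, ensure_ascii=False),
        }


# Placeholder values only; not real dataset content.
example = QARecord(
    instruction="(question text)",
    response="(answer text)",
    metadata={"language": "ko"},
)
row = example.to_row()
```

Keeping the metadata as a JSON string keeps every column a plain string, which tends to be convenient for tabular formats; the actual dataset may store it differently.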