This is a multilingual benchmark for dialogue generation containing real-life Reddit conversations (parent and response comment pairs) in 46 languages, including Indonesian, Tagalog and Vietnamese. English translations are also provided for comments.
Dataloader name:
mdia/mdia.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?mdia