Vi Pubmed (or Vietnamese Pubmed) is a corpus of PubMed biomedical abstracts translated into Vietnamese by the state-of-the-art English-Vietnamese Translation project. The data has been used as an unlabeled dataset for pretraining a Vietnamese biomedical-domain Transformer model.
Dataloader name: vi_pubmed/vi_pubmed.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?vi_pubmed