This dataset is collected from electronic newspapers published on the web and is provided by the VLSP organization. It consists of approximately 15k sentences, each of which contains named entity (NE) information in the IOB annotation format.
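To illustrate the IOB format mentioned above, here is a minimal sketch that groups per-token IOB tags into entity spans. The function name `iob_to_spans`, the example sentence, and the tag names `PER` and `LOC` are assumptions made for illustration; they are not taken from the dataset or its dataloader.

```python
from typing import List, Tuple


def iob_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Group IOB tags into (entity_type, start, end) spans; end is exclusive."""
    spans = []
    start, current = None, None
    for i, tag in enumerate(tags):
        # The open span ends on "O", on any "B-", or on an "I-" of another type.
        closes = (
            tag == "O"
            or tag.startswith("B-")
            or (tag.startswith("I-") and tag[2:] != current)
        )
        if closes:
            if current is not None:
                spans.append((current, start, i))
            start, current = None, None
        # A "B-" tag opens a span; a stray "I-" is leniently treated as an opening.
        if tag.startswith("B-") or (tag.startswith("I-") and current is None):
            start, current = i, tag[2:]
    # Flush a span that runs to the end of the sentence.
    if current is not None:
        spans.append((current, start, len(tags)))
    return spans


# Hypothetical example sentence (not drawn from the dataset).
tokens = ["Ông", "Nguyễn", "Văn", "An", "sống", "ở", "Hà", "Nội"]
tags = ["O", "B-PER", "I-PER", "I-PER", "O", "O", "B-LOC", "I-LOC"]

for label, begin, end in iob_to_spans(tags):
    print(label, " ".join(tokens[begin:end]))
```

Each token carries one tag: `B-` marks the first token of an entity, `I-` a continuation, and `O` a token outside any entity. The actual label set and token segmentation used by the dataset may differ from this sketch.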
Subsets
-
Languages
vie
Tasks
Named Entity Recognition
License
Creative Commons Attribution Non Commercial 4.0 (cc-by-nc-4.0)
Dataloader name:
vlsp2016_ner/vlsp2016_ner.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?vlsp2016_ner