This dataset is designed for named entity recognition (NER) tasks in the Bahasa Indonesia tourism domain. It contains labeled sequences of named entities, including locations, facilities, and tourism-related entities. The dataset is annotated with the following entity types: - B-WIS: Beginning of a tourism-related entity. - I-WIS: Continuation of a tourism-related entity. - B-LOC: Beginning of a location entity. - I-LOC: Continuation of a location entity. - B-FAS: Beginning of a facility entity. - I-FAS: Continuation of a facility entity. - O: Non-entity or other words not falling into the specified categories.
Dataloader name:
indoner_tourism/indoner_tourism.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?indoner_tourism