alphagov / govuk-content-metadata

GovNER: an encoder-based language model (RoBERTa) fine-tuned to perform Named Entity Recognition (NER) on GOV.UK content
MIT License
4 stars 1 forks source link

Updated post-extraction workflow to process for phase-2 entities too #91

Closed exfalsoquodlibet closed 1 year ago

exfalsoquodlibet commented 1 year ago

Summary

Updated workflow YAML file so that phase-2 extracted entities are also post-processed, aggregated and counted and made available in final format to users.

Checklists

This pull/merge request meets the following requirements:

Comments have been added below around the incomplete checks.