A Google Workflow now orchestrates the named entities post-extraction processing.
The scheduled workflow consists of three steps and can be found in the src/post_extraction_process/post-extraction-gc-workflow.yaml file.
The first two steps (i.e., creation of named_entities.named_entities_all and named_entities.named_entities_counts BigQuery tables) were first implemented by a scheduled queries in Big Query, which has now been deleted from the repository.
Documentation has been added or updated.
Checklists
This pull/merge request meets the following requirements:
Summary
A Google Workflow now orchestrates the named entities post-extraction processing.
The scheduled workflow consists of three steps and can be found in the
src/post_extraction_process/post-extraction-gc-workflow.yaml
file.The first two steps (i.e., creation of
named_entities.named_entities_all
andnamed_entities.named_entities_counts
BigQuery tables) were first implemented by a scheduled queries in Big Query, which has now been deleted from the repository.Documentation has been added or updated.
Checklists
This pull/merge request meets the following requirements:
docs/aqa/aqa_plan.md
)docs/aqa/data_log.md
), if necessarydocs/aqa/assumptions_caveats.md
), if necessarydocs
folderComments have been added below around the incomplete checks.