Open elizlee opened 2 years ago
Thoughts so far:
Strategy for event task 1: create a dict with all PB frames that I don't believe we'd be interested in extracting events for.
Event task 2: I still need to investigate why there are duplicate events.
Entity task: Several "event" arguments yield nominal qnodes, like in the provided example ("research"). How much would we be losing if we filtered these out? Is there some other check we can use instead?
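A minimal sketch of the exclusion dict from event task 1, assuming "PB frames" means PropBank roleset IDs and that each extracted event carries its frame under a `pb_frame` key. The key name, the helper names, and the two entries in the table are placeholders for illustration, not a vetted list or the pipeline's actual schema.

```python
# Hypothetical exclusion table: PB frame ID -> note on why it is skipped.
# Both entries are placeholders; the real list still has to be compiled.
EXCLUDED_PB_FRAMES = {
    "be.01": "example entry",
    "have.03": "example entry",
}


def keep_event(pb_frame: str) -> bool:
    """Return True if the frame is not in the exclusion table."""
    return pb_frame not in EXCLUDED_PB_FRAMES


def filter_events(events):
    """Drop events whose (assumed) "pb_frame" field is excluded."""
    return [e for e in events if keep_event(e.get("pb_frame", ""))]
```

A dict rather than a set leaves room to record why each frame was excluded, which could help when revisiting the list later.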
Examples are from /nas/gaia/users/elee/phase3_test/cdse_dryrun_id_match/WORKING/en/isi_ttl_output/output.json.
Reproduce by running the cdse-covid pipeline with input /nas/gaia/users/elee/phase3_test/cdse_dryrun_id_match/WORKING/en/ttl_output/claim_all.json through all steps until convert_claims_to_json.py is finished. Observe the json output.
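As a possible starting point for the duplicate-event investigation in event task 2, the events in the json output could be grouped by a few identifying fields and counted. This is only a sketch: the layout of output.json is not described here, so the function takes a plain list of event dicts, and the default field names `text` and `pb_frame` are assumptions to be replaced with whatever the output actually contains.

```python
import json
from collections import Counter


def find_duplicates(events, key_fields=("text", "pb_frame")):
    """Count events that share the same values for key_fields.

    Values are serialized with json.dumps so that nested (unhashable)
    values can still be used as part of the grouping key.
    """
    counts = Counter(
        tuple(json.dumps(e.get(f), sort_keys=True) for f in key_fields)
        for e in events
    )
    # Keep only the keys that occur more than once.
    return {key: n for key, n in counts.items() if n > 1}
```

Varying `key_fields` may help narrow down whether the duplicates are exact copies or differ in some field such as the argument spans.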