usc-isi-i2 / dig3-extractions

Apache License 2.0

Some extractions are missing provenance context #14

Closed ThomasSchellenbergNextCentury closed 7 years ago

ThomasSchellenbergNextCentury commented 7 years ago

Some extractions are missing their provenance context. This appears to be extractor-specific: city, phone, price, and review_id all have missing provenance information. Here is an example: https://dig3.memexproxy.com/elasticsearch/dig-etk-search/ads/B5FDC1EEA8EBCA711E59A866DBF6E84EB8E9677BF7223E371985D1122DCAC1AB
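One way to see which extractors are affected is to scan a retrieved document and count the extractions that have no context attached. The sketch below is only illustrative: the `extractions` and `context` keys, the per-field list layout, and the sample values are assumptions made for the example, not the actual dig-etk-search schema.

```python
from typing import Any, Dict


def extractors_missing_context(doc: Dict[str, Any]) -> Dict[str, int]:
    """Count, per field, the extractions that carry no 'context' entry.

    Assumes a hypothetical layout: doc["extractions"][field] is a list of
    dicts, each of which may or may not have a non-empty "context" key.
    """
    missing: Dict[str, int] = {}
    for field, extractions in doc.get("extractions", {}).items():
        count = sum(1 for e in extractions if not e.get("context"))
        if count:
            missing[field] = count
    return missing


# Hypothetical sample document, made up for illustration only.
sample = {
    "extractions": {
        "city": [{"value": "portland"}],
        "phone": [{"value": "5551234567"}, {"value": "5559876543"}],
        "name": [{"value": "anna", "context": {"start": 10, "end": 14}}],
    }
}

print(extractors_missing_context(sample))
```

Running a check like this over a batch of documents would show whether the gap is limited to specific fields, such as the four listed above, or is more widespread.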

saggu commented 7 years ago

It's not a bug. These extractors are either heavily customised or do not support context creation. Going forward, phone will have context once we phase out the current phone extractor and introduce the spaCy phone extractor.