Open dcsw2 opened 2 years ago
Kaspar's random sample from The Sun: randsample0002194.csv
The four versions of the pipeline are here as Jupyter notebooks: https://github.com/Living-with-machines/toponym-resolution/tree/dev/experiments
Now waiting for updates to include candidates
@npedrazzini has done a nice analysis of processing times across the different T-Res methods: https://docs.google.com/spreadsheets/d/1ymjGPubsjq93VmCakBDxYOQW_TCXXxECG1OLG5n1YV8/edit#gid=175352557
Goal: document processing times for different settings in the pipeline (esp. DeezyMatch vs. perfectmatch)
Use sample set of articles from The Sun for this test.
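A minimal sketch of how the timings could be collected, in case it helps whoever picks this up. It does not call T-Res: `time_settings`, `fake_perfectmatch` and `fake_deezymatch` are hypothetical names, and the two `fake_*` functions are placeholders to be swapped for whatever callable runs each real pipeline setting on one article. The articles here are dummy strings; in practice they would come from the sample CSV, whose column layout is not assumed here.

```python
import statistics
import time
from typing import Callable, Dict, Iterable


def time_settings(
    settings: Dict[str, Callable[[str], object]],
    articles: Iterable[str],
    repeats: int = 3,
) -> Dict[str, Dict[str, float]]:
    """Run every setting over the same articles and summarise wall-clock time.

    `settings` maps a label (e.g. "perfectmatch") to a callable that
    processes one article text. Hypothetical interface, not the T-Res API.
    """
    articles = list(articles)  # materialise once so every setting sees the same input
    report: Dict[str, Dict[str, float]] = {}
    for name, resolve in settings.items():
        run_times = []
        for _ in range(repeats):
            start = time.perf_counter()
            for text in articles:
                resolve(text)
            run_times.append(time.perf_counter() - start)
        mean_s = statistics.mean(run_times)
        report[name] = {
            "mean_s": mean_s,                       # mean seconds per full pass
            "min_s": min(run_times),                # fastest full pass
            "per_article_s": mean_s / max(len(articles), 1),
        }
    return report


# Placeholder stand-ins for the real methods (assumptions, for illustration only).
def fake_perfectmatch(text: str) -> list:
    return text.split()


def fake_deezymatch(text: str) -> list:
    return sorted(text.split())


if __name__ == "__main__":
    sample = ["Meeting held at Leeds yesterday.", "Ship arrived at Hull."]
    results = time_settings(
        {"perfectmatch": fake_perfectmatch, "deezymatch": fake_deezymatch},
        sample,
    )
    for name, stats in results.items():
        print(name, stats)
```

Using `time.perf_counter` and repeating each pass keeps one-off costs (model loading, caching) from dominating a single measurement; if load time is itself of interest, it should be timed separately.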
TASKS
Structure of "report":