Closed barlowrussell closed 2 years ago
You could email Eugene and ask him why they've been changed -- he's very happy to discuss numbers.
I will! What I meant to ask, rather, was whether there is something in the Chan-to-Numeralbank workflow to catch these expunged entries.
Yes, so we need to know whether it's an error or a deliberate removal.
Agreed. And if deliberate removals are a regular occurrence, then we need some way for them to be registered on our end (e.g., Eugene notifying us, or some fancier computer way).
I just want to clarify that Eugene's site https://lingweb.eva.mpg.de/channumerals is NOT the source anymore. The link should only be understood as a kind of hint for developers. The only source is now this repo. New data from Eugene now arrives as Excel files in raw/xlsx. From now on I will concentrate on adding new/changed data as soon as possible.
Thanks for the clarification. So, with new data that he collects, Eugene is now both (1) updating his site and (2) sending an Excel file? Has he also been giving notification about data that should be removed?
A while ago I asked him to let us know whenever he deletes a dataset, but so far the only case has been an HTML file that he renamed, which has to be removed from his site. I'll try to write a script which can detect such instances.
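A minimal sketch of what such a check could look like, assuming we can get one list of the entry IDs currently held in this repo and one list of the IDs present in Eugene's latest delivery. The function name `find_removed` is hypothetical, the ID `abcd1234-1` is made up for illustration, and how the two lists are actually extracted (e.g. from the files in raw/xlsx) is left open:

```python
def find_removed(known_ids, current_ids):
    """Return the IDs we hold that are missing from the current upstream listing.

    Both arguments are iterables of entry IDs (strings). The result is sorted
    so that reports are stable between runs.
    """
    return sorted(set(known_ids) - set(current_ids))


# Illustrative input only: two IDs mentioned in this thread plus a made-up one.
known = ["fata1247-3", "bagu1251-1", "abcd1234-1"]
current = ["abcd1234-1"]

for entry_id in find_removed(known, current):
    # Each hit still needs a human decision: error, rename, or deliberate removal.
    print(f"no longer upstream: {entry_id}")
```

This only flags candidates. It cannot tell a deliberate removal from a rename or a mistake, so each hit would still have to be checked with Eugene.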
That would be great -- thanks!
I'm wondering if Numeralbank has something in place for removing outdated entries when they are no longer included in Chan's web database.
For example, fata1247-3, "Nisa (Lautém), East Timor" no longer seems to be included in Chan's site (and the data look rather weird, so maybe it was pulled down for being incorrect?). I don't know what happened here, but I doubt it's the only case of something getting pulled down.
Similarly, bagu1251-1, "Bagusa, Indonesia" has been removed, since it was wrongly assigned on the site. The data are now here:
https://lingweb.eva.mpg.de/channumerals/Trimuris.htm
... and so the entry in Numeralbank should be [trim1239].
Thanks! Russell