numeralbank / numerals

Creative Commons Attribution 4.0 International

What happens when an entry is removed from Chan's site? #196

barlowrussell closed 2 years ago

barlowrussell commented 2 years ago

I'm wondering if Numeralbank has something in place for removing outdated entries when they are no longer included in Chan's web database.

For example, fata1247-3, "Nisa (Lautém), East Timor" no longer seems to be included in Chan's site (and the data look rather weird, so maybe it was pulled down for being incorrect?). I don't know what happened here, but I doubt it's the only case of something getting pulled down.

Similarly, bagu1251-1, "Bagusa, Indonesia" has been removed, since it was incorrectly assigned on the site. The data are now here:

https://lingweb.eva.mpg.de/channumerals/Trimuris.htm

... and so the entry in Numeralbank should be [trim1239].

Thanks! Russell

SimonGreenhill commented 2 years ago

You could email Eugene and ask him why they've been changed -- he's very happy to discuss numbers.

barlowrussell commented 2 years ago

I will! What I meant to ask, though, was whether there is something in the Chan-to-Numeralbank workflow to catch these expunged entries.

SimonGreenhill commented 2 years ago

Yes, so we need to know whether it's an error or a deliberate removal.

barlowrussell commented 2 years ago

Agreed. And if deliberate removals are a regular occurrence, then we need some way for them to be registered on our end (e.g., Eugene notifying us, or some fancier computer way).

Bibiko commented 2 years ago

I just want to clarify that Eugene's site https://lingweb.eva.mpg.de/channumerals is NOT the source anymore. The link should only be understood as a kind of hint for developers. The only source now is this repo. New data from Eugene now arrives as Excel files in raw/xlsx. From now on I will concentrate on adding new/changed data as soon as possible.

barlowrussell commented 2 years ago

Thanks for the clarification. So, with new data that he collects, Eugene is now both (1) updating his site and (2) sending an Excel file? Has he also been giving notification about data that should be removed?

Bibiko commented 2 years ago

A while ago I asked him to mention it whenever he deletes a dataset, but so far the only case has been that he renamed an HTML file, where the old file has to be removed from his site. I'll try to write a script that can detect such instances.
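
A minimal sketch of what such a check could look like, not the script actually used in this repo. It assumes we can get two snapshots of the site as {filename: page text} mappings (how they are collected is left open), and it treats a page that disappears under one filename and reappears with the same text under another as a probable rename. The names `fingerprint` and `compare_snapshots` are made up for illustration.

```python
from __future__ import annotations

from hashlib import sha256


def fingerprint(text: str) -> str:
    """Hash page text after collapsing whitespace, so spacing changes are ignored."""
    return sha256(" ".join(text.split()).encode("utf-8")).hexdigest()


def compare_snapshots(old: dict[str, str], new: dict[str, str]) -> dict[str, list]:
    """Compare two hypothetical {filename: page text} snapshots of the site.

    Returns three sorted lists:
      "renamed": (old_name, new_name) pairs whose text is identical
      "removed": filenames that vanished with no identical replacement
      "added":   filenames that are new and not the target of a rename
    """
    gone = set(old) - set(new)
    fresh = set(new) - set(old)

    # Index the new-only pages by content hash to spot renames.
    fresh_by_hash = {fingerprint(new[name]): name for name in sorted(fresh)}

    renamed, removed = [], []
    for name in sorted(gone):
        target = fresh_by_hash.get(fingerprint(old[name]))
        if target is not None:
            renamed.append((name, target))
        else:
            removed.append(name)

    rename_targets = {target for _, target in renamed}
    added = sorted(fresh - rename_targets)
    return {"renamed": renamed, "removed": removed, "added": added}
```

Anything that ends up under "removed" would still need a human decision (error or deliberate removal), and a page that was renamed and edited at the same time would show up as one removal plus one addition rather than as a rename.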

barlowrussell commented 2 years ago

That would be great -- thanks!