Closed fititnt closed 1 month ago
Humm. Turns out that the interface is loading all data (including the v0.5 https://sdm.etica.ai/v/0.5/) even without changes. However, the count preview is written into the tabs earlier than the computation is finished. So, do exist a minor problem.
One way to know the underlining data is (after wait to finish the loading) use the console to print SDMData
(or WorkingData
in the v0.5.0)
I released v0.5.3.3 beta (also a snapshot at https://sdm.etica.ai/v/0.5.3.3/). Turns out that both v0.5.3.2 (including the v0.5.0.0 documented in the diary) where loading the data into the memory (which could be debugged manually by dumping the variable SDMData or WorkingData). However, the part of the code that updates the statistics in the UI was doing it too early. This problem was less obvious with test data used in the diary, but with >2000 files from AllThePlaces it became very obvious.
Text description from the images:
Other comments:
While testing load all files from AllThePlaces dump (this dataset is used by https://codeberg.org/matkoniecz/improving_openstreetmap_using_alltheplaces_dataset/ and mentioned on https://www.openstreetmap.org/user/fititnt/diary/404303#comment57942) availible here https://www.alltheplaces.xyz/ (exact dump link https://alltheplaces-data.openaddresses.io/runs/2024-08-03-13-32-47/output.zip. description "(267.1M) 2,967,929 rows from 2625 spiders, updated Mon Aug 5 03:16:45 AM UTC 2024" I noticed the loading process:
I think the loading process needs to be reworked, even if make a bit slower on all cases. Currently, it tries to process all files asynchronously and this testcase have a lot of files: 2623 files (474 of these are empty geojson)