Closed mubaldino closed 3 years ago
in v3.3, gazetteer scripts in ./solr/
were refactored heavily
release v3.4: SQLite-based gazetteer curation is far more fluid and manageable. Still relies on Merged gaz file, but for now this is a simpler means of integrating multiple data sets and avoiding the complexity of Solr plugins/extension
./solr/
scripts and opensextant python lib are refactored heavily./solr/build.sh
and ./solr/build.xml
work together to stage and finalize the SQLite and Solr indicesxponents-gazetteer-meta.jar
is no longer used. Data is published with main SDK JAR.
Type of Feature:
Description of Feature
Use Python Pandas and SQLite to stage all data sources in order to support the Merged Gazetteer output.
The current Gazetteer project is dependent on Kettle v6 to v9 and Java 8. There is now some incompatibility of the project with a git checkout on linux -- Kettle "spoon" script outputs an error on "Line 130, Column 69: Invalid Escape Sequence" ... but does not mention what file or what phase of processing.
This is not worth fixing in Kettle and Gaz project. Much easier to reimplement.