UHaifa-IS / whgazetteer-mehdie

World Historical Gazetteer - MEHDIE version
http://whgazetteer.org
BSD 3-Clause "New" or "Revised" License

Try a database approach to ER #193

Open tomersagi opened 4 months ago

tomersagi commented 4 months ago

Instead of batch matching, embed all place names in a given dataset and use a vector database to retrieve the most similar embeddings for each query name. Then use a second model to predict whether each retrieved candidate is a match, using all the available info on both records.
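A rough sketch of the two stages, to make the idea concrete. Everything here is an assumption for illustration, not code from this repo: `embed` builds sparse character-trigram vectors as a stand-in for a learned embedding model, `NameIndex` is a brute-force stand-in for a vector database collection, and `predict_match` is a hand-weighted placeholder (name similarity plus geographic distance, with made-up weights and threshold) where the second model would go.

```python
import math
from collections import Counter


def embed(name: str, n: int = 3) -> Counter:
    """Sparse character n-gram vector; a stand-in for a learned embedding."""
    padded = f"  {name.lower().strip()}  "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class NameIndex:
    """Brute-force nearest-neighbour search standing in for a vector database."""

    def __init__(self) -> None:
        self._rows = []  # (place_id, record, embedding)

    def add(self, place_id: str, record: dict) -> None:
        self._rows.append((place_id, record, embed(record["name"])))

    def query(self, name: str, k: int = 3) -> list:
        """Return the k most similar places as (similarity, place_id, record)."""
        q = embed(name)
        scored = [(cosine(q, emb), pid, rec) for pid, rec, emb in self._rows]
        scored.sort(key=lambda row: row[0], reverse=True)
        return scored[:k]


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp, dl = p2 - p1, math.radians(lon2 - lon1)
    h = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def predict_match(query: dict, candidate: dict, name_sim: float,
                  threshold: float = 0.6) -> bool:
    """Placeholder second stage: hypothetical weights, not a trained model."""
    dist = haversine_km(query["lat"], query["lon"],
                        candidate["lat"], candidate["lon"])
    geo_score = max(0.0, 1.0 - dist / 50.0)  # 0 beyond an assumed 50 km radius
    return 0.7 * name_sim + 0.3 * geo_score >= threshold


if __name__ == "__main__":
    index = NameIndex()
    index.add("haifa", {"name": "Haifa", "lat": 32.794, "lon": 34.989})
    index.add("jaffa", {"name": "Jaffa", "lat": 32.05, "lon": 34.75})
    query = {"name": "Hayfa", "lat": 32.80, "lon": 35.00}
    for sim, place_id, record in index.query(query["name"], k=2):
        print(place_id, round(sim, 3), predict_match(query, record, sim))
```

The point of the split is that the first stage only has to be cheap and high-recall, since it narrows the whole dataset down to a handful of candidates per query; the second stage can then afford to look at every field of each candidate pair.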

tomersagi commented 3 months ago

https://cookbook.chromadb.dev/embeddings/bring-your-own-embeddings/
