Transforming text into data to extract meaning and make connections. In development.
See also our paper, Heritage connector: A machine learning framework for building linked open data from museum collections, at https://doi.org/10.1002/ail2.23.
A set of tools to:
Collections as tabular data (left) vs knowledge graphs (right)
The main project page is here. We're also writing about our research on the project blog as we develop these tools and methods.
Some blog highlights:
We use pipenv for dependency management. You can also install dependencies from requirements.txt
and dev dependencies from requirements_dev.txt
.
Optional dependencies (for experimental features):
torch
, dgl
, dgl-ke
: KG embeddingsspacy-nightly
: export to spaCy KnowledgeBase for Named Entity Linking
Run python -m pytest
with optional --cov=heritageconnector
for a coverage report.
We use pytest
for tests, and all tests are in ./test.
To run web app (in development): python -m heritageconnector.web.app
Cite as:
Dutia, K, Stack, J. Heritage connector: A machine learning framework for building linked open data from museum collections. Applied AI Letters. 2021;e23. https://doi.org/10.1002/ail2.23