Closed decentral1se closed 5 years ago
Following from https://github.com/nickdickinson/WASHWeb/issues/1#issuecomment-461866250, I've found a nice document which I'll drop here. Looks like we need to use the thinking contained in this document (or if you have others, please share) in order to inform what we want to learn from this issue. Here's the page (from the very nice Neo4J documentation):
So, if we can draw it on a whiteboard, we're pretty much there ;)
I can explain the following and we can draw them out. The entities are:
This would be the graph.
However, a lot of knowledge out there is not in the shape of a graph. The external data sources like water point data (GPS coordinates / tables of data) add a lot of rich information.
As soon as we start look at full text analysis, since lots of information isn't formally organized, then we would potentially be looking at creating a taxonomy of WASH terms and linking them to the network above and using that to link concepts/entities in texts to the graphdb.
This is getting pretty complex but I'm curious if there are already tools out there to link for example graphdbs to elastic search or natural language processing and if so, then maybe it only sounds hard but it is actually doable.
OK, closing this one out until we gather more steam again!
(cleaning out my issue tracking list ...)