marcderbauer / humanitarianKG

0 stars 0 forks source link

Humanitarian KG

What is This Repository About?

As the name might hint at, this repository's goal is to provide structured knowledge to humanitarian organisations and decisionmakers.
It follows a standard ETL-Pipeline scheme with the following three steps:

Technical Aspects

  1. Extract: Extract text from a given source (e.g. HTML, PDF, TXT or maybe even from an API).
  2. Transform: Take the text from step (1) and extract useful information into relational triplets.
  3. Load: Take the relational triplets and load them into a Graph Database (Neo4j).

For now the main focus is to construct an MVP. This will be a proof of concept on how to use LLMs to construct a Knowledge Graph. Crucial to this is ensuring that the knowledge extracted can be traced throughout the whole system from the extraction step all the way into the KG database.

Topical Aspects

During the development of this repository, the topical focus will be on the Sudans. They unfortunately haven't received much news coverage, despite both facing severe humanitarian crises.

Once the technology works as intended, there are a few different directions I might develop this into:

Current State of Development

You can track the current state of this project via GitHub tickets. I'm trying to be as transparent as possible with its' development.
Should you have any requests or questions, please feel free to create a ticket or reach out to me via email.