In the digital era, Wikipedia represents a comprehensive cross-domain source of knowledge with millions of contributors. The DBpedia project tries to extract structured information from Wikipedia and transform it into RDF. This helps anyone to ask sophisticated queries against Wikipedia.
The main classification system of DBpedia depends on human curation, which causes it to lack coverage, resulting in a large amount of untyped resources. DBTax provides an unsupervised approach that automatically learns a taxonomy from the Wikipedia category system and extensively assigns types to DBpedia entities, through the combination of several NLP and interdisciplinary techniques. It provides a robust backbone for DBpedia knowledge and has the benefit of being easy to understand for end users.
Follow these instructions step by step to set up the project locally on your machine.
Run the /root/DBTaxProject/src/main/java/org/dbpedia/dbtax/DBTaxPipeline.java file.
This project is This project is licensed under the Apache License 2.0- see the LICENSE file for details.