Wikipedia Graph Processor is a tool written in Python 3.6 aiming to process wikipedia graphs and extract communities and give a description of their topic.
You need:
The tool can be run using this command
./WikipediaGraphProcessor.py --arguments
or
python3 WikipediaGraphProcessor.py --arguments
-h
,--help
: show help message and exit-jdk8
,--jdk8Path
: path to jdk8. Default: /usr/lib/jvm/java-1.8.0-openjdk-amd64-n4jadr
,--neo4jAddress
: neo4j database address. Default: bolt://localhost:7687-n4jusr
,--neo4jUsername
: neo4j database username. Default: neo4j-n4jpwd
,--neo4jPassword
: neo4j database password. Default: neo4j-ip
,--inputPath
: path of the directory containing the graphs. Default: graphs/-n
,--nOfClusters
: max number of clusters to extract. Default: 20-op
,--outputPath
: path of the output directory. Default: output/This tool outputs as many folders as graphs. Each folder contains a .csv file with the clusters and their respective description and a .gexf file where each node is assigned a class (cluster) and the class' description.