dice-group / dice-embeddings

Hardware-agnostic Framework for Large-scale Knowledge Graph Embeddings
MIT License
48 stars 13 forks source link

Serializing Knowledge Graphs #6

Closed Demirrr closed 3 years ago

Demirrr commented 3 years ago

While we are using our models on DBpedia knowledge graph containing 10^8 triples, we observe serialization and deserialization of large data can be useful.

  1. Serialize and Deserialize (KG object).
  2. Use the path of serialized object as a parameters so that we could avoid reparsing large input file, if

Number of triples: 375_434_888 Number of entities: 114747963 Number of relations: 13906

Demirrr commented 3 years ago

By using --deserialize_flag, we are able to deserialized a KG. This allow us to avoid reparsing large KGs.