xpdAcq / rapidz

Fork of Streamz
https://xpdacq.github.io/rapidz
BSD 3-Clause "New" or "Revised" License
5 stars 7 forks source link

Graph as Merkel tree ish thing #36

Closed CJ-Wright closed 5 years ago

CJ-Wright commented 5 years ago

The idea would be to have the nodes have a representation of themselves which reference their characteristic variables (their name, the name/module of any incoming functions, args, kwargs, etc.) plus their order in the execution and their parent. All of this will be hashed. Thus to check if a graph is different from another we can check if any of the nodes has a different characteristic hash.

Note that these hashes may not be unique, since source nodes would all look the same, but that may be ok. As a consequence we won't be able to use these as node names for networkx.

CJ-Wright commented 5 years ago

It is important to note that this would go a long way to checking if two pipelines are the same.

CJ-Wright commented 5 years ago

(t/h to @tacaswell )

CJ-Wright commented 5 years ago

Done in SHED