Framework for linking generator/iterators to create processing chains and pipelines. With its operator based syntax, it is easy to create complex sequences from simple building blocks. Chainlets are suitable for incremental, iterative and stream processing and beyond.
Consider the following use case: Read data from an XML, flatten it, then write to a csv. This can be expressed with a chain of generators:
csv_writer(flatten(xml_reader(path='data.xml'), join='.'.join), path='data.csv')
When written using chainlets, generator sequence and arguments are much easier to read.
The chainlets are joined using the >>
operator:
xml_reader(path='data.xml') >> flatten(join='.'.join) >> csv_writer(path='data.csv')
In addition, chainlets can be composed much more freely. Instead of deeply nested call structures, chainlets have simple, flat call sequences.
Writing new chainlets does not require any special techniques or conventions.
You can directly convert existing coroutines, functions and objects.
Implementing a moving average requires exactly one line specific to the chainlet
library:
@chainlet.genlet
def moving_average(window_size=8):
buffer = collections.deque([(yield)], maxlen=window_size)
while True:
new_value = yield(sum(buffer)/len(buffer))
buffer.append(new_value)
All the gluing and binding is done automatically for you. Instead of bloating existing code, it is often easier to create and bind another simple chainlet.
Chainlets are not limited to 1-to-1 relations, but actually allow n-to-n links. Each link can have multiple parents and children. The following example reads XML messages via UDP, and logs them in two different verbosity levels.
udp_digest(port=31137) >> xml_converter() >> (
json_writer(path='raw.json'),
moving_average(window_size=60) >> json_writer(path='avg1m.json'),
)
ChainLink
, ready to be subclassed and made bigGeneratorLink
put that awesome into useFunctionLink
linklet
and let linkChainlets are simple at their core, and quick to understand. If you want to know more, just read the fabulous manual:
The module is hosted on github. If you have issues or want to propose changes, check out the issue tracker.