damballa / parkour

Hadoop MapReduce in idiomatic Clojure.
Apache License 2.0
257 stars 19 forks source link

Distributed values (dvals) via conf-aware job parameter serialization #4

Closed llasram closed 10 years ago

llasram commented 10 years ago

Serialize job parameters (task vars and arguments) via a modified version of EDN. Make the active job configuration available during de/serialization, allowing objects access to other parameters in the configuration. Use this access to implement "distributed values" or dvals, which arrange to distribute their serialized representation via the Hadoop distributed cache.