DigitalPebble / behemoth

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Other
281 stars 60 forks source link

Versioning BehemothDocument #23

Closed butlermh closed 12 years ago

butlermh commented 13 years ago

Currently BehemothDocument does not contain a version number, which means it will be difficult to maintain compatibility with several versions if in the future fields are added.

Also consider the possibility of using Avro for serialization / deserialization.

jnioche commented 12 years ago

[master 2cdf19d] Issue #23 Versioning BehemothDocument - NOT BACKWARD COMPATIBLE 1 file changed, 323 insertions(+), 311 deletions(-)