mozilla / jydoop

Efficient Hadoop Map-Reduce in Python
Other
31 stars 19 forks source link

Telemetry hdfs support #41

Closed mreid-moz closed 11 years ago

mreid-moz commented 11 years ago

Add support for reading Telemetry data from HDFS (Sequence Files) instead of scanning HBase. This is now the preferred way to access Telemetry data for the most recent 2 weeks. You'll get an error message if you try to run on data outside that time frame.