mozilla / jydoop

Efficient Hadoop Map-Reduce in Python

output python (string,string) k/v pairs to HDFS as (Text,Text) #54

Open bcolloran opened 10 years ago

bcolloran commented 10 years ago

This is needed to join the output of jydoop jobs against FHR data dumps. The join is required for FHR de-orphaning, and it is a blocker on testing and deploying the new algorithm. I spent all day trying to do this myself, but I still have no idea what's going on (where can you cast reducer output to (Text,Text)?).