issues
search
klbostee
/
dumbo
Python module that allows one to easily write and run Hadoop programs.
http://projects.dumbotics.com/dumbo
1.04k
stars
146
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
It's hard to use an output as an input
#43
brisssou
closed
13 years ago
1
-input format not handled in local mode
#42
jmesnil
closed
13 years ago
1
dumbo with virtualenv
#41
oddskool
closed
13 years ago
6
'default' parameter in Params class
#40
andrix
closed
13 years ago
0
Import external libraries in an dumbo MapReduce
#39
thanhbinh87
closed
13 years ago
1
Permit stdout redirection to avoid broken pipes
#38
jlewi
opened
13 years ago
1
tests won't run on ubuntu with zope.interface installed globally
#37
klbostee
closed
13 years ago
0
pre-outputs not deleted automatically anymore
#36
klbostee
closed
13 years ago
0
dedicated decorator for specifying parser for a single mapper
#35
gamboviol
opened
13 years ago
1
document why anyone should use this
#34
Dieterbe
closed
13 years ago
3
Override or disable config files
#33
andrewclegg
closed
13 years ago
2
reducers for outputting raw binary files, tokyo cabinet dbs or constant dbs
#32
klbostee
closed
13 years ago
3
Buildout wrapper script swallows exceptions raised by starter
#31
andrewclegg
closed
13 years ago
1
Two error handling bugs in core.py
#30
andrewclegg
closed
13 years ago
1
Explicit option for local run
#29
gsakkis
closed
10 months ago
1
VirtualEnv + Hadoop CDH3B4 mode + Dumbo = import site error (was: StreamJob fails -- error finding typedbytes.pyc even though it exists)
#28
brainstorm
closed
13 years ago
12
Add "raw" outputformat shortcut
#27
dangra
closed
13 years ago
2
hadoopy backend?
#26
dgleich
opened
13 years ago
4
some issue when trying to use the new DAG functionality in the 0.21.29 release
#25
asaelm
opened
13 years ago
3
python not found
#24
josephcc
closed
13 years ago
2
"(1/0)" in job name
#23
klbostee
closed
13 years ago
1
Outputting bytes directly on Hadoop is not possible
#22
klbostee
closed
12 years ago
2
unittest framework for dumbo mapreduce
#21
klbostee
closed
13 years ago
1
addpath broken under hadoop-0.21.0
#20
jso
closed
13 years ago
1
dumbo cat silently fails when JobHistory logging is enabled
#19
emf
closed
13 years ago
1
Allow a Job to be a DAG instead of a chain?
#18
hydropyrum
closed
13 years ago
5
make -overwrite yes work for intermediate output dirs too
#17
klbostee
closed
14 years ago
1
overwrite option
#16
klbostee
closed
14 years ago
1
failing second iteration not reflected by status code
#15
klbostee
closed
14 years ago
1
dumbo fails to find hadoop streaming jar
#14
shaneaevans
closed
14 years ago
1
equality test between join keys should be customizable
#13
dehowell
closed
14 years ago
2
exception handling for parser ValueError should not write all bad lines out to logs
#12
dehowell
closed
14 years ago
2
JoinReducer cannot be used anymore for combining
#11
klbostee
closed
14 years ago
1
Java -Xmx style memory size specification when using -memlimit
#10
adamhadani
closed
14 years ago
0
add -queue option
#9
klbostee
closed
14 years ago
1
backend abstraction
#8
klbostee
closed
14 years ago
10
findjar() method is abit naive
#7
klbostee
closed
14 years ago
1
configure() and close() on mapper/reduce/combiner classes not called if map/reduce function is defined
#6
klbostee
closed
14 years ago
1
parameters for local sort
#5
klbostee
closed
14 years ago
1
no error reported by Hadoop in case of immediate failure
#4
klbostee
opened
14 years ago
4
cryptic "relative module names not supported" error message
#3
klbostee
closed
14 years ago
1
make MultiMapper inherit options
#2
klbostee
closed
14 years ago
1
dumbo cat /hdfs/path/part* silently fails to concatenate all part files
#1
klbostee
closed
14 years ago
2
Previous