cerndb / dist-keras

Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
http://joerihermans.com/work/distributed-keras/
GNU General Public License v3.0
623 stars 169 forks source link

Error in StringIndexer of the example #46

Open mashaye opened 6 years ago

mashaye commented 6 years ago

Hi,

I am getting the following error when I run the example that works with atlas_higgs data. In the StringIndexer section I am getting the following error:

File "/disk/disk16/hadoop/yarn/local/usercache/319413696/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/worker.py", line 163, in main
    func, profiler, deserializer, serializer = read_command(pickleSer, infile)
  File "/disk/disk16/hadoop/yarn/local/usercache/319413696/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/worker.py", line 54, in read_command
    command = serializer._read_with_length(file)
  File "/disk/disk16/hadoop/yarn/local/usercache/MyID/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/serializers.py", line 169, in _read_with_length
    return self.loads(obj)
  File "/disk/disk16/hadoop/yarn/local/usercache/MyID/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/serializers.py", line 434, in loads
    return pickle.loads(obj)
AttributeError: 'module' object has no attribute 'to_dense_vector'