I am getting the following error when I run the example that works with atlas_higgs data.
In the StringIndexer section I am getting the following error:
File "/disk/disk16/hadoop/yarn/local/usercache/319413696/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/worker.py", line 163, in main
func, profiler, deserializer, serializer = read_command(pickleSer, infile)
File "/disk/disk16/hadoop/yarn/local/usercache/319413696/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/worker.py", line 54, in read_command
command = serializer._read_with_length(file)
File "/disk/disk16/hadoop/yarn/local/usercache/MyID/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/serializers.py", line 169, in _read_with_length
return self.loads(obj)
File "/disk/disk16/hadoop/yarn/local/usercache/MyID/appcache/application_1510179039244_71672/container_e31_1510179039244_71672_01_000004/pyspark.zip/pyspark/serializers.py", line 434, in loads
return pickle.loads(obj)
AttributeError: 'module' object has no attribute 'to_dense_vector'
Hi,
I am getting the following error when I run the example that works with atlas_higgs data. In the StringIndexer section I am getting the following error: