aditya-grover / node2vec

http://snap.stanford.edu/node2vec/
MIT License

Spark implementation Exception: Output directory xxxxxx already exists. #22

Open passiondy opened 6 years ago

passiondy commented 6 years ago

Hi!

I recently used the Spark version to analyze a large-scale network with 21M nodes and 310M edges (the input file is 13 GB). I ran the code on a cluster of 5 machines. It ran fine until it saved the result to HDFS, where I got the exception "diagnostics: User class threw exception: org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory xxxxxx already exists."

Has anyone encountered this problem before?

Thank you!

Best, Ying

lsong77 commented 4 years ago

Has anyone solved this? I also ran into this problem.
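
For context, Hadoop's output format refuses to write into a directory that already exists, and Spark's `saveAsTextFile` goes through that check, which is where `FileAlreadyExistsException` comes from. One possible workaround, not confirmed anywhere in this thread, is to remove the output directory before saving. The sketch below assumes the result is an `RDD[String]` written with `saveAsTextFile`. The object `OutputDirSketch` and the helper `saveOverwriting` are hypothetical names for illustration and are not part of the node2vec code.

```scala
import org.apache.hadoop.fs.Path
import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD

object OutputDirSketch {
  // Hypothetical helper (not from node2vec): delete `output` if it already
  // exists, then save the RDD there as text.
  def saveOverwriting(sc: SparkContext, rdd: RDD[String], output: String): Unit = {
    val path = new Path(output)
    // Resolve the file system from the path so hdfs:// and file:// URIs both work.
    val fs = path.getFileSystem(sc.hadoopConfiguration)
    if (fs.exists(path)) {
      // Second argument `true` requests a recursive delete.
      // Assumption: nothing else under this directory needs to be kept.
      fs.delete(path, true)
    }
    rdd.saveAsTextFile(output)
  }
}
```

If the job writes more than one output under the same path, each target would need the same treatment. Spark also has a `spark.hadoop.validateOutputSpecs` setting that disables the existence check when set to `false`, but that can leave stale files from an earlier run mixed in with new ones, so deleting the directory or choosing a fresh output path is usually the safer choice.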