crs4 / pydoop

A Python MapReduce and HDFS API for Hadoop
Apache License 2.0
236 stars 59 forks source link

hdfs.cp hangs #353

Closed pravinkarnam closed 5 years ago

pravinkarnam commented 5 years ago

Hi, I have a simple code which copies from one path to another path:

import os
if 'HADOOP_HOME' not in os.environ:
        os.environ['HADOOP_HOME'] = '/opt/cloudera/parcels/CDH/lib/hadoop'

from pydoop import hdfs

src_path= '/from/path'
dest_path ='/to/path'
hdfs.cp(src_path, dest_path)

This code hangs though i see that it actually copies the files/directories in to destination (I checked using hdfs dfs -ls /to/path/.

Not sure if it has to be written in different way to handle some file closures? If yes, please suggest?