Open ymtoo opened 1 year ago
This issue happens when we have cugraph
imported via PythonCall. I am looking into the issue further. This will result in the same error:
using CSV
using DataFrames
using PythonCall
const cugraph = PythonCall.pynew()
PythonCall.pycopy!(cugraph, pyimport("cugraph"))
csvpath = "./metadata.csv"
@info "Generate dummy data"
X = randn(100000, 5)
df = DataFrame(X, :auto)
CSV.write(csvpath, df)
@info "Read the data"
df = CSV.read(csvpath, DataFrame, ntasks=2)
@info "Done"
I have removed the automatic cugraph
pyimport in the main branch. Let me know if this fixes your issue. I will continue to look into what is happening to cause this. If this works for you too, I will probably publish this release to avoid potential issues with cugraph
in my processes.
This fixes the issue. The main branch works fine with multi-threaded CSV reading. Thanks!
I pushed the update to the registry. Over the next few weeks, I will look at directly interfacing with the C++/CUDA functions instead of wrapping Python, so hopefully, weird bugs like this will go away when the extra layer is removed.
MWE (
test_rapids.jl
):Running the script:
It works fine if it's on a single thread:
Julia and the package version: