neo4j / graph-data-science

Source code for the Neo4j Graph Data Science library of graph algorithms.
https://neo4j.com/docs/graph-data-science/current/
Other
621 stars 160 forks source link

gds.beta.graph.export.csv would be very useful if the csv files can be used for "neo4j-admin import incremental" #241

Open cpuwar opened 1 year ago

cpuwar commented 1 year ago

Is your feature request related to a problem? Please describe. I used this feature to generate csv files for "neo4j-admin database import full". It is very easy and successful. But I cannot use them for "neo4j-admin database import incremental"

Describe the solution you would like "neo4j-admin import" tool is very good to import a huge graph. Now it supports "incremental" mode and it's really big step up for import tool's usefulness. But "import incremental" seemed to require uuid of Node as its property. So I added them in Movie and Person nodes by apoc.uuid.install and export them in csv files when I test gds.beta.graph.export.csv with Movie graph(from :play movies).

And I got the csv files like these: < nodes_Person_0.csv > 1,"Keanu Reeves",,"bf59b74a-9ac2-4c1f-8de7-b22be51f731a" 2,"Carrie-Anne Moss",,"c5642d98-09f7-4469-9efa-2606ea92ef9e" 3,"Laurence Fishburne",,"365c1fe3-9039-4a38-be9b-ec6fce9111b3" .... < nodes_Movie_0.csv > 0,,"The Matrix","aeff74b8-8c0c-4d98-9395-9463af575602" 9,,"The Matrix Reloaded","8d69c33d-4d68-454c-85d7-ef222f8e6ebb" 10,,"The Matrix Revolutions","f2368b75-3ddd-45a3-95f7-de10a2f4a715" .... < relationships_ACTED_IN_0.csv > 1,0 1,9 1,10 ...

I don't know how to use them for "neo4j-admin database import incremental" if I want to import the data to an existing graph incrementally.

Describe alternatives you have considered If gds.beta.graph.export.csv can export uuid property in relationship.csv like this, I would use all the csv files to import them incrementally: < relationships_ACTED_IN_0.csv > "bf59b74a-9ac2-4c1f-8de7-b22be51f731a","aeff74b8-8c0c-4d98-9395-9463af575602" "bf59b74a-9ac2-4c1f-8de7-b22be51f731a","8d69c33d-4d68-454c-85d7-ef222f8e6ebb" "bf59b74a-9ac2-4c1f-8de7-b22be51f731a","f2368b75-3ddd-45a3-95f7-de10a2f4a715" ...

Could you add this option in the next release? Or do you know any current method to generate the csv files for "neo4j-admin database import incremental"? Actually I don't understand it clearly though I read the manual many times: https://neo4j.com/docs/operations-manual/current/tools/neo4j-admin/neo4j-admin-import/

Dongho.