neo4j-examples / kettle-plugin-examples

Simple examples for the Neo4j Kettle (Pentaho Data Integration) steps.
Apache License 2.0

neo4j importer not creating the relationships (clickstream example) #2

Open barbosantos opened 5 years ago

barbosantos commented 5 years ago

Hello Matt,

I just ran the clickstream example in Kettle, and the resulting graph didn't have the relationships, only the nodes.

[screenshot: issue_neo4j_importer]

The following files were generated and stored in the local import folder:

[screenshot: issue_neo4j_importer]

No errors were produced during the execution of the job/transformations.

[screenshot: issue_neo4j_importer]

I hope I'm not missing anything.

mattcasters commented 5 years ago

I probably need to check in the latest changes to that example. Hang in there.

mattcasters commented 5 years ago

Apparently the square brackets are causing issues for recent versions of neo4j-import (sigh). I'll fix the file naming.
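
For anyone stuck on an older copy of the sample, here is a rough workaround sketch in Python that strips square brackets from the names of files in an import folder before running the import. It is not the fix applied to the sample itself. The function names and the example file name are hypothetical, and it assumes the brackets in the file names are the only problem.

```python
import re
from pathlib import Path

# Assumption: square brackets in file names are what trips up the importer.
_BRACKETS = re.compile(r"[\[\]]")


def sanitize_filename(name: str) -> str:
    """Return the file name with all '[' and ']' characters removed."""
    return _BRACKETS.sub("", name)


def rename_bracketed_files(folder: Path) -> list[tuple[str, str]]:
    """Rename every file in `folder` whose name contains square brackets.

    Returns a list of (old_name, new_name) pairs for the files renamed.
    """
    renamed = []
    for path in sorted(folder.iterdir()):
        if not path.is_file():
            continue
        new_name = sanitize_filename(path.name)
        if new_name == path.name:
            continue  # nothing to strip
        target = path.with_name(new_name)
        if target.exists():
            # Refuse to overwrite an existing file with the cleaned name.
            raise FileExistsError(f"{target} already exists")
        path.rename(target)
        renamed.append((path.name, new_name))
    return renamed
```

Calling `rename_bracketed_files(Path("import"))` on a hypothetical `import` folder renames the files in place. Any step that later passes those file names to the importer would need to use the cleaned names as well.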

mattcasters commented 5 years ago

https://github.com/knowbi/knowbi-pentaho-pdi-neo4j-output/issues/112

mattcasters commented 5 years ago

The sample should be fixed now; give it a try.

2019/03/12 15:43:49 - Import clickstream files.0 - IMPORT DONE in 2m 1s 845ms.
2019/03/12 15:43:49 - Import clickstream files.0 - Imported:
2019/03/12 15:43:49 - Import clickstream files.0 -   4709063 nodes
2019/03/12 15:43:49 - Import clickstream files.0 -   86721957 relationships
2019/03/12 15:43:49 - Import clickstream files.0 -   4825683 properties
2019/03/12 15:43:49 - Import clickstream files.0 - Peak memory usage: 1.08 GB
2019/03/12 15:43:50 - Import clickstream files.0 - Finished processing (I=0, O=0, R=11, W=11, U=0, E=0)
2019/03/12 15:43:50 - import-clickstream-CSVs - Transformation duration : 125.135 seconds [  2'  5.135" ]
2019/03/12 15:43:50 - load-clickstream-data - Finished job entry [Run import] (result=[true])
2019/03/12 15:43:50 - load-clickstream-data - Finished job entry [Create clickstream rels CSVs] (result=[true])
2019/03/12 15:43:50 - load-clickstream-data - Finished job entry [Create clickstream nodes CSVs] (result=[true])
2019/03/12 15:43:50 - load-clickstream-data - Finished job entry [Initialize/Download] (result=[true])
2019/03/12 15:43:50 - load-clickstream-data - Job execution finished
2019/03/12 15:43:50 - Spoon - Job has ended.
2019/03/12 15:43:50 - load-clickstream-data - Job duration : 499.036 seconds [  8' 19.036" ]

mattcasters commented 5 years ago

image