linkedpipes / etl

LinkedPipes ETL is an RDF based, lightweight ETL tool
https://etl.linkedpipes.com

Input files for transformation components #799

Closed versant2612 closed 4 years ago

versant2612 commented 4 years ago

How can I specify the input file name and location for the "Tabular to RDF" component? The input file is on the same machine where LinkedPipes is installed. In this case, do I need an extractor?

jakubklimek commented 4 years ago

@versant2612 Please see the tutorial on how to convert tabular data to RDF first.

If your CSV file is on your local file system, then you need the "Files from local" extractor. It is banned by default because of security concerns on web instances, so you have to unban it in the configuration file first.

versant2612 commented 4 years ago

I changed the configuration files by commenting out these lines and restarted the executor, but I'm still getting the error message about the banned component:

cloud-di@vm029:~/etl/deploy$ grep executor.banned_jar_iri_patterns conf
configuration.docker.properties:# executor.banned_jar_iri_patterns = ".e-filesFromLocal.",".l-filesToLocal.",".x-deleteDirectory."
configuration.properties:# executor.banned_jar_iri_patterns = ".e-filesFromLocal.",".l-filesToLocal.",".x-deleteDirectory."
configuration.properties.sample:# executor.banned_jar_iri_patterns = ".e-filesFromLocal.",".l-filesToLocal.",".x-deleteDirectory.*"

Error messages

cloud-di@vm029:~/etl/deploy$ tail -30 executor.log
    at com.linkedpipes.etl.executor.pipeline.PipelineExecutor.execute(PipelineExecutor.java:93)
    at com.linkedpipes.etl.executor.web.servlet.ExecutionServlet.lambda$execute$0(ExecutionServlet.java:71)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:834)
15:14:30 [asynchExecutor-1] ERROR c.l.e.e.p.PipelineExecutor - Banned component.
com.linkedpipes.etl.executor.module.BannedComponent: Required component: http://etl.linkedpipes.com/resources/jars/l-filesToLocal/0.0.0 is banned by: .l-filesToLocal.
    at com.linkedpipes.etl.executor.module.ModuleService.checkIfBundleIsAllowed(ModuleService.java:199)
    at com.linkedpipes.etl.executor.module.ModuleService.getComponent(ModuleService.java:137)
    at com.linkedpipes.etl.executor.pipeline.PipelineExecutor.loadComponents(PipelineExecutor.java:280)
    at com.linkedpipes.etl.executor.pipeline.PipelineExecutor.initialize(PipelineExecutor.java:160)
    at com.linkedpipes.etl.executor.pipeline.PipelineExecutor.execute(PipelineExecutor.java:93)
    at com.linkedpipes.etl.executor.web.servlet.ExecutionServlet.lambda$execute$0(ExecutionServlet.java:71)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:834)

jakubklimek commented 4 years ago

@versant2612 This is weird. I just tested it and it worked for me with the line commented out in configuration.properties. Can you try restarting all 4 parts, just to be sure?
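For anyone double-checking the same thing, here is a minimal Python sketch that reports whether the ban is still active in a properties file, i.e. whether an uncommented `executor.banned_jar_iri_patterns` line remains. The function name `active_ban_patterns` is hypothetical and not part of LinkedPipes ETL, and the parsing is a simplification of the Java properties format (it treats `#` and `!` as comment markers and ignores line continuations).

```python
PROPERTY = "executor.banned_jar_iri_patterns"


def active_ban_patterns(properties_text: str) -> list[str]:
    """Return ban patterns found on uncommented lines only.

    Hypothetical helper for illustration; simplified properties parsing.
    """
    patterns = []
    for raw_line in properties_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comment lines ('#' or '!').
        if not line or line.startswith(("#", "!")):
            continue
        key, separator, value = line.partition("=")
        if separator and key.strip() == PROPERTY:
            # Values are comma-separated and wrapped in double quotes.
            patterns += [
                item.strip().strip('"')
                for item in value.split(",")
                if item.strip()
            ]
    return patterns
```

An empty list for every configuration file the executor reads means no pattern is configured there. As this thread suggests, the running components may still need a restart before the change takes effect.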

versant2612 commented 4 years ago

The virtual machine was restarted by the cloud administrator for other reasons. I started all 4 parts and it is working now. Thanks!

I have another error message, this time with the "Files to local" component. Should I open another issue?

jakubklimek commented 4 years ago

Yes please

versant2612 commented 4 years ago

OK @jakubklimek, I will. I still have a question about the "Path to a local file or directory" parameter of the "Files from local" component. Can I specify a directory path and a regular expression like "*.xml" to select a list of files?

jakubklimek commented 4 years ago

@versant2612 You can only specify a single file or a single directory (which can include subdirectories). The whole file or directory will then be available in the pipeline. To select only some of the files, add the "Files filter" component with a regular expression.
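Note that "*.xml" is a shell-style glob, not a regular expression. A rough Python sketch of the difference follows. The helper `filter_files` is hypothetical, and whether Files filter matches the whole name or only part of it, and whether it sees the file name or the relative path, are assumptions here, so check the component documentation.

```python
import re


def filter_files(names, pattern):
    """Keep names that the regular expression matches in full.

    Hypothetical helper; full-match semantics are an assumption.
    """
    regex = re.compile(pattern)
    return [name for name in names if regex.fullmatch(name)]


names = ["a.xml", "sub/b.xml", "notes.txt", "xml"]

# Regular-expression equivalent of the glob "*.xml".
print(filter_files(names, r".*\.xml"))  # ['a.xml', 'sub/b.xml']

# The glob itself is not a valid regular expression in Python:
# a leading '*' has nothing to repeat.
try:
    re.compile("*.xml")
except re.error as error:
    print("invalid regular expression:", error)
```

In regular-expression syntax the dot has to be escaped and `.*` takes the place of the glob `*`.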

versant2612 commented 4 years ago

Thank you. I included Files filter in my pipelines.