datacleaner / DataCleaner

The premier open source Data Quality solution
GNU Lesser General Public License v3.0
595 stars 180 forks source link

Multiple tables fail: Could not resolve child-datacontext for table #1852

Open LoLei opened 4 years ago

LoLei commented 4 years ago

Using a composite datastore to handle two (csv) tables. Previewing a table fails with:

Using DATACLEANER_HOME: /home/me/programs/datacleaner/5.7/DataCleaner
Using DATACLEANER_LIB_HOME: /home/me/programs/datacleaner/5.7/DataCleaner
Using DATACLEANER_JAVA_OPTS:  -Xmx1024m
INFO  09:19:04 DataCleanerHome - Initializing DATACLEANER_HOME
INFO  09:19:04 DataCleanerHome - Resolved env. variable DATACLEANER_HOME: /home/me/programs/datacleaner/5.7/DataCleaner
Using default log configuration: jar:file:/home/me/programs/datacleaner/5.7/DataCleaner/DataCleaner.jar!/org/datacleaner/log4j-default.xml
Failed to load DataCleaner version from manifest: inStream parameter is null
Failed to load DataCleaner version from manifest: inStream parameter is null
WARNING: An illegal reflective access operation has occurred
WARNING: Illegal reflective access by com.google.inject.internal.cglib.core.$ReflectUtils$2 (file:/home/me/programs/datacleaner/5.7/DataCleaner/lib/guice-3.0.jar) to method java.lang.ClassLoader.defineClass(java.lang.String,byte[],int,int,java.security.ProtectionDomain)
WARNING: Please consider reporting this to the maintainers of com.google.inject.internal.cglib.core.$ReflectUtils$2
WARNING: Use --illegal-access=warn to enable warnings of further illegal reflective access operations
WARNING: All illegal access operations will be denied in a future release
WARN  09:20:00 CompositeSchema - The following table names clashes in composite schema: [Parkgaragen.csv, ParkRide.csv]
WARN  09:20:06 CompositeDataContext - Couldn't resolve DataContext for Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
WARN  09:20:06 CompositeDataContext - Couldn't resolve DataContext for Table[name=ParkRide.csv,type=TABLE,remarks=null]
WARN  09:20:06 CompositeAnalysisListener - Listener org.datacleaner.windows.ResultWindow$2 failed. MetaModelException: Could not resolve child-datacontext for table: Table[name=ParkRide.csv,type=TABLE,remarks=null]
WARN  09:20:06 CompositeAnalysisListener - Listener org.datacleaner.windows.ResultWindow$2 failed. MetaModelException: Could not resolve child-datacontext for table: Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.job.runner.RowProcessingMetricsImpl$1.fetch(RowProcessingMetricsImpl.java:116)
    at org.datacleaner.job.runner.RowProcessingMetricsImpl$1.fetch(RowProcessingMetricsImpl.java:99)
    at org.apache.metamodel.util.LazyRef.get(LazyRef.java:47)
    at org.datacleaner.job.runner.RowProcessingMetricsImpl.getExpectedRows(RowProcessingMetricsImpl.java:70)
    at org.datacleaner.windows.ResultWindow$2.rowProcessingBegin(ResultWindow.java:497)
    at org.datacleaner.job.runner.CompositeAnalysisListener.rowProcessingBegin(CompositeAnalysisListener.java:141)
    at org.datacleaner.job.runner.SourceTableRowProcessingPublisher.processRowsInternal(SourceTableRowProcessingPublisher.java:192)
    at org.datacleaner.job.runner.AbstractRowProcessingPublisher.processRows(AbstractRowProcessingPublisher.java:208)
    at org.datacleaner.job.tasks.RunRowProcessingPublisherTask.execute(RunRowProcessingPublisherTask.java:43)
    at org.datacleaner.job.concurrent.TaskRunnable.run(TaskRunnable.java:61)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:06 CompositeDataContext - Couldn't resolve DataContext for Table[name=ParkRide.csv,type=TABLE,remarks=null]
WARN  09:20:06 ErrorAwareAnalysisListener - errorUnknown(ImmutableAnalysisJob[sourceColumns=21,filterJobs=0,transformerJobs=1,analyzerJobs=0],{})
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=ParkRide.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.job.runner.SourceTableRowProcessingPublisher.processRowsInternal(SourceTableRowProcessingPublisher.java:220)
    at org.datacleaner.job.runner.AbstractRowProcessingPublisher.processRows(AbstractRowProcessingPublisher.java:208)
    at org.datacleaner.job.tasks.RunRowProcessingPublisherTask.execute(RunRowProcessingPublisherTask.java:43)
    at org.datacleaner.job.concurrent.TaskRunnable.run(TaskRunnable.java:61)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:06 CompositeDataContext - Couldn't resolve DataContext for Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
WARN  09:20:06 ErrorAwareAnalysisListener - Exception stack trace:
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=ParkRide.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.job.runner.SourceTableRowProcessingPublisher.processRowsInternal(SourceTableRowProcessingPublisher.java:220)
    at org.datacleaner.job.runner.AbstractRowProcessingPublisher.processRows(AbstractRowProcessingPublisher.java:208)
    at org.datacleaner.job.tasks.RunRowProcessingPublisherTask.execute(RunRowProcessingPublisherTask.java:43)
    at org.datacleaner.job.concurrent.TaskRunnable.run(TaskRunnable.java:61)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:06 ErrorAwareAnalysisListener - errorUnknown(ImmutableAnalysisJob[sourceColumns=21,filterJobs=0,transformerJobs=1,analyzerJobs=0],{})
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.job.runner.SourceTableRowProcessingPublisher.processRowsInternal(SourceTableRowProcessingPublisher.java:220)
    at org.datacleaner.job.runner.AbstractRowProcessingPublisher.processRows(AbstractRowProcessingPublisher.java:208)
    at org.datacleaner.job.tasks.RunRowProcessingPublisherTask.execute(RunRowProcessingPublisherTask.java:43)
    at org.datacleaner.job.concurrent.TaskRunnable.run(TaskRunnable.java:61)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:06 ErrorAwareAnalysisListener - Exception message:
WARN  09:20:06 ErrorAwareAnalysisListener - errorUnknown(ImmutableAnalysisJob[sourceColumns=10,filterJobs=0,transformerJobs=0,analyzerJobs=1],{})
org.datacleaner.job.concurrent.PreviousErrorsExistException: A previous exception has occurred
    at org.datacleaner.job.runner.ErrorAwareTaskRunnerWrapper.<init>(ErrorAwareTaskRunnerWrapper.java:44)
    at org.datacleaner.job.runner.AnalysisRunnerImpl.run(AnalysisRunnerImpl.java:85)
    at org.datacleaner.util.AnalysisRunnerSwingWorker.doInBackground(AnalysisRunnerSwingWorker.java:58)
    at org.datacleaner.util.AnalysisRunnerSwingWorker.doInBackground(AnalysisRunnerSwingWorker.java:40)
    at java.desktop/javax.swing.SwingWorker$1.call(SwingWorker.java:304)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.desktop/javax.swing.SwingWorker.run(SwingWorker.java:343)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:06 ErrorAwareAnalysisListener - errorUnknown(ImmutableAnalysisJob[sourceColumns=21,filterJobs=0,transformerJobs=1,analyzerJobs=0],{})
org.datacleaner.job.concurrent.PreviousErrorsExistException: A previous exception has occurred
    at org.datacleaner.job.runner.ErrorAwareTaskRunnerWrapper.<init>(ErrorAwareTaskRunnerWrapper.java:44)
    at org.datacleaner.job.runner.AnalysisRunnerImpl.run(AnalysisRunnerImpl.java:85)
    at org.datacleaner.util.AnalysisRunnerSwingWorker.doInBackground(AnalysisRunnerSwingWorker.java:58)
    at org.datacleaner.util.AnalysisRunnerSwingWorker.doInBackground(AnalysisRunnerSwingWorker.java:40)
    at java.desktop/javax.swing.SwingWorker$1.call(SwingWorker.java:304)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.desktop/javax.swing.SwingWorker.run(SwingWorker.java:343)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:43 CompositeDataContext - Couldn't resolve DataContext for Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
ERROR 09:20:43 DataSetWindow - Unexpected error occurred while building DataSetWindow contents
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=Parkgaragen.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.windows.DataSetWindow.lambda$new$0(DataSetWindow.java:85)
    at org.datacleaner.windows.DataSetWindow$1.doInBackground(DataSetWindow.java:153)
    at org.datacleaner.windows.DataSetWindow$1.doInBackground(DataSetWindow.java:151)
    at java.desktop/javax.swing.SwingWorker$1.call(SwingWorker.java:304)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.desktop/javax.swing.SwingWorker.run(SwingWorker.java:343)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:20:50 CompositeDataContext - Couldn't resolve DataContext for Table[name=ParkRide.csv,type=TABLE,remarks=null]
ERROR 09:20:50 DataSetWindow - Unexpected error occurred while building DataSetWindow contents
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=ParkRide.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.windows.DataSetWindow.lambda$new$0(DataSetWindow.java:85)
    at org.datacleaner.windows.DataSetWindow$1.doInBackground(DataSetWindow.java:153)
    at org.datacleaner.windows.DataSetWindow$1.doInBackground(DataSetWindow.java:151)
    at java.desktop/javax.swing.SwingWorker$1.call(SwingWorker.java:304)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.desktop/javax.swing.SwingWorker.run(SwingWorker.java:343)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)
WARN  09:26:04 UsageAwareDatastoreConnection - Method finalize() invoked but not all usages closed (1 remaining) (for UsageAwareDatastoreConnection[datastore=output,usage=1]). Closing.
WARN  09:27:09 CompositeSchema - The following table names clashes in composite schema: [Parkgaragen.csv, ParkRide.csv]
WARN  09:27:14 CompositeDataContext - Couldn't resolve DataContext for Table[name=ParkRide.csv,type=TABLE,remarks=null]
ERROR 09:27:14 DataSetWindow - Unexpected error occurred while building DataSetWindow contents
org.apache.metamodel.MetaModelException: Could not resolve child-datacontext for table: Table[name=ParkRide.csv,type=TABLE,remarks=null]
    at org.apache.metamodel.CompositeDataContext.executeQuery(CompositeDataContext.java:80)
    at org.datacleaner.windows.DataSetWindow.lambda$new$0(DataSetWindow.java:85)
    at org.datacleaner.windows.DataSetWindow$1.doInBackground(DataSetWindow.java:153)
    at org.datacleaner.windows.DataSetWindow$1.doInBackground(DataSetWindow.java:151)
    at java.desktop/javax.swing.SwingWorker$1.call(SwingWorker.java:304)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.desktop/javax.swing.SwingWorker.run(SwingWorker.java:343)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:830)

Similar to https://github.com/datacleaner/DataCleaner/issues/706 but different error. Also all the outgoing connections from the tables fail too. The tables themselves are not corrupt since I can preview them properly when they're not in a composite datastore.

Using version 5.7 on Linux.