usc-isi-i2 / dsbox-cleaning

The data cleaning TA1 component of DSBox
MIT License

Date Featurizer Timezone Setting #47

Closed. proska closed this issue 6 years ago.

proska commented 6 years ago

The date featurizer prints the following warning messages about setting the timezone on the 26_radon_seed dataset.

cleaning/dsbox/datapreprocessing/cleaner/dependencies/date_extractor.py:408: UserWarning: DateExtractor: Failed to set timezone as America/Los_Angeles. Catch offset must be a timedelta representing a whole number of minutes, not datetime.timedelta(-1, 58022).
  warn('DateExtractor: Failed to set timezone as ' + str(self.default_tz) + '. Catch ' + str(e))
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/re.py:212: FutureWarning: split() requires a non-empty pattern match.
  return _compile(pattern, flags).split(string, maxsplit)
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/site-packages/pandas/core/indexing.py:621: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
  self.obj[item_labels[indexer[info_axis]]] = value
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/site-packages/pandas/core/indexing.py:537: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
  self.obj[item] = s
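
For reference, the offset quoted in the first warning can be decoded with the standard library alone. The sketch below assumes nothing about how DateExtractor builds its timezone. It only shows that `datetime.timedelta(-1, 58022)` is -07:52:58, which is not a whole number of minutes, the condition the error message names. That value appears to match the local mean time entry for America/Los_Angeles in the tz database, but that is an assumption, not something confirmed in this thread. The helper `round_to_whole_minutes` is hypothetical, and rounding is only one possible workaround, not necessarily what the eventual fix did.

```python
from datetime import timedelta

# Offset exactly as reported in the UserWarning.
reported = timedelta(-1, 58022)

# Convert to signed seconds and format as +/-HH:MM:SS.
total_seconds = int(reported.total_seconds())        # -28378
sign = "-" if total_seconds < 0 else "+"
hours, rem = divmod(abs(total_seconds), 3600)
minutes, seconds = divmod(rem, 60)
readable = f"{sign}{hours:02d}:{minutes:02d}:{seconds:02d}"  # "-07:52:58"

# The check the error message describes: is the offset a whole number of minutes?
is_whole_minutes = total_seconds % 60 == 0           # False


def round_to_whole_minutes(offset):
    """Hypothetical helper: round an offset to the nearest whole minute."""
    return timedelta(minutes=round(offset.total_seconds() / 60))


# Rounded value equals timedelta(hours=-7, minutes=-53).
rounded = round_to_whole_minutes(reported)

print(readable, is_whole_minutes, rounded)
```

Rounding changes the offset by two seconds, so it only suits cases where sub-minute precision of historical local mean time does not matter.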

This is the complete console output of the ta2-search run.

(d3m-devel) [qasemi@dsbox02 python]$ python ta2-search ~/dsbox/runs2/config-seed-test/26_radon_seed_config.json
Namespace(configuration_file='/nas/home/qasemi/dsbox/runs2/config-seed-test/26_radon_seed_config.json', cpus=-1, debug=False, output_prefix=None, timeout=-1)
Using configuation:
{'cpus': '10',
 'dataset_schema': '/nfs1/dsbox-repo/data/datasets/seed_datasets_current/26_radon_seed/26_radon_seed_dataset/datasetDoc.json',
 'executables_root': '/nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed/executables',
 'pipeline_logs_root': '/nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed/logs',
 'problem_root': '/nfs1/dsbox-repo/data/datasets/seed_datasets_current/26_radon_seed/26_radon_seed_problem',
 'problem_schema': '/nfs1/dsbox-repo/data/datasets/seed_datasets_current/26_radon_seed/26_radon_seed_problem/problemDoc.json',
 'ram': '10Gi',
 'results_root': '/nas/home/qasemi/dsbox/runs2/output-seed/26_radon_seed/results',
 'saved_pipeline_ID': '',
 'saving_folder_loc': '/nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed',
 'temp_storage_root': '/nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed/temp',
 'test_data_root': '/nfs1/dsbox-repo/data/datasets/seed_datasets_current/26_radon_seed/26_radon_seed_dataset',
 'timeout': 9}
[INFO] No test data config found! Will split the data.
[INFO] - dsbox.controller.controller - Top level output directory: /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed
[INFO] Succesfully parsed test data
{'structural_type': <class 'd3m.container.pandas.DataFrame'>, 'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Table', 'https://metadata.datadrivendiscovery.org/types/DatasetEntryPoint'), 'dimension': {'name': 'rows', 'semantic_types': ('https://metadata.datadrivendiscovery.org/types/TabularRow',), 'length': 736}}
{'dimension': <FrozenOrderedDict OrderedDict([('name', 'rows'), ('semantic_types', ('https://metadata.datadrivendiscovery.org/types/TabularRow',)), ('length', 736)])>,
 'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Table',
                    'https://metadata.datadrivendiscovery.org/types/DatasetEntryPoint'),
 'structural_type': <class 'd3m.container.pandas.DataFrame'>}
{'structural_type': <class 'd3m.container.pandas.DataFrame'>, 'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Table', 'https://metadata.datadrivendiscovery.org/types/DatasetEntryPoint'), 'dimension': {'name': 'rows', 'semantic_types': ('https://metadata.datadrivendiscovery.org/types/TabularRow',), 'length': 183}}
{'dimension': <FrozenOrderedDict OrderedDict([('name', 'rows'), ('semantic_types', ('https://metadata.datadrivendiscovery.org/types/TabularRow',)), ('length', 183)])>,
 'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Table',
                    'https://metadata.datadrivendiscovery.org/types/DatasetEntryPoint'),
 'structural_type': <class 'd3m.container.pandas.DataFrame'>}
[INFO] Template choices:
Template ' Default_regression_template ' has been added to template base.
[INFO] Template 0:Default_regression_template Selected. UCT:[100.0]
[INFO] number of workers: 10
[INFO] Worker started, id: <_MainProcess(MainProcess, started)>
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/site-packages/h5py/__init__.py:36: FutureWarning: Conversion of the second argument of issubdtype from `float` to `np.floating` is deprecated. In future, it will be treated as `np.float64 == np.dtype(float).type`.
  from ._conv import register_converters as _register_converters
Using TensorFlow backend.
[INFO] Push@cache: ('d3m.primitives.dsbox.Denormalize', -7456448384928637957)
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/site-packages/h5py/__init__.py:36: FutureWarning: Conversion of the second argument of issubdtype from `float` to `np.floating` is deprecated. In future, it will be treated as `np.float64 == np.dtype(float).type`.
  from ._conv import register_converters as _register_converters
Using TensorFlow backend.
[INFO] Push@cache: ('d3m.primitives.datasets.DatasetToDataFrame', -7456448384928637957)
[INFO] Push@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', 2020361256340455972)
[INFO] Push@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', -6157425058097195679)
[INFO] Push@cache: ('d3m.primitives.dsbox.Profiler', -8604379024263977179)
/nfs1/dsbox-repo/qasemi/dsbox-cleaning/dsbox/datapreprocessing/cleaner/dependencies/date_extractor.py:408: UserWarning: DateExtractor: Failed to set timezone as America/Los_Angeles. Catch offset must be a timedelta representing a whole number of minutes, not datetime.timedelta(-1, 58022).
  warn('DateExtractor: Failed to set timezone as ' + str(self.default_tz) + '. Catch ' + str(e))
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/re.py:212: FutureWarning: split() requires a non-empty pattern match.
  return _compile(pattern, flags).split(string, maxsplit)
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/site-packages/pandas/core/indexing.py:621: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
  self.obj[item_labels[indexer[info_axis]]] = value
/nfs1/dsbox-repo/qasemi/miniconda/envs/d3m-devel/lib/python3.6/site-packages/pandas/core/indexing.py:537: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
  self.obj[item] = s
[INFO] Push@cache: ('d3m.primitives.dsbox.CleaningFeaturizer', -8604379024263977179)
[INFO] Push@cache: ('d3m.primitives.dsbox.CorexText', 7712785560202519328)
[INFO] Push@cache: ('d3m.primitives.dsbox.Encoder', -431603613755528502)
[INFO] Push@cache: ('d3m.primitives.sklearn_wrap.SKImputer', -2308759725795285255)
[INFO] Push@cache: ('d3m.primitives.sklearn_wrap.SKARDRegression', -6873354452746540900)
******************
[INFO] Writing results
{'fitted_pipeline': <dsbox.pipeline.fitted_pipeline.FittedPipeline object at 0x7f0e72baee80>, 'training_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.3742661106678188}], 'cross_validation_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.374763796302885, 'values': [0.4568771056820037, 0.6940208888120548, 0.5236146968518263, 0.29829978655952955, 0.3494624205543473, 0.25485837162840735, 0.3168304264574417, 0.28151174107824145, 0.3286939632411877, 0.24346856216380977], 'targets': []}], 'test_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.35931540324459654}]}
{'denormalize_step': {'primitive': 'd3m.primitives.dsbox.Denormalize', 'hyperparameters': {}}, 'to_dataframe_step': {'primitive': 'd3m.primitives.datasets.DatasetToDataFrame', 'hyperparameters': {}}, 'extract_attribute_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Attribute',)}}, 'profiler_step': {'primitive': 'd3m.primitives.dsbox.Profiler', 'hyperparameters': {}}, 'clean_step': {'primitive': 'd3m.primitives.dsbox.CleaningFeaturizer', 'hyperparameters': {}}, 'corex_step': {'primitive': 'd3m.primitives.dsbox.CorexText', 'hyperparameters': {}}, 'encoder_step': {'primitive': 'd3m.primitives.dsbox.Encoder', 'hyperparameters': {}}, 'impute_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKImputer', 'hyperparameters': {}}, 'extract_target_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Target', 'https://metadata.datadrivendiscovery.org/types/SuggestedTarget')}}, 'model_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKARDRegression', 'hyperparameters': {}}} 0.35931540324459654
Training rootMeanSquaredError = 0.3742661106678188
CV rootMeanSquaredError = 0.374763796302885
Test rootMeanSquaredError = 0.35931540324459654
******************
[INFO] Saving training results in /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed.txt
[INFO] report: 0.35931540324459654
[INFO] UCT updated: [19.068816944723313]
[INFO] cache size: 10
[INFO] Template 0:Default_regression_template Selected. UCT:[19.068816944723313]
[INFO] number of workers: 10
[INFO] Worker started, id: <_MainProcess(MainProcess, started)>
[INFO] Hit@cache: ('d3m.primitives.dsbox.Denormalize', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.datasets.DatasetToDataFrame', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', 2020361256340455972)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', -6157425058097195679)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Profiler', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CleaningFeaturizer', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CorexText', 7712785560202519328)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Encoder', -431603613755528502)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKImputer', -2308759725795285255)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKARDRegression', -6873354452746540900)
******************
[INFO] Writing results
{'fitted_pipeline': <dsbox.pipeline.fitted_pipeline.FittedPipeline object at 0x7f0c545c8630>, 'training_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.3742661106678188}], 'cross_validation_metrics': [], 'test_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.35931540324459654}]}
{'denormalize_step': {'primitive': 'd3m.primitives.dsbox.Denormalize', 'hyperparameters': {}}, 'to_dataframe_step': {'primitive': 'd3m.primitives.datasets.DatasetToDataFrame', 'hyperparameters': {}}, 'extract_attribute_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Attribute',)}}, 'profiler_step': {'primitive': 'd3m.primitives.dsbox.Profiler', 'hyperparameters': {}}, 'clean_step': {'primitive': 'd3m.primitives.dsbox.CleaningFeaturizer', 'hyperparameters': {}}, 'corex_step': {'primitive': 'd3m.primitives.dsbox.CorexText', 'hyperparameters': {}}, 'encoder_step': {'primitive': 'd3m.primitives.dsbox.Encoder', 'hyperparameters': {}}, 'impute_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKImputer', 'hyperparameters': {}}, 'extract_target_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Target', 'https://metadata.datadrivendiscovery.org/types/SuggestedTarget')}}, 'model_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKARDRegression', 'hyperparameters': {}}} 0.35931540324459654
Training rootMeanSquaredError = 0.3742661106678188
Test rootMeanSquaredError = 0.35931540324459654
******************
[INFO] Saving training results in /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed.txt
[INFO] report: 0.35931540324459654
[INFO] UCT updated: [20.246956596783747]
[INFO] cache size: 10
[INFO] Template 0:Default_regression_template Selected. UCT:[20.246956596783747]
[INFO] number of workers: 10
[INFO] Worker started, id: <_MainProcess(MainProcess, started)>
[INFO] Hit@cache: ('d3m.primitives.dsbox.Denormalize', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.datasets.DatasetToDataFrame', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', 2020361256340455972)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', -6157425058097195679)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Profiler', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CleaningFeaturizer', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CorexText', 7712785560202519328)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Encoder', -431603613755528502)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKImputer', -2308759725795285255)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKARDRegression', -6873354452746540900)
******************
[INFO] Writing results
{'fitted_pipeline': <dsbox.pipeline.fitted_pipeline.FittedPipeline object at 0x7f0c941e8d68>, 'training_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.3742661106678188}], 'cross_validation_metrics': [], 'test_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.35931540324459654}]}
{'denormalize_step': {'primitive': 'd3m.primitives.dsbox.Denormalize', 'hyperparameters': {}}, 'to_dataframe_step': {'primitive': 'd3m.primitives.datasets.DatasetToDataFrame', 'hyperparameters': {}}, 'extract_attribute_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Attribute',)}}, 'profiler_step': {'primitive': 'd3m.primitives.dsbox.Profiler', 'hyperparameters': {}}, 'clean_step': {'primitive': 'd3m.primitives.dsbox.CleaningFeaturizer', 'hyperparameters': {}}, 'corex_step': {'primitive': 'd3m.primitives.dsbox.CorexText', 'hyperparameters': {}}, 'encoder_step': {'primitive': 'd3m.primitives.dsbox.Encoder', 'hyperparameters': {}}, 'impute_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKImputer', 'hyperparameters': {}}, 'extract_target_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Target', 'https://metadata.datadrivendiscovery.org/types/SuggestedTarget')}}, 'model_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKARDRegression', 'hyperparameters': {}}} 0.35931540324459654
Training rootMeanSquaredError = 0.3742661106678188
Test rootMeanSquaredError = 0.35931540324459654
******************
[INFO] Saving training results in /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed.txt
[INFO] report: 0.35931540324459654
[INFO] UCT updated: [20.55257275559609]
[INFO] cache size: 10
[INFO] Template 0:Default_regression_template Selected. UCT:[20.55257275559609]
[INFO] number of workers: 10
[INFO] Worker started, id: <_MainProcess(MainProcess, started)>
[INFO] Hit@cache: ('d3m.primitives.dsbox.Denormalize', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.datasets.DatasetToDataFrame', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', 2020361256340455972)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', -6157425058097195679)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Profiler', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CleaningFeaturizer', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CorexText', 7712785560202519328)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Encoder', -431603613755528502)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKImputer', -2308759725795285255)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKARDRegression', -6873354452746540900)
******************
[INFO] Writing results
{'fitted_pipeline': <dsbox.pipeline.fitted_pipeline.FittedPipeline object at 0x7f0bdffcb4a8>, 'training_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.3742661106678188}], 'cross_validation_metrics': [], 'test_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.35931540324459654}]}
{'denormalize_step': {'primitive': 'd3m.primitives.dsbox.Denormalize', 'hyperparameters': {}}, 'to_dataframe_step': {'primitive': 'd3m.primitives.datasets.DatasetToDataFrame', 'hyperparameters': {}}, 'extract_attribute_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Attribute',)}}, 'profiler_step': {'primitive': 'd3m.primitives.dsbox.Profiler', 'hyperparameters': {}}, 'clean_step': {'primitive': 'd3m.primitives.dsbox.CleaningFeaturizer', 'hyperparameters': {}}, 'corex_step': {'primitive': 'd3m.primitives.dsbox.CorexText', 'hyperparameters': {}}, 'encoder_step': {'primitive': 'd3m.primitives.dsbox.Encoder', 'hyperparameters': {}}, 'impute_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKImputer', 'hyperparameters': {}}, 'extract_target_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Target', 'https://metadata.datadrivendiscovery.org/types/SuggestedTarget')}}, 'model_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKARDRegression', 'hyperparameters': {}}} 0.35931540324459654
Training rootMeanSquaredError = 0.3742661106678188
Test rootMeanSquaredError = 0.35931540324459654
******************
[INFO] Saving training results in /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed.txt
[INFO] report: 0.35931540324459654
[INFO] UCT updated: [20.73608406718704]
[INFO] cache size: 10
[INFO] Template 0:Default_regression_template Selected. UCT:[20.73608406718704]
[INFO] number of workers: 10
[INFO] Worker started, id: <_MainProcess(MainProcess, started)>
[INFO] Hit@cache: ('d3m.primitives.dsbox.Denormalize', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.datasets.DatasetToDataFrame', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', 2020361256340455972)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', -6157425058097195679)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Profiler', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CleaningFeaturizer', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CorexText', 7712785560202519328)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Encoder', -431603613755528502)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKImputer', -2308759725795285255)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKARDRegression', -6873354452746540900)
******************
[INFO] Writing results
{'fitted_pipeline': <dsbox.pipeline.fitted_pipeline.FittedPipeline object at 0x7f0bdfef4588>, 'training_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.3742661106678188}], 'cross_validation_metrics': [], 'test_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.35931540324459654}]}
{'denormalize_step': {'primitive': 'd3m.primitives.dsbox.Denormalize', 'hyperparameters': {}}, 'to_dataframe_step': {'primitive': 'd3m.primitives.datasets.DatasetToDataFrame', 'hyperparameters': {}}, 'extract_attribute_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Attribute',)}}, 'profiler_step': {'primitive': 'd3m.primitives.dsbox.Profiler', 'hyperparameters': {}}, 'clean_step': {'primitive': 'd3m.primitives.dsbox.CleaningFeaturizer', 'hyperparameters': {}}, 'corex_step': {'primitive': 'd3m.primitives.dsbox.CorexText', 'hyperparameters': {}}, 'encoder_step': {'primitive': 'd3m.primitives.dsbox.Encoder', 'hyperparameters': {}}, 'impute_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKImputer', 'hyperparameters': {}}, 'extract_target_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Target', 'https://metadata.datadrivendiscovery.org/types/SuggestedTarget')}}, 'model_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKARDRegression', 'hyperparameters': {}}} 0.35931540324459654
Training rootMeanSquaredError = 0.3742661106678188
Test rootMeanSquaredError = 0.35931540324459654
******************
[INFO] Saving training results in /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed.txt
[INFO] report: 0.35931540324459654
[INFO] UCT updated: [20.86579809043956]
[INFO] cache size: 10
[INFO] Template 0:Default_regression_template Selected. UCT:[20.86579809043956]
[INFO] number of workers: 10
[INFO] Worker started, id: <_MainProcess(MainProcess, started)>
[INFO] Hit@cache: ('d3m.primitives.dsbox.Denormalize', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.datasets.DatasetToDataFrame', -7456448384928637957)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', 2020361256340455972)
[INFO] Hit@cache: ('d3m.primitives.data.ExtractColumnsBySemanticTypes', -6157425058097195679)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Profiler', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CleaningFeaturizer', -8604379024263977179)
[INFO] Hit@cache: ('d3m.primitives.dsbox.CorexText', 7712785560202519328)
[INFO] Hit@cache: ('d3m.primitives.dsbox.Encoder', -431603613755528502)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKImputer', -2308759725795285255)
[INFO] Hit@cache: ('d3m.primitives.sklearn_wrap.SKARDRegression', -6873354452746540900)
******************
[INFO] Writing results
{'fitted_pipeline': <dsbox.pipeline.fitted_pipeline.FittedPipeline object at 0x7f0c94679f98>, 'training_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.3742661106678188}], 'cross_validation_metrics': [], 'test_metrics': [{'metric': 'rootMeanSquaredError', 'value': 0.35931540324459654}]}
{'denormalize_step': {'primitive': 'd3m.primitives.dsbox.Denormalize', 'hyperparameters': {}}, 'to_dataframe_step': {'primitive': 'd3m.primitives.datasets.DatasetToDataFrame', 'hyperparameters': {}}, 'extract_attribute_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Attribute',)}}, 'profiler_step': {'primitive': 'd3m.primitives.dsbox.Profiler', 'hyperparameters': {}}, 'clean_step': {'primitive': 'd3m.primitives.dsbox.CleaningFeaturizer', 'hyperparameters': {}}, 'corex_step': {'primitive': 'd3m.primitives.dsbox.CorexText', 'hyperparameters': {}}, 'encoder_step': {'primitive': 'd3m.primitives.dsbox.Encoder', 'hyperparameters': {}}, 'impute_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKImputer', 'hyperparameters': {}}, 'extract_target_step': {'primitive': 'd3m.primitives.data.ExtractColumnsBySemanticTypes', 'hyperparameters': {'semantic_types': ('https://metadata.datadrivendiscovery.org/types/Target', 'https://metadata.datadrivendiscovery.org/types/SuggestedTarget')}}, 'model_step': {'primitive': 'd3m.primitives.sklearn_wrap.SKARDRegression', 'hyperparameters': {}}} 0.35931540324459654
Training rootMeanSquaredError = 0.3742661106678188
Test rootMeanSquaredError = 0.35931540324459654
******************
[INFO] Saving training results in /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed.txt
[INFO] report: 0.35931540324459654
[INFO] UCT updated: [20.965409761365837]
[INFO] cache size: 10
*+*+*+*+*+*+*+*+*+*+
[INFO] Start test function
[INFO] No specified pipeline ID found, will load the latest crated pipeline.
The following pipeline file will be loaded:
/nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed/pipelines/1a071a29-4132-4f01-a766-dd93f0385f9f.json
[INFO] Pipeline load finished
[INFO] testing data:
[INFO] Finished: prediction results saving finished
[INFO] The prediction results is stored at:  /nfs1/dsbox-repo/qasemi/dsbox-ta2/python/output/26_radon_seed/predictions/1a071a29-4132-4f01-a766-dd93f0385f9f
[INFO] Testing Done
[INFO] The time used for running program is 4008.54 seconds.
proska commented 6 years ago

Fixed in commit d95d73494628e842ebbd2e368faa5f0cddf648e9