CivicSpleen / ambry

A comprehensive data package manager
BSD 2-Clause "Simplified" License
calqualitycare.org-performance.yaml import error: KeyError: u'unknown' #125

Closed nmb10 closed 8 years ago

nmb10 commented 8 years ago

To reproduce, run:

python load_pre10.py ../../pre-10-bundles/converted/calqualitycare.org-performance/calqualitycare.org-performance.yaml

Output and error:

Starting import ../../pre-10-bundles/converted/calqualitycare.org-performance/calqualitycare.org-performance.yaml...
Loading bundle: calqualitycare.org-performance-0.0.1~d043001
INFO calqualitycare.org-performance ---- Synchronized ----
Starting ingest...
INFO calqualitycare.org-performance Ingesting: edcare from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: edcare.mpr
INFO calqualitycare.org-performance Ingesting: experience from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: experience.mpr
INFO calqualitycare.org-performance Ingesting: heartcond from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: heartcond.mpr
INFO calqualitycare.org-performance Ingesting: hospitals from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: hospitals.mpr
INFO calqualitycare.org-performance Ingesting: lung from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: lung.mpr
INFO calqualitycare.org-performance Ingesting: measures from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: measures.mpr
INFO calqualitycare.org-performance Ingesting: motherbaby from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: motherbaby.mpr
INFO calqualitycare.org-performance Ingesting: other from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: other.mpr
INFO calqualitycare.org-performance Ingesting: perf_sum from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: perf_sum.mpr
INFO calqualitycare.org-performance Ingesting: readmissions from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: readmissions.mpr
INFO calqualitycare.org-performance Ingesting: safety from http://ds.civicknowledge.org.s3.amazonaws.com/calqualitycare.org/Public%20Reports%202014Q3.xlsx
INFO calqualitycare.org-performance Ingested: safety.mpr
Starting schema...
INFO calqualitycare.org-performance Populating table: edcare
INFO calqualitycare.org-performance Populating table: heartcond
INFO calqualitycare.org-performance Populating table: lung
INFO calqualitycare.org-performance Populating table: hospitals
INFO calqualitycare.org-performance Populating table: motherbaby
INFO calqualitycare.org-performance Populating table: measures
INFO calqualitycare.org-performance Populating table: experience
INFO calqualitycare.org-performance Populating table: safety
INFO calqualitycare.org-performance Populating table: other
INFO calqualitycare.org-performance Populating table: perf_sum
INFO calqualitycare.org-performance Populating table: readmissions
Starting build...
INFO calqualitycare.org-performance ---- Phase: build ---
INFO calqualitycare.org-performance Processing 11 sources, stage main ; [u'edcare', u'experience', u'heartcond', u'hospitals', u'lung', u'measures', u'motherbaby', u'other', u'perf_sum', u'readmissions']
INFO calqualitycare.org-performance Running phase build for source edcare with pipeline build
INFO calqualitycare.org-performance Running phase build for source experience with pipeline build
INFO calqualitycare.org-performance Running phase build for source heartcond with pipeline build
Traceback (most recent call last):
  File "load_pre10.py", line 262, in <module>
    main()
  File "load_pre10.py", line 221, in main
    _build(b)
  File "load_pre10.py", line 198, in _build
    b.build(force=force)
  File "/home/nmb10/projects/ambry_project/ambry/bundle/bundle.py", line 1378, in build
    return self.run_phase('build', sources=sources, stage=stage, force=force)
  File "/home/nmb10/projects/ambry_project/ambry/bundle/bundle.py", line 1292, in run_phase
    self.phase_main(phase, stage=stage, sources=sources)
  File "/home/nmb10/projects/ambry_project/ambry/bundle/bundle.py", line 1242, in phase_main
    pl.run(count=rows_count)
  File "/home/nmb10/projects/ambry_project/ambry/etl/pipeline.py", line 1782, in run
    self.sink.run()
  File "/home/nmb10/projects/ambry_project/ambry/etl/pipeline.py", line 285, in run
    for i, row in enumerate(self._source_pipe):
  File "/home/nmb10/projects/ambry_project/ambry/etl/pipeline.py", line 139, in __iter__
    self.headers = self.process_header(next(rg))
  File "/home/nmb10/projects/ambry_project/ambry/etl/pipeline.py", line 139, in __iter__
    self.headers = self.process_header(next(rg))
  File "/home/nmb10/projects/ambry_project/ambry/etl/pipeline.py", line 139, in __iter__
    self.headers = self.process_header(next(rg))
  File "/home/nmb10/projects/ambry_project/ambry/etl/caster.py", line 237, in process_header
    type_f = c.valuetype_class
  File "/home/nmb10/projects/ambry_project/ambry/orm/column.py", line 174, in valuetype_class
    return self.types[self.datatype][1]
KeyError: u'unknown'
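
The last frame shows the failure is a plain dictionary lookup: `valuetype_class` indexes `self.types` with the column's `datatype`, and at least one column of the `heartcond` source still carries the datatype `unknown`. The sketch below reproduces that failure mode in isolation and shows one way to list the affected columns before the build starts. It is only an illustration under assumptions: `TYPES`, `Column`, and `find_unresolved` are hypothetical stand-ins, not ambry's actual code, and the real contents of the type table in `ambry/orm/column.py` are not shown in this report.

```python
# Hypothetical stand-in for the column type table. The real mapping in
# ambry/orm/column.py is not shown in this report; these entries are assumed.
TYPES = {
    "int": ("integer", int),
    "float": ("real", float),
    "str": ("text", str),
}


class Column(object):
    """Simplified column with the same lookup shape as the failing line."""

    def __init__(self, name, datatype):
        self.name = name
        self.datatype = datatype

    @property
    def valuetype_class(self):
        # Mirrors `self.types[self.datatype][1]` from the traceback:
        # raises KeyError when the datatype has no entry in the table.
        return TYPES[self.datatype][1]


def find_unresolved(columns):
    """Return the names of columns whose datatype is missing from TYPES."""
    return [c.name for c in columns if c.datatype not in TYPES]


if __name__ == "__main__":
    # Assumed example columns; the names are illustrative only.
    columns = [Column("id", "int"), Column("rate", "unknown")]
    print("Columns with unresolved datatype:", find_unresolved(columns))
```

Running a check like `find_unresolved` against a bundle's schema before the build phase would point at the specific column, rather than failing inside the pipeline with a bare `KeyError`.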