hasadna / knesset-data-pipelines

Main repository for Open Knesset project - contains the knesset data scrapers and processing pipelines
https://oknesset.org/
MIT License
14 stars 26 forks source link

fix tika server errors #180

Open OriHoch opened 4 years ago

OriHoch commented 4 years ago

If succeeds but shows a lot of warnings / errors

committees/kns_documentcommitteesession
 Succeeded
Status
Pipeline
Source
Log
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,524 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462124-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,559 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462125-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,577 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462126-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,601 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462127-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,620 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462128-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,641 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462129-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,660 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462130-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,682 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462131-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,702 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462132-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,721 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462317-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: 2019-10-07 01:42:21,722 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462133-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,736 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462134-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,760 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462135-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,783 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462136-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,809 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462137-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,840 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462138-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,858 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462139-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,871 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462140-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:21,885 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462297-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,174 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-462317-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,437 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474895-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,452 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474896-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,466 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474901-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,481 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474903-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,496 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474906-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,509 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474909-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,832 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474895-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,874 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474896-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,901 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474901-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,934 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474903-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,967 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474906-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:22,986 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474909-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,150 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474969-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,193 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474970-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,215 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474971-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,240 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474972-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,270 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474974-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,304 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474975-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,492 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474976-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,574 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474977-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,587 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474978-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 62, in process_row
parse_meeting_protocols:     of.write(protocol.text)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,624 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474969-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,637 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474970-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,658 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474971-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,680 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474972-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,692 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474974-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,705 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474975-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,728 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474976-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,747 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474977-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: 2019-10-07 01:42:23,892 [MainThread  ] [WARNI]  Tika server returned status: 422
parse_meeting_protocols: WARNING :Tika server returned status: 422
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-474978-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 75, in parts
parse_meeting_protocols:     for line in re.sub("[ ]+", " ", self.text).split('\n'):
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 54, in text
parse_meeting_protocols:     text = decode(self.antiword_text, 'utf-8')
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/base.py", line 95, in antiword_text
parse_meeting_protocols:     self._tika_metadata = parsed['metadata']
parse_meeting_protocols: KeyError: 'metadata'
parse_meeting_protocols: DEBUG   :Starting new HTTP connection (1): tika:9998
parse_meeting_protocols: DEBUG   :http://tika:9998 "PUT /rmeta/text HTTP/1.1" 200 None
parse_meeting_protocols: ERROR   :exception parsing protocol for 23-475183-DOC
parse_meeting_protocols: Traceback (most recent call last):
parse_meeting_protocols:   File "/pipelines/committees/parse_meeting_protocols.py", line 66, in process_row
parse_meeting_protocols:     for part in protocol.parts:
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/cached_property.py", line 35, in __get__
parse_meeting_protocols:     value = obj.__dict__[self.func.__name__] = self.func(obj)
parse_meeting_protocols:   File "/usr/local/lib/python3.6/site-packages/knesset_data/protocols/committee.py", line 80, in parts
parse_meeting_protocols:     protocol_text[-1] += ':'
parse_meeting_protocols: IndexError: list index out of range
(sink): >>> PROCESSED ROWS: 138200
(sink): >>> PROCESSED ROWS: 138300
(sink): >>> PROCESSED ROWS: 138400
(sink): >>> PROCESSED ROWS: 138500
(sink): >>> PROCESSED ROWS: 138600
(sink): >>> PROCESSED ROWS: 138700
(sink): >>> PROCESSED ROWS: 138800
(sink): >>> PROCESSED ROWS: 138900
(sink): >>> PROCESSED ROWS: 139000
(sink): >>> PROCESSED ROWS: 139100
..datapackage_pipelines_knesset.dataservice.processors.add_dataservice_collection_resource: INFO    :Loaded 140000 dataservice objects
..datapackage_pipelines_knesset.common.processors.throttle: INFO    :processed 140088 rows 2801 pages, elapsed time (seconds)=1803.603795
(sink): >>> PROCESSED ROWS: 139200
(sink): >>> PROCESSED ROWS: 139300
(sink): >>> PROCESSED ROWS: 139400
(sink): >>> PROCESSED ROWS: 139500
(sink): >>> PROCESSED ROWS: 139600
(sink): >>> PROCESSED ROWS: 139700
(sink): >>> PROCESSED ROWS: 139800
(sink): >>> PROCESSED ROWS: 139900
(sink): >>> PROCESSED ROWS: 140000
(sink): >>> PROCESSED ROWS: 140100
..datapackage_pipelines_knesset.dataservice.processors.add_dataservice_collection_resource: INFO    :Processed 140389 rows
..datapackage_pipelines_knesset.common.processors.throttle: INFO    :Processed 140389 rows
download_document_committee_session: INFO    :Processed 140389 rows
parse_meeting_protocols: INFO    :Processed 140389 rows
parse_meeting_protocols: INFO    :Processed 140389 rows
knesset.dump_to_path: INFO    :Processed 140389 rows
knesset.dump_to_path: INFO    :uploading ../data/committees/kns_documentcommitteesession --> gs://knesset-data-pipelines/data/committees/kns_documentcommitteesession
knesset.dump_to_path: INFO    :: starting process
knesset.dump_to_path: INFO    :: process started
(sink): >>> PROCESSED ROWS: 140200
(sink): >>> PROCESSED ROWS: 140300
knesset.dump_to_path: INFO    :: process completed
knesset.dump_to_path: INFO    :: starting process
knesset.dump_to_path: INFO    :: process started
knesset.dump_to_path: INFO    :      3222  2019-10-07T01:43:37Z  gs://knesset-data-pipelines/data/committees/kns_documentcommitteesession/datapackage.json
knesset.dump_to_path: INFO    :  25530814  2019-10-07T01:43:38Z  gs://knesset-data-pipelines/data/committees/kns_documentcommitteesession/kns_documentcommitteesession.csv
knesset.dump_to_path: INFO    :TOTAL: 2 objects, 25534036 bytes (24.35 MiB)
knesset.dump_to_path: INFO    :: process completed
knesset.dump_to_sql: INFO    :Processed 140389 rows
(sink): >>> PROCESSED ROWS: 140389
knesset.dump_to_sql: INFO    :renaming sql table a66ab974e89f11e985f70a580a14009a --> committees_kns_documentcommitteesession