ivrit-ai / ivrit.ai

ivrit.ai codebase
MIT License
24 stars 9 forks source link

Issue downloading Knesset Plenum data #53

Open yairl opened 3 weeks ago

yairl commented 3 weeks ago

Describe the bug Multiple Plenums failing on download. Protocol IDs: 120892, 120940

Error:

  File "/home/yair/dev/ivrit.ai/./download_knesset_recording.py", line 274, in parse_plenum_transcript
    if extraction_context["latest_closed_ts_marker"] < extraction_context["current_text_ts"]:
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: '<' not supported between instances of 'int' and 'Timedelta'

To Reproduce ./download_knesset_recording.py --type plenum --http-headers-file header-file --target-dir knesset/ --ids 120892 --ids 120940

Expected behavior Download completes successfully.