sambanova / generative_data_prep

Apache License 2.0
58 stars 8 forks source link

Improve the error handling for invalid jsonl line file input #98

Closed snova-zoltanc closed 2 months ago

snova-zoltanc commented 5 months ago

This error is hard to parse for user - we should specifically say why the JSONL is not formatted properly

RuntimeError: Hdf5TextBuffer exit failed with exc_type: <class 'json.decoder.JSONDecodeError'>, exc_val: Expecting value: line 2 column 1 (char 1), exc_tb: <traceback object at 0x7f850917b8c0>

snova-zoltanc commented 2 months ago

Done by @snova-jerrym in PR https://github.com/sambanova/generative_data_prep/pull/110