Luodian / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
https://otter-ntu.github.io/
MIT License
3.54k stars 242 forks source link

When loading the DC.json dataset, I encountered the 'ijson.common.IncompleteJSONError: parse error: premature EOF' error. #291

Closed hcwei13 closed 9 months ago

hcwei13 commented 9 months ago

Thank you for your excellent work! I encountered some questions while running the training script. When loading the DC.json dataset, I encountered the 'ijson.common.IncompleteJSONError: parse error: premature EOF' error. This is the second time I've downloaded the data, so it's unlikely to be a dataset corruption issue. How should I resolve it?

with open(cur_images_path, "rb") as f: 
        for key, value in ijson.kvitems(f, "", use_float=True):
            self.images[key] = value

  File "/home/weihongchen/miniconda3/envs/otter/lib/python3.9/site-packages/ijson/utils.py", line 55, in coros2gen
    f.send(value)
  File "/home/weihongchen/miniconda3/envs/otter/lib/python3.9/site-packages/ijson/backends/yajl2_cffi.py", line 225, in basic_parse_basecoro
    yajl_parse(handle, buffer)
  File "/home/weihongchen/miniconda3/envs/otter/lib/python3.9/site-packages/ijson/backends/yajl2_cffi.py", line 196, in yajl_parse
    raise exception(error)
ijson.common.IncompleteJSONError: parse error: premature EOF

                     (right here) ------^

Before you open an issue, please check if a similar issue already exists or has been closed before.

When you open an issue, please be sure to include the following

Thank you for your contributions!

hcwei13 commented 9 months ago

@Luodian