Open helen-m-lin opened 4 months ago
@helen-m-lin did that linked PR fix this issue? If so we can go ahead and close this out
@mekhlakapoor no, it was just a hotfix to add error handling so the indexer job doesn't crash completely. We still need this bug ticket to resolve the actual issue.
Describe the bug
Some uploads from Jul 16 could not be written to docdb. The indexer errors out after a WriteError in `_process_prefix()` and `_process_codeocean_record()`. It was found that the code does not first check `is_dict_corrupt` for existing metadata.nd.json. Additionally, the current `is_dict_corrupt` does not check the field names in nested lists.

To Reproduce
Expected behavior
- Existing metadata.nd.json should first be checked with `aind_data_access_api.utils.is_dict_corrupt`.
- `is_dict_corrupt` should check nested lists recursively.

Additional context
The errors are currently causing the job to crash completely. A hotfix will be implemented to add error handling for processing each record.
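
For discussion, here is a minimal sketch of what a recursive check that also descends into lists could look like. This is not the current `aind_data_access_api` implementation; the function name `has_corrupt_keys` is hypothetical, and the sketch assumes that "corrupt" means a field name containing `$` or `.`.

```python
from typing import Any

# Assumption: a field name is "corrupt" if it contains "$" or ".".
FORBIDDEN_CHARS = ("$", ".")


def has_corrupt_keys(value: Any) -> bool:
    """Hypothetical helper: return True if any dict key at any depth,
    including dicts nested inside lists, contains a forbidden character."""
    if isinstance(value, dict):
        for key, nested in value.items():
            # str() guards against non-string keys in in-memory dicts.
            if any(ch in str(key) for ch in FORBIDDEN_CHARS):
                return True
            if has_corrupt_keys(nested):
                return True
        return False
    if isinstance(value, list):
        # Descend into every list element, which may itself be a dict or list.
        return any(has_corrupt_keys(item) for item in value)
    # Scalars (str, int, None, ...) have no field names to check.
    return False
```

With this sketch, `has_corrupt_keys({"procedures": [{"bad.key": 1}]})` returns `True`, whereas a check that only recurses into dict values would miss the key inside the list.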