Closed evekhm closed 2 weeks ago
I do see that the information is there inside shards.entities, but entities itself is totally broken/missing/unusable.
Looking further at the issue:
Traceback (most recent call last):
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/dataclasses.py", line 405, in wrapper
    result = user_function(self)
  File "<string>", line 3, in __repr__
AttributeError: 'Entity' object has no attribute 'start_page'
Both start_page and end_page need to be made Optional (since this info is not provided by the Classifier).
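For illustration, here is a minimal sketch of how this kind of failure can arise in a dataclass and how Optional fields with a default avoid it. This is not the library's actual code: BrokenEntity, FixedEntity, type_, confidence and page_refs are hypothetical stand-ins, and it assumes the page fields are filled in after construction only when page information is present.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class BrokenEntity:
    # Hypothetical stand-in for the wrapper's Entity class.
    type_: str
    confidence: float
    page_refs: List[int] = field(default_factory=list, repr=False)
    # No default: the attribute only exists if __post_init__ assigns it.
    start_page: int = field(init=False)
    end_page: int = field(init=False)

    def __post_init__(self) -> None:
        # Assumption: splitter output has page references, classifier output does not.
        if self.page_refs:
            self.start_page = self.page_refs[0]
            self.end_page = self.page_refs[-1]


@dataclass
class FixedEntity:
    type_: str
    confidence: float
    page_refs: List[int] = field(default_factory=list, repr=False)
    # Optional with a default of None: the attribute always exists.
    start_page: Optional[int] = field(init=False, default=None)
    end_page: Optional[int] = field(init=False, default=None)

    def __post_init__(self) -> None:
        if self.page_refs:
            self.start_page = self.page_refs[0]
            self.end_page = self.page_refs[-1]


if __name__ == "__main__":
    try:
        print(BrokenEntity("invoice", 0.97))  # classifier-style input, no pages
    except AttributeError as exc:
        print("repr failed:", exc)

    print(FixedEntity("invoice", 0.97))             # classifier-style input
    print(FixedEntity("invoice", 0.97, [0, 1, 2]))  # splitter-style input
```

With the fixed variant, classifier-style entities keep their type and confidence, and callers can check whether start_page is None before using the page range.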
Hello,
The wrapped_document, when created with document.from_batch_process_metadata (or any other method), will be missing the entities field when using data from the Classifier. When using the output of the splitter, everything works fine. But with the classifier, you won't get any important information such as type and confidence.
output-document_split.json output-document_classify.json