Open vffuunnyy opened 1 month ago
@vffuunnyy I had the same issue, had to add an extra statement to take [0] of that only if splitter.split_text("\n".join([d.text for d in docs]))
was not empty, after which it worked fine.
Ended up doing this
text = splitter.split_text("\n".join([d.text for d in docs]))
if len(text) >0:
text = text[0]
documents.append(Document(text=text, metadata=docs[0].metadata))
This is but a hack to get it running past this point and not a proper fix
Please, configure
structlog
for better logging :) I did it myself, like this (not perfect, but at least something):Okay. About exception. I got this one:
While processing this file:
Cards.pdf