richardpaulhudson / holmes-extractor

Information extraction from English and German texts based on predicate logic
MIT License
134 stars 12 forks

ERROR executing register_serialized_documents() on Worker 0. Please examine the output from the worker processes to identify the problem. #10

Closed rahul1g2 closed 1 year ago

rahul1g2 commented 2 years ago

I am running the sample code to see the serialized_document format.

Loading Book 1 ‘The Philosopher's Stone’ Ch 1 ‘THE BOY WHO LIVED’.hdc Book 1 ‘The Philosopher's Stone’ Ch 1 ‘THE BOY WHO LIVED’

{"Book 1 ‘The Philosopher's Stone’ Ch 1 ‘THE BOY WHO LIVED’": 'tly normal, thank you very much. They were the last people you\xe2\x80\x99d expect to be involved in anything strange or mysterious, because they just didn\xe2\x80\x99t hold with such nonsense. Mr. Dursley was the director of a firm called Grunnings, which made drills. He was a big, beefy man with hardly any neck, although he did have a very large mustache.'}

I'm not sure what's wrong. Could you please recommend the format for passing a large text file so that the data loads effectively?

richardpaulhudson commented 1 year ago

There should be more error output further up (from Worker 0): could you post it please?
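In the meantime, here is a minimal sketch of how the input dictionary could be built. It assumes that `register_serialized_documents()` expects a dictionary mapping each label to the raw bytes previously written out from `serialize_document()`, not to plain text. The helper names `load_serialized_documents` and `non_bytes_labels` are hypothetical and only illustrate the idea. The value in the dictionary posted above is a `str` of plain text, which may be related to the error.

```python
from pathlib import Path


def load_serialized_documents(directory):
    """Map each label (file name without '.hdc') to the file's raw bytes.

    Hypothetical helper: assumes each '.hdc' file holds one serialized document.
    """
    documents = {}
    for path in sorted(Path(directory).glob("*.hdc")):
        # Read in binary mode so the serialized content stays as bytes.
        documents[path.stem] = path.read_bytes()
    return documents


def non_bytes_labels(documents):
    """Return the labels whose values are not bytes (for example plain-text strings)."""
    return [label for label, value in documents.items()
            if not isinstance(value, bytes)]
```

If `non_bytes_labels(documents)` returns an empty list, the dictionary could then be passed on with something like `manager.register_serialized_documents(documents)`, where `manager` is assumed to be an existing Holmes `Manager` instance.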

richardpaulhudson commented 1 year ago

Closing owing to lack of activity.