From what I can see, several font files from fontbox.cmap are missing and need to be included in the Tika native image. I only see a few of them in the configuration. Is it possible to include all of them in the configuration?
Hello @s4zuk3!
Yes, we can add the missing entities to the config. Could you provide a PDF for testing? Perhaps a short example of when it doesn't work would also help.
Hello! While trying to extract content from a PDF, I got the following error with very little information:
After modifying the code, I was able to extract the full error, which is as follows:
From what I can see, several font files from fontbox.cmap are missing and need to be included in the Tika native image. I only see a few of them in the configuration. Is it possible to include all of them in the configuration?
Thanks!