Closed s4zuk3 closed 1 week ago
Was reviewing this today and found out that Tika metadata is actually Map<String, List
I'm halfway through making the necessary changes: adding apache.tika.Metadata to StringResult and ReaderResult, then converting to HashMap<String, Vec
Hey! Thanks for the recommendations, I applied the requested changes. I changed the Java step from HashMap to Tika.Metadata, fixed the HashMap to <String, Vec
Please let me know if any additional changes are needed. Thanks again, @nmammeri and @KapiWow.
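For anyone following along, here is a minimal sketch of why a multi-valued map matters: a metadata name can appear more than once, and a single-valued map would silently keep only one of its values. The function name `collect_metadata` and the `String` element type are assumptions for illustration only; this is not the code from the PR, and it leaves out the Java/JNI side entirely.

```rust
use std::collections::HashMap;

/// Hypothetical helper: groups (name, value) pairs into a multi-valued map,
/// keeping every value for a repeated name in the order it was seen.
fn collect_metadata<I>(pairs: I) -> HashMap<String, Vec<String>>
where
    I: IntoIterator<Item = (String, String)>,
{
    let mut map: HashMap<String, Vec<String>> = HashMap::new();
    for (name, value) in pairs {
        // `or_default` creates an empty Vec the first time a name appears.
        map.entry(name).or_default().push(value);
    }
    map
}
```

With this shape, two pairs sharing the same name end up as a two-element list under one key instead of overwriting each other.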
Thanks again @s4zuk3 for the updates. Amazing work. I've just made the checking logic of the metadata tests more stringent. made it so
In the previous code, if I removed some values from the lists, it was not detected. Likewise, if I removed some keys from the extracted metadata, that was not detected either.
I changed it so that we check that all keys in expected metadata are found and the values for every expected key are similar to the extracted values.
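A rough sketch of the stricter check described above, under stated assumptions: the names `Metadata` and `metadata_mismatches` are hypothetical, the values are assumed to be strings, and "similar" is interpreted here as "the same values regardless of order". The actual test code in the repository may compare differently.

```rust
use std::collections::HashMap;

// Assumed shape of the metadata map; the element type is a guess for this sketch.
type Metadata = HashMap<String, Vec<String>>;

/// Hypothetical helper: returns one message per problem found.
/// An empty result means every expected key is present in `extracted`
/// and its values match the expected ones (ignoring order).
fn metadata_mismatches(expected: &Metadata, extracted: &Metadata) -> Vec<String> {
    let mut problems = Vec::new();
    for (key, expected_values) in expected {
        match extracted.get(key) {
            // A key dropped from the extracted metadata is reported.
            None => problems.push(format!("missing key: {key}")),
            Some(actual_values) => {
                // Sort copies so the comparison does not depend on value order.
                let mut want = expected_values.clone();
                let mut got = actual_values.clone();
                want.sort();
                got.sort();
                // A value dropped from (or added to) a list is reported.
                if want != got {
                    problems.push(format!(
                        "values differ for {key}: expected {want:?}, got {got:?}"
                    ));
                }
            }
        }
    }
    // HashMap iteration order is unspecified; sort for stable output.
    problems.sort();
    problems
}
```

A test could then assert that `metadata_mismatches(&expected, &extracted)` is empty, which fails both when a key is missing and when a value has been removed from one of the lists.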
Here is the new revised version of the Tika Metadata implementation. Any comments and/or changes are welcome.
Once again, thanks to @nmammeri and @KapiWow for their help with the HashMap configuration. And thank you for the opportunity to contribute a small part to this great project!