This pull request includes several changes aimed at improving logging and error handling in the chunking and tools modules. The most important changes include adding logging statements to provide better insights during processing and enhancing the handling of document URLs.
Improvements to logging:
chunking/chunker_factory.py: Added logging to indicate the use of the Doc Intelligence API and to log when processing 'pptx' and 'docx' files without the required API. [1][2]
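As a rough illustration of the logging described above, the following is a minimal sketch, not the actual code in chunking/chunker_factory.py. The function name `describe_chunker_choice`, the `has_new_api` flag, the return labels, and the log level are all assumptions made for the example.

```python
import logging

logger = logging.getLogger("chunker_factory_sketch")

# Assumption for this sketch: these formats need the Doc Intelligence API.
FORMATS_REQUIRING_API = {"pptx", "docx"}


def describe_chunker_choice(extension: str, has_new_api: bool) -> str:
    """Log and return a label for the processing path a file would take.

    Hypothetical helper; the labels are illustrative only.
    """
    ext = extension.lower().lstrip(".")
    if ext in FORMATS_REQUIRING_API and not has_new_api:
        # Record that the file is being processed without the required API.
        logger.info("Processing '%s' file without the required Doc Intelligence API", ext)
        return "unsupported"
    if has_new_api:
        # Record that the Doc Intelligence API is being used.
        logger.info("Using Doc Intelligence API for '%s' file", ext)
        return "doc_intelligence"
    return "default"
```

Calling `describe_chunker_choice("docx", False)` in this sketch logs the missing-API message and returns `"unsupported"`.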
Enhancements to URL handling:
tools/doc_intelligence.py: Added URL decoding for blob names to handle encoded URLs correctly.
Codebase simplification:
tools/doc_intelligence.py: Simplified the import statement by combining imports from the same module.
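The URL decoding mentioned for tools/doc_intelligence.py could look something like the sketch below. It is an illustration only: the helper name `blob_name_from_url` and the assumed URL layout (`/<container>/<blob name>`) are not taken from the pull request.

```python
from urllib.parse import unquote, urlparse


def blob_name_from_url(file_url: str) -> str:
    """Return the percent-decoded blob name from a blob URL.

    Hypothetical helper; assumes the path has the form /<container>/<blob name>.
    """
    path = urlparse(file_url).path.lstrip("/")
    parts = path.split("/", 1)
    if len(parts) < 2 or not parts[1]:
        raise ValueError(f"No blob name found in URL: {file_url}")
    # Decode sequences such as %20 so the name matches the stored blob.
    return unquote(parts[1])
```

For example, a URL ending in `/docs/My%20Report.pdf` would yield the blob name `My Report.pdf` under these assumptions. The single `from urllib.parse import ...` line also shows the general idea of combining imports from the same module.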