Azure / gpt-rag-ingestion

MIT License
52 stars 48 forks source link

asynchronous operations & updating openai version. #84

Closed gbecerra1982 closed 3 weeks ago

gbecerra1982 commented 3 weeks ago

This pull request focuses on refactoring the chunk_documents_docint.py, chunk_documents_raw.py, chunk_metadata_helper.py, and text_embedder.py files to use asynchronous operations. The most important changes include converting synchronous functions to asynchronous ones, updating Azure SDK imports to their asynchronous counterparts, and ensuring proper session and resource management.

Asynchronous Refactoring:

placerda commented 3 weeks ago

Refactored old chunkers into multiple chunks to be more modular and extensible, so changes in chunk_documents_docint.py, chunk_documents_raw.py, chunk_metadata_helper.py, and text_embedder.py wont be effective.