LLM-based text extraction from unstructured data like PDFs, Words and HTMLs. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!
Looks like most the tests are for object init with external mocks, which may not be valuable. Let me consolidate some thoughts and get back with some suggestions
@DavidHHShao @SayaZhang Please help review.