run-llama / LlamaIndexTS

Data framework for your LLM applications. Focus on server side solution
https://ts.llamaindex.ai
MIT License
1.84k stars 353 forks source link

Documents excluding PDF are throwing an error in LlamaParse #1302

Open LawrenceGB opened 1 week ago

LawrenceGB commented 1 week ago

Using the LlamaCloud parser only works with PDFs and Images. If I upload a docx, txt etc it throws this error.

It should be agnostic and should work for these types also I believe?

Error: Failed to parse the file: c175a7e7-edbb-4287-9d3d-b0ca4b288d4b, status: ERROR
>      at LlamaParseReader.getJobResult (/Users/lg/Documents/code/contacted/backend/functions/node_modules/@llamaindex/cloud/reader/dist/index.cjs:597:23)
>      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
>      at async LlamaParseReader.loadJson (/Users/lg/Documents/code/contacted/backend/functions/node_modules/@llamaindex/cloud/reader/dist/index.cjs:650:32)
>      at async pdfHandler (/Users/lg/Documents/code/contacted/backend/functions/src/endpoints/training/file-upload/index.js:89:22)
>      at async FileUpload (/Users/lg/Documents/code/contacted/backend/functions/src/endpoints/training/file-upload/index.js:294:77)
>      at async /Users/lg/Documents/code/contacted/backend/functions/src/routes/training.js:38:9
>      at async runFunction (/usr/local/lib/node_modules/firebase-tools/lib/emulator/functionsEmulatorRuntime.js:506:9)
>      at async runCloudEvent (/usr/local/lib/node_modules/firebase-tools/lib/emulator/functionsEmulatorRuntime.js:523:5)
>      at async /usr/local/lib/node_modules/firebase-tools/lib/emulator/functionsEmulatorRuntime.js:690:21
ajpanyteam commented 1 week ago

I reported a similar issue with PPTX and DOCX files today. https://github.com/run-llama/LlamaIndexTS/issues/1313