FlowiseAI / Flowise

Drag & drop UI to build your customized LLM flow
https://flowiseai.com
Apache License 2.0

[BUG] PDF FILE UPLOAD Error: vectorsService.upsertVector - UnknownErrorException: PDFDocument: stream must have data #2312

Closed TheVoidRoger closed 1 month ago

TheVoidRoger commented 5 months ago

Describe the bug: The PDF File node fails with an error when upserting to Pinecone, even though the PDF file has been uploaded. The file is smaller than 5 MB.

To Reproduce Steps to reproduce the behavior:

  1. Connected the following nodes: Recursive Character Text Splitter with chunk size 1000 and chunk overlap 100 --> PDF File with a PDF smaller than 5 MB --> Pinecone with a valid API key, index and namespace --> Conversational Retrieval QA Chain --> ChatOpenAI.
  2. Clicked the green 'Upsert Vector Database' button
  3. The following error is shown: "Error: vectorsService.upsertVector - UnknownErrorException: PDFDocument: stream must have data"
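
For readers unfamiliar with the splitter settings in step 1, here is a minimal sketch of what a chunk size of 1000 with an overlap of 100 means. It is a simplified fixed-window splitter, not the recursive algorithm the Recursive Character Text Splitter node actually uses, and `splitWithOverlap` is a hypothetical name chosen for illustration.

```typescript
// Simplified fixed-window splitter (assumption: illustration only, NOT the
// recursive, separator-aware algorithm used by the Flowise/LangChain node).
function splitWithOverlap(text: string, chunkSize: number, chunkOverlap: number): string[] {
  if (chunkSize <= 0) throw new Error("chunkSize must be positive");
  if (chunkOverlap < 0 || chunkOverlap >= chunkSize) {
    throw new Error("chunkOverlap must be in [0, chunkSize)");
  }
  // Each window starts (chunkSize - chunkOverlap) characters after the previous one,
  // so consecutive chunks share chunkOverlap characters.
  const step = chunkSize - chunkOverlap;
  const chunks: string[] = [];
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // this window reached the end of the text
  }
  return chunks;
}

// 2500 characters with the settings from the report (1000 / 100).
const sampleText = "a".repeat(2500);
const chunks = splitWithOverlap(sampleText, 1000, 100);
console.log(chunks.map((c) => c.length)); // [ 1000, 1000, 700 ]
```

Note that an empty input yields no chunks at all, so if the PDF content never reaches the loader there is nothing to split or upsert.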

Expected behavior: The upsert should complete successfully. When I replaced the "PDF File" node with a "Plain Text" node, the data was upserted successfully.

Setup: DigitalOcean, via Docker.

HenryHengZJ commented 5 months ago

Not sure if this helps, but try using the legacy build: [image]

TheVoidRoger commented 5 months ago

Thank you @HenryHengZJ. I tried it, but no luck. I noticed that with the PDF node, the flow won't even save as soon as I've chosen a PDF (I tried a couple of PDFs with varying file sizes and compression). I'm forced to refresh the page, which removes the PDF I had selected.

TheVoidRoger commented 3 months ago

Update: I've re-deployed it to Render and it works. I think something is blocking the PDF upload in DigitalOcean's backend. It's not just the PDF File node, as this issue also occurs with the "Unstructured File Loader" node inside the DO environment.

Has anyone experienced the same issue with DO? Have you found a solution?
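
One way to narrow this down is to check whether the uploaded bytes actually arrive on the server. The error text "stream must have data" suggests that the PDF parser was handed an empty stream, which would be consistent with the upload being dropped before it reaches the loader. The sketch below is a hypothetical diagnostic, not part of Flowise: `checkPdfBytes` and `PdfCheck` are names made up for illustration, and it assumes you can get hold of the raw bytes the server received.

```typescript
// Result of the check: either fine, or a short reason why the payload looks wrong.
type PdfCheck = { ok: true } | { ok: false; reason: string };

// Hypothetical helper (assumption: not a Flowise API). Inspects the bytes
// before they would be passed to a PDF parser.
function checkPdfBytes(bytes: Uint8Array): PdfCheck {
  if (bytes.length === 0) {
    return { ok: false, reason: "payload is empty (0 bytes)" };
  }
  // A PDF file normally begins with the ASCII marker "%PDF-".
  const header = [0x25, 0x50, 0x44, 0x46, 0x2d];
  const matches =
    bytes.length >= header.length && header.every((b, i) => bytes[i] === b);
  return matches
    ? { ok: true }
    : { ok: false, reason: "payload does not start with %PDF-" };
}

// An empty upload versus bytes that begin with "%PDF-1.7".
const emptyResult = checkPdfBytes(new Uint8Array(0));
const validResult = checkPdfBytes(
  new Uint8Array([0x25, 0x50, 0x44, 0x46, 0x2d, 0x31, 0x2e, 0x37])
);
console.log(emptyResult.ok, validResult.ok); // false true
```

If the check reports an empty payload on DigitalOcean but passes on another host with the same file, the problem is more likely in the hosting path (proxy, body size limits, storage) than in the PDF itself.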

JNNelson commented 3 months ago

@TheVoidRoger Hello. How did you solve it? I have the same problem on a Contabo VPS with Docker v26.1.4. I've already tried version 1.8.3 and also 1.72.

HenryHengZJ commented 1 month ago

Try the latest version; it should be working now. Feel free to re-open if the issue persists.