FlowiseAI / Flowise

Drag & drop UI to build your customized LLM flow
https://flowiseai.com
Apache License 2.0
29.43k stars 15.22k forks source link

[FEATURE] Ability to upload text,csv or pdf by endusers #2763

Closed mahmoudsorf closed 1 week ago

mahmoudsorf commented 2 months ago

Currently with simple text file uploads (applicable to pdf and csv files as well) The text needs to be converted to embeddings, stored to a vector DB then the chatbot can be used/published. If we can make this first step executed automatically if a user uploaded a text file within the embed-chat itself it would really be a huge improvement (of course file size limits should apply).

image

HenryHengZJ commented 1 month ago

same as - https://github.com/FlowiseAI/Flowise/issues/2444

haimco50 commented 1 month ago

Same as https://github.com/FlowiseAI/Flowise/issues/2084 as well :-) @HenryHengZJ - would it work by calling the Flowise API ? (i.e instead of passing base64 encoded image will pass PDF url or whatever Claude needs to support this) - any work around you could think of? btw, with the multi modality ramping up quickly this should be an higher priority item.

haimco50 commented 1 month ago

@HenryHengZJ - I took a look at Anthropic Claude API doc and it looks like the Create message API accepts either text or based64 encoded image which means that in order to support PDF/None image format they had to do some pre-processing extraction activity of text/images before calling the API so is this the way to plan to support this or maybe you plan on using some kind of embedding technique etc.. - I have a use case of PDF files that have only images in it (scanned documents) so I need to figure out the base way to support this - I can extract images from the PDF (one image for each PDF page) and then call the API with the extracted images but I am not sure this is the most effective way dealing with this request.

HenryHengZJ commented 1 week ago

Closing as https://github.com/FlowiseAI/Flowise/pull/3059 merged!

haimco50 commented 1 week ago

Wow - finally !! this is really amazing