Closed: sangee2004 closed this 3 months ago
@iwilltry42 When I test with the latest knowledge code, I now see "No documents found", which is similar to what I used to see with CSV files (https://github.com/gptscript-ai/knowledge/issues/6):
/usr/local/bin/knowledge ingest -d testdocx /Users/sangeethahariharan/Downloads/demo.docx
2024/05/03 13:57:51 INFO IngestOpts opts="{Filename:0x1400cdac120 FileMetadata:0x1400ae68e00 IsDuplicateFuncName: IsDuplicateFunc:0x104f04920}"
2024/05/03 13:57:51 ERROR No documents found
2024/05/03 13:57:51 no documents found
Confirmed. I'll debug this.
@sangee2004 can you re-test now that @StrongMonkey's changes are in?
@sangee2004 Since there is no new release yet, you will have to build the binary from main.
@StrongMonkey I should be able to test these changes using a remote tool call to knowledge, which should pick up the latest fixes, right?
Tested with the latest knowledge repo. I am able to ingest and query docx documents.
Steps to reproduce the problem:
Execute make run from https://github.com/gptscript-ai/knowledge to launch knowledge in server mode.
Place bin/knowledge on the path (/usr/local/bin).
The following is the output I get, which does not match the contents of the docx.
Note: I am not able to decode the contents of the retrieve tool call output. Decoding it from Base64 produces garbage output.
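One quick way to sanity-check output like this is to decode it and see what kind of bytes come back. The sketch below is only an illustration: it assumes the retrieve tool returns standard Base64, which this thread does not confirm, and `classify_payload` is a hypothetical helper, not part of knowledge or gptscript. It relies on the fact that a .docx file is a ZIP container, so decoded bytes that start with the ZIP magic number suggest the raw file was encoded rather than its extracted text.

```python
import base64
import binascii

# ZIP local file header magic; .docx files are ZIP containers.
ZIP_MAGIC = b"PK\x03\x04"


def classify_payload(payload: str) -> str:
    """Classify a string that is assumed (not known) to be standard Base64.

    Returns one of: "invalid-base64", "text", "zip-like-binary", "binary".
    """
    try:
        # validate=True rejects characters outside the Base64 alphabet.
        raw = base64.b64decode(payload, validate=True)
    except (binascii.Error, ValueError):
        return "invalid-base64"

    if raw.startswith(ZIP_MAGIC):
        # Likely a raw ZIP/docx container rather than extracted text.
        return "zip-like-binary"

    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return "binary"
    return "text"


if __name__ == "__main__":
    # Hypothetical sample; a real check would use the tool call output.
    sample = base64.b64encode("hello from demo.docx".encode("utf-8")).decode("ascii")
    print(classify_payload(sample))
```

If the result is "zip-like-binary" or "binary", the garbage seen after decoding may simply be non-text bytes rather than a decoding mistake, which would point at the ingestion step instead of the Base64 handling.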
Debug logs for gptscript execution:
Logs from knowledge: