Open singloudly90 opened 5 months ago
cc @mattgotteiner
So is your goal to be able to repeatedly re-run prepdocs to pick up new files in ADLS2, without having to re-index existing files? I think we'd probably want to implement https://github.com/Azure-Samples/azure-search-openai-demo/pull/942 for both normal Blob storage and ADLS2, which would mean the MD5 would be stored in the blob itself, and we'd check against that.
@pamelafox this could be a great feature, since in production, the docs are sitting somewhere on the lake that other system maybe able to drop files into.
However i realised when I tried on the ACL solution, MD5 didnt create as expected compared to the solution without ACL. correct me if I am wrong: Without ACL solution: Upload files from local folder, MD5 generated at local folder, files uploaded to blob storage and to AI Search Index. With ACL solution: Upload files from local folder to datalake, datalake to AI Search.
These solution are difference in term of file processing...
This issue is for a: (mark with an
x
)Minimal steps to reproduce
Any log messages given by the failure
Expected/desired behavior
OS and Version?
azd version?
Versions
Mention any other details that might be useful