microsoft / AzureSearch_JFK_Files

This repo contains the sample code of the Azure Search and Cognitive Services used to provide insights and analysis around the JFK Files.
MIT License
388 stars 225 forks source link

Using your own .PDF files #135

Open gorkam23 opened 1 year ago

gorkam23 commented 1 year ago

I'm pretty sure my issue lies within the indexer, but I have not been able to reindex my files after placing them in the storage account container.

I know the project doesn't support all formats, but I'd like to just see a few files indexed. Any help is appreciated.

Careyjmac commented 1 year ago

Did you reset the indexer and then run it again? The indexer in the project is not set to run on a schedule so requires a manual rerun in order for new files to be indexed.

gorkam23 commented 1 year ago

I absolutely did do that multiple times. But, do the files need to be in the file folder that is loaded into the storage container or can I put them directly in the storage container and then reindex?

Careyjmac commented 1 year ago

Should be fine just in the storage container.

gorkam23 commented 1 year ago

I am currently getting this error during the final indexing. I have been using .PDF files. Would they cause this error?

Thank you!

error

Careyjmac commented 1 year ago

Depends on the PDF files. I would need to see what the more specific error is, ideally. You should be able to check the errors in the Azure portal using this guide: https://learn.microsoft.com/en-us/azure/search/search-howto-monitor-indexers

gorkam23 commented 1 year ago

I am attaching some of the .PDF files. OJL_APP_187_6.pdf OJL_APP_187_5.pdf OJL_APP_187_4.pdf OJL_APP_187_3.pdf OJL_APP_187_2.pdf OJL_APP_187.pdf

I will go through the guide on troubleshooting the indexer. Thank you!