Open xuze1993 opened 5 years ago
The code could be modified to read PDF documents (with a PDF library) while crawling and index them, but a copy of each file would need to be kept so the user can open it from the search results page. That's not good, I think.
Or we could just use the original PDF link in the search results, but if the original file is no longer available, you won't be able to open it.
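To make the trade-off concrete, here is a minimal sketch of how a search result could prefer the original link and fall back to a kept copy only when needed. This is not localgoogle's actual code: `IndexEntry`, `result_link`, and the `is_reachable` callback are hypothetical names made up for illustration, and the text is assumed to have been extracted already by whatever PDF library is used.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Optional


@dataclass
class IndexEntry:
    """Hypothetical index record for one crawled document."""
    text: str                          # extracted text used for searching
    original_url: str                  # where the crawler found the document
    local_copy: Optional[Path] = None  # optional kept copy of the file


def result_link(entry: IndexEntry,
                is_reachable: Callable[[str], bool]) -> Optional[str]:
    """Pick the link to show in a search result.

    `is_reachable` is supplied by the caller (for example an HTTP HEAD
    check); it is an assumption of this sketch, not an existing API.
    """
    # Prefer the original link while it still works.
    if is_reachable(entry.original_url):
        return entry.original_url
    # Otherwise fall back to the kept copy, if one exists on disk.
    if entry.local_copy is not None and entry.local_copy.exists():
        return entry.local_copy.resolve().as_uri()
    # Neither is available: the result can be found but not opened.
    return None
```

With this shape, keeping a copy becomes optional per document: entries without `local_copy` behave like the "original link only" option, and entries with one survive the original going offline.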
Gotcha, nice work anyway. Sadly, fewer static sites are left on the web.
I've pulled a website with WebHTTrack that is a mix of PDF and HTML files. It seems that localgoogle can only index HTML files. Is there any way to solve this problem?