searchdaimon / enterprise-search

An open source search engine for corporate data and websites.
http://www.searchdaimon.com/
GNU General Public License v2.0

File size limit hardcoded in SMB crawler #31

Open runarbu opened 9 years ago

runarbu commented 9 years ago

The SMB crawler will not download files larger than 100 MiB (104857600 bytes). The limit is hardcoded at https://github.com/searchdaimon/enterprise-search/blob/f379ee87574aae83da0fa3cdc8e629ecc92ab1bc/src/crawlSMB/crawlsmb.c#L347

Maybe make this a per-collection option in struct collectionFormat (src/crawl/crawl.h) instead?