anthonydb / pneumatic

pneumatic is a bulk-upload library for DocumentCloud.
MIT License
23 stars 3 forks source link

Determine prohibited files for upload #5

Open anthonydb opened 8 years ago

anthonydb commented 8 years ago

DocumentCloud handles anything that LibreOffice can open and turn into a PDF. That would exclude things like Microsoft Outlook .pst files or Access databases. The sanitize_uploads function looks for files with these extensions and removes them from the list of files to upload. We need to research a thorough list of extension types to exclude.