webrecorder / warcit

Convert Directories, Files and ZIP Files to Web Archives (WARC)
https://pypi.python.org/pypi/warcit
Apache License 2.0
81 stars 13 forks source link

Add Apache Tika support for MIME type and character set detection, structural reporting, and mapping of files to URLs and other properties via CSV #2

Closed despens closed 6 years ago

despens commented 6 years ago

Added and tested support for Apache Tika for MIME type and/or encoding detection.