internetarchive / Zeno

State-of-the-art web crawler 🔱
GNU Affero General Public License v3.0

End to End Testing #162

Open willmhowes opened 1 week ago

willmhowes commented 1 week ago

Using a mocked server, perform a series of crawls that test whether the resulting WARC matches the parameters given to the crawl at runtime. To be clear, this feature is NOT testing the validity of a WARC under the WARC spec; that testing is intended to be handled by the WARC-writing library used by Zeno.

Here is the list of configurable flags printed by the binary's -h option: