bellingcat / auto-archiver

Automatically archive links to videos, images, and social media content from Google Sheets (and more).
https://pypi.org/project/auto-archiver/
MIT License
552 stars, 55 forks

Extract text in wacz_enricher #110

Closed: liliakai closed this 10 months ago

liliakai commented 10 months ago

The enricher already passes the --text flag to browsertrix, so the crawl output already contains the extracted page text. This commit reads that text out of the crawl output and exposes it in the result metadata.

MVP for #107
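
For illustration only, here is a minimal sketch of the kind of extraction described above. It is not the code from this PR. It assumes the crawl produces a WACZ file (a ZIP archive) whose page records are stored in `pages/pages.jsonl`, one JSON object per line, with the extracted text under a `text` key. The function name `extract_page_text` is hypothetical.

```python
import io
import json
import zipfile


def extract_page_text(wacz_file):
    """Return {url: text} for every page record in a WACZ that carries text.

    Assumption: page records live in pages/pages.jsonl inside the archive,
    and each record stores the extracted text under the "text" key.
    """
    texts = {}
    with zipfile.ZipFile(wacz_file) as archive:
        with archive.open("pages/pages.jsonl") as pages:
            for raw_line in pages:
                line = raw_line.strip()
                if not line:
                    continue
                record = json.loads(line)
                # Skip header lines (no "url") and pages without any text.
                if "url" in record and record.get("text"):
                    texts[record["url"]] = record["text"]
    return texts


if __name__ == "__main__":
    # Build a tiny in-memory archive with the assumed layout and read it back.
    sample = io.BytesIO()
    with zipfile.ZipFile(sample, "w") as archive:
        lines = [
            json.dumps({"format": "json-pages-1.0", "id": "pages"}),
            json.dumps({"url": "https://example.com/", "text": "Example Domain"}),
        ]
        archive.writestr("pages/pages.jsonl", "\n".join(lines) + "\n")
    sample.seek(0)
    print(extract_page_text(sample))
```

An enricher could then attach the returned mapping (or just the text for the archived URL) to its result metadata. How that metadata is keyed is left open here.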