taras: Team, i wrote a little lib to help scrape stuff better: https://github.com/tarasglek/scrape2md would appreciate if someone would be my coauthor, first order of business would be to package it as a lib and figure out how to deploy it on cloudflare instead of our current scraper.
Let's get this into a usable state, which doing a few things across various PRs:
Finish and merge #370, which alters the flow of our proxy.ts to be more flexible for plugging in various transformers. This library is essentially a transformer for a few content types, and would be pretty easy to wrap with the new API.
Integrate this scraper via npm into the new proxy transformer structure. I suspect we'll have some fixes to do to make this work in the CloudFlare context.
From Discord:
Let's get this into a usable state, which doing a few things across various PRs: