Use metadata-embedded contentUrl for direct access, fallback on contentid. (software heritage has rate limiting of 120 calls, and hash-archive.org has proved unstable, so content resolution is less reliable than desired).
Avoids downloading parquet directly, and builds persistent duckdb tables instead of Views from remote parquets. native tables enables potential for full-text search.
update database to 23.01
contentUrl
for direct access, fallback on contentid. (software heritage has rate limiting of 120 calls, and hash-archive.org has proved unstable, so content resolution is less reliable than desired).