Closed by cldellow 1 year ago
    // Extract link graph
    "extract-links": {
      // optional; absent implies .*
      "url-regex": ".*",
      // optional
      "database": "dbname",
      // optional; defaults to dss_links
      "table": "dss_links",
    }
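As a rough sketch of how the defaults in this config might be resolved: the helper names `resolve_extract_links_config` and `should_extract` are hypothetical, and using `re.search` for `url-regex` is an assumption, since the issue does not say how the pattern is applied. No default is stated for `database`, so it is left as `None` here.

```python
import re

DEFAULT_URL_REGEX = ".*"     # "absent implies .*"
DEFAULT_TABLE = "dss_links"  # "defaults to dss_links"


def resolve_extract_links_config(config):
    """Return the extract-links options with defaults filled in, or None if absent."""
    options = config.get("extract-links")
    if options is None:
        return None  # the extract-links block is not configured
    return {
        "url-regex": options.get("url-regex", DEFAULT_URL_REGEX),
        # The issue states no default database, so None is kept as-is (assumption).
        "database": options.get("database"),
        "table": options.get("table", DEFAULT_TABLE),
    }


def should_extract(url, options):
    """Assumed semantics: extract links only from pages whose URL matches url-regex."""
    return re.search(options["url-regex"], url) is not None
```

With an empty `"extract-links": {}` block, this resolves to the `.*` pattern and the `dss_links` table, so every crawled URL would qualify.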
Needs https://github.com/cldellow/datasette-scraper#extract_from_responsescraper-config-url-response
Extracts, for each link: from (the page it appears on), to (the link target), and anchor text.
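A minimal sketch of what extracting (from, to, anchor text) could look like, using only the Python standard library. This is not the plugin's implementation: `LinkCollector` and `extract_links` are hypothetical names, and resolving relative hrefs against the page URL and collapsing whitespace in the anchor text are assumptions.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects (from, to, anchor_text) tuples for every <a href=...> in a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []    # finished (from, to, anchor_text) tuples
        self._href = None  # resolved href of the <a> currently open, if any
        self._text = []    # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL (assumption).
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace in the anchor text.
            anchor = " ".join("".join(self._text).split())
            self.links.append((self.base_url, self._href, anchor))
            self._href = None


def extract_links(url, html):
    """Return a list of (from, to, anchor_text) tuples found in html."""
    parser = LinkCollector(url)
    parser.feed(html)
    parser.close()
    return parser.links
```

Each tuple maps onto one row of the links table; anchors without an `href` are skipped.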
Future improvements: extract main focus image and its alt text, if there is one. Extract dofollow/nofollow.