Open nelsonic opened 5 years ago
progress: (all the next pages to crawl are being inserted for an Org)
SELECT
next_page,
COUNT (next_page) AS c
FROM
logs
WHERE next_page IS NOT null
GROUP BY
next_page
ORDER BY
c asc
limit 1;
Context: I'm trying to get the next_page
we need to view that has not been viewed before. 🔍
SELECT
next_page,
COUNT (next_page) AS c
FROM
logs
WHERE next_page IS NOT null
AND next_page NOT IN (
SELECT path
FROM logs
WHERE path IS NOT NULL
)
GROUP BY
next_page
ORDER BY
c ASC
LIMIT 1;
Stars are being saved! ⭐️
As part of the example app we are assembling #51 we need to source fresh data. I propose writing a "crawler" to index all of @dwyl's repositories and people. The crawler should:
log
table asnext_page
next_page
to be crawled from the DB and keep going ...