-
Crawl Wikipedia pages.
Settings:
- Secret token to send data to locodb
- Initial URL(s)
- Time till next request
Data extracted:
- URL
- `is_city`
- population
- census date (or other latests measu…
-
Create Dutch Wikipedia article about [Zimmerman en Space](https://www.buzzsprout.com/2096278) and its predecessor [Zimmerman in Space](https://juke.nl/podcasts/zimmerman-in-space)
(https://www.npor…
-
The wikipedia says "Bornova is a municipality and [district](https://en.wikipedia.org/wiki/Districts_of_Turkey) of [İzmir Province](https://en.wikipedia.org/wiki/%C4%B0zmir_Province), [Turkey](https:/…
-
### Prerequisites
- [X] I [searched for any existing report](https://github.com/darkreader/darkreader/issues?q=is%3Aissue) about this website issue to avoid opening a duplicate.
- [X] I can reproduce…
-
IPFS has made some progress on read-only snapshots of Wikipedia
https://github.com/ipfs/distributed-wikipedia-mirror
Might be able to work together on making a read-write decentralized Wikipedia…
-
check/add references to CalConnect in wikipedia on calendaring and scheduling related articles.
languages:
- en for sure
- others?
wsdwl updated
6 years ago
-
```
minet wikipedia pageviews pages_wikipedia -i test_wikipedia.csv --start-date 2012 --end-date 2024 > pages_test_wikipedia.csv
Collecting pageviews ━━━━━━━━━━━━ 0/2 pages ⠦ [ 0%] in 510.50ms (?/s…
bmaz updated
4 months ago
-
Look at different methods of search.
- https://github.com/google-research/google-research/tree/master/scann
- https://github.com/facebookresearch/faiss
- https://github.com/nmslib/hnswlib
- http…
-
An obvious source of decent quality, freely usable sentences is Wikipedia/Wiktionary. It would not be that difficult to download a database dump and extract them. And it would be useful to have non-fi…
-
https://en.wikipedia.org/wiki/List_of_colors