issues
search
openzim
/
wikihow
WikiHow scraper
https://download.kiwix.org/zim/wikihow/
GNU General Public License v3.0
15
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
wikihow is not retrying to fetch illustrations
#164
benoit74
opened
1 month ago
0
wikihow does fetch article CSS properly (at least it does not uses a retry mechanism)
#163
benoit74
opened
1 month ago
1
pywikiapi: why + update
#162
benoit74
opened
3 months ago
2
wikihow does not retry API requests
#161
benoit74
opened
4 months ago
3
Release 1.3.0
#160
benoit74
opened
4 months ago
0
Release 1.2.3
#159
benoit74
closed
4 months ago
1
Move to cssbeautifier and fix delays in category scraping
#158
benoit74
closed
4 months ago
3
Category scraping in not honouring the configured `delay`
#157
benoit74
closed
4 months ago
0
Issues around category processing
#154
benoit74
closed
4 months ago
1
wikihow_en_endless_arts-and-entertainment is failing
#155
benoit74
closed
1 month ago
5
wikihow2zim.scraper.DomIntegrityError: #content_wrapper not found
#153
benoit74
closed
4 months ago
6
Object of type Response is not JSON serializable
#152
benoit74
closed
4 months ago
1
css-beautify has been removed
#151
benoit74
closed
4 months ago
3
WikiHow does not scrape anymore, too many http requests
#150
kelson42
closed
9 months ago
13
Upgrade python-scraperlib to 3.x, including CLI support for description / long_description flags
#149
benoit74
opened
10 months ago
0
TypeError: Object of type Response is not JSON serializable
#148
kelson42
closed
4 months ago
3
Too many redirects
#147
kelson42
closed
1 year ago
1
Improve favicon
#146
kelson42
opened
1 year ago
1
Scraping broken
#145
kelson42
closed
1 year ago
1
Release 1.2.2
#144
kelson42
closed
1 year ago
1
added global 15mn pause on 429 response
#143
rgaudin
closed
1 year ago
0
Handle 429 responses
#142
rgaudin
closed
1 year ago
0
Retry backoff too short
#141
rgaudin
closed
1 year ago
0
TypeError: 'NoneType' object is not callable
#140
kelson42
closed
1 year ago
1
TypeError: 'NoneType' object is not callable
#139
kelson42
closed
1 year ago
1
Unable to retrieve image in CSS leads to crash
#138
kelson42
closed
1 year ago
0
wikihow2zim: error: argument --missing-article-tolerance: invalid int value: '1.0'
#137
kelson42
closed
1 year ago
1
use a single Session for all requests
#136
rgaudin
closed
1 year ago
0
Implement keep-alive HTTP header
#135
kelson42
closed
1 year ago
0
Prevent duplicate categories/articles
#134
rgaudin
closed
1 year ago
1
Latest wikiHow (en) has a lot of broken links
#133
holta
closed
1 year ago
4
Homepage added twice
#132
kelson42
closed
2 years ago
0
Wikihow scrape stuck at the end
#131
kelson42
closed
2 years ago
3
Better handle articles which ar in Q&A quarantine
#130
kelson42
closed
1 year ago
4
wikiHow EN doesn't pass DOM Integrity Check anymore
#129
kelson42
closed
2 years ago
2
Missing article on wikiHow DE
#128
kelson42
closed
2 years ago
1
Latest wikihow tr makes only 40MB
#127
kelson42
closed
2 years ago
2
WikiHow pt 30% smaller between Feb and May
#126
kelson42
closed
2 years ago
8
Stop the scrape if too many 404
#125
kelson42
closed
2 years ago
2
Latest WikiHow in Thai is almost empty
#124
kelson42
closed
2 years ago
7
wikihow tr loops over same subcat while building categories list
#123
kelson42
closed
2 years ago
1
ConnectionError during Wikihow EN
#122
kelson42
closed
1 year ago
6
Full wikihow triggers block
#121
kelson42
closed
2 years ago
4
ERROR:object of type 'NoneType' has no len()
#120
kelson42
closed
2 years ago
0
Don't crash on Image: articles
#119
rgaudin
closed
2 years ago
0
Incorect homepage for single category
#118
rgaudin
closed
2 years ago
0
Must handle ConnectionTimeout in API requests
#117
rgaudin
closed
2 years ago
0
Scrape using pre-populated expected lists computed through API calls
#116
fadiga
closed
2 years ago
0
Use an expected list
#115
rgaudin
closed
2 years ago
0
Remove references to not-included articles
#114
rgaudin
closed
2 years ago
6
Next