openzim/python-scraperlib
Collection of Python code to re-use across Python-based scrapers
GNU General Public License v3.0
18 stars, 16 forks
issues
Fixed download test inconsistency
#77
rgaudin
closed
3 years ago
1
Ensure filename in StaticArticle is not None
#76
satyamtg
closed
3 years ago
1
Image optimization on the fly (for JPEG/PNG/WebP)
#75
satyamtg
closed
3 years ago
1
Dependency resolve error with urllib3 when using in combination with kiwixstorage
#74
satyamtg
closed
3 years ago
2
Fix CI to use new GA way to set env
#72
rgaudin
closed
3 years ago
1
Stream downloads
#71
satyamtg
closed
3 years ago
1
Add ability to return content as bytes in save_large_file
#70
satyamtg
closed
3 years ago
3
ZIM creator API not working properly
#69
satyamtg
closed
3 years ago
9
Added most-used image functions to `image` module
#68
rgaudin
closed
3 years ago
1
Update image presets
#67
satyamtg
closed
3 years ago
1
Add high quality presets for WebM and Mp4 encoding
#66
satyamtg
closed
3 years ago
3
Handle wrong extension in optimize_image()
#65
satyamtg
closed
3 years ago
4
Image optimization if the image extensions are incorrect
#64
satyamtg
closed
3 years ago
2
Comprehensive benchmark of image presets
#63
rgaudin
opened
3 years ago
7
Add wait parameter in YoutubeDownloader.download()
#62
satyamtg
closed
3 years ago
1
YoutubeDownloader context manager option doesn't achieve parallelism
#61
satyamtg
closed
3 years ago
0
Prevent import errors while importing PIL.Image
#60
satyamtg
closed
3 years ago
1
Add YoutubeDownloader class in download.py
#59
satyamtg
closed
3 years ago
3
Update all scrapers to black 20 formatting
#58
rgaudin
closed
3 years ago
3
Newer black formatting
#57
satyamtg
closed
3 years ago
1
also expecting -6 retcode on test
#56
rgaudin
closed
3 years ago
1
fixed image tests file paths
#55
rgaudin
closed
3 years ago
1
Fix rewriting of links with empty target
#54
satyamtg
closed
3 years ago
1
Links without target not written properly
#53
satyamtg
closed
3 years ago
0
Check dependencies for pip's new dep resolver
#52
rgaudin
closed
11 months ago
9
Use lxml parser with BeautifulSoup
#51
satyamtg
closed
3 years ago
2
HTML tree changes while creating ZIM with make_zim_file()
#50
satyamtg
closed
3 years ago
8
Replaced imaging module with exploded image module
#49
rgaudin
closed
3 years ago
1
prevent all special schemes in links rewriting
#48
rgaudin
closed
3 years ago
1
Do not rewrite mailto: links while creating ZIM
#47
satyamtg
closed
3 years ago
1
mailto: links being rewritten while creating ZIM
#46
satyamtg
closed
3 years ago
2
Use WebP in scrapers
#45
rgaudin
closed
3 years ago
2
Better defaults for convert_image()
#44
rgaudin
closed
3 years ago
2
Deal with inconsistencies in magic mime detection
#43
satyamtg
closed
3 years ago
2
Rewrite links from poster attribute of audio element when creating ZIMs
#41
satyamtg
closed
3 years ago
1
Fixed StaticArticle usage with content=
#40
rgaudin
closed
3 years ago
1
Use mimetypes guessed from filenames for text files
#39
satyamtg
closed
3 years ago
1
Wrong mimetype for CSS file
#38
satyamtg
closed
3 years ago
3
SVG images do not display on kiwix-serve/chrome extension
#42
satyamtg
closed
3 years ago
4
Get language from HTML
#37
satyamtg
closed
3 years ago
1
Add image optimization
#36
satyamtg
closed
3 years ago
8
Find language from HTML
#35
rgaudin
closed
3 years ago
0
Fix HTML link rewriting and CSS link rewriting
#34
satyamtg
closed
3 years ago
6
Automatically redirect to articles with same checksum
#33
satyamtg
opened
3 years ago
7
Fix ZIM URL rewriting on node without expected attribute
#32
satyamtg
closed
3 years ago
1
New zim API based on pylibzim
#31
rgaudin
closed
3 years ago
6
Add S3 based optimization cache support
#30
satyamtg
opened
4 years ago
1
YouTube downloader integration in scraperlib
#29
satyamtg
closed
3 years ago
1
Return headers in save_file
#28
satyamtg
closed
3 years ago
3
Enable returning headers in download module
#27
satyamtg
closed
3 years ago
1