issues
search
c4software
/
python-sitemap
Mini website crawler to make sitemap from a website.
GNU General Public License v3.0
362
stars
110
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Images from different domains should not be added to sitemap
#41
ghost
closed
7 years ago
0
Suggestion: Not parseable resources ->parseable resources
#40
ghost
opened
7 years ago
1
Online interface for the script
#39
ghost
closed
6 years ago
3
Double entry if 2 slashes in the url
#38
ghost
closed
7 years ago
3
Endless loop part 2: report error and document workaround
#37
ghost
opened
7 years ago
0
Crawl fails to find one page
#36
ghost
opened
7 years ago
0
UnicodeDecodeError possibly with Scandinavian letters
#35
ghost
opened
7 years ago
0
Endless loop fix
#34
ghost
closed
7 years ago
1
UnicodeDecodeError possibly with Scandinavian letters
#33
ghost
closed
7 years ago
4
Duplicate entry
#32
ghost
closed
7 years ago
3
Improvement proposal: video support
#31
ghost
opened
7 years ago
0
Read the image title / alt
#30
c4software
opened
7 years ago
0
Image Licence
#29
c4software
closed
5 years ago
0
Add options to pretty print the output XML
#28
c4software
closed
7 years ago
3
Tracker images are included
#27
ghost
closed
7 years ago
2
IMG Data URI and image license
#26
ghost
closed
7 years ago
1
Slash missing in URL
#25
ghost
closed
7 years ago
3
More robust link regex
#24
Garrett-R
closed
7 years ago
1
HTTPS urls
#23
wernerb90
closed
7 years ago
2
Image Sitemap?
#22
wernerb90
closed
7 years ago
22
Add Dockerfile
#21
sebclick
closed
7 years ago
0
Merge pull request #6 from c4software/master
#20
sebclick
closed
7 years ago
0
Crawler.py giving error
#19
KKS161994
closed
7 years ago
3
Verbose output and fix robots.txt bug
#18
Garrett-R
closed
7 years ago
2
gpl
#17
etw3gh
closed
7 years ago
1
Make link regex ignore other attributes
#16
Garrett-R
closed
8 years ago
1
Question about Sitemap
#15
cocojambo89
closed
8 years ago
4
Stack overflow error
#14
dchaplinsky
closed
10 years ago
3
Add link that caused 404 to the sitemap
#13
kevinburke
closed
11 years ago
3
resolved: "Invalid date An invalid date was found. Please fix the date o...
#12
ghuntley
closed
11 years ago
1
patch for response error
#11
eneagu
closed
11 years ago
5
patch for <lastmod> in sitemap
#10
eneagu
closed
11 years ago
2
Ajout compteur temps total
#9
sebclick
closed
12 years ago
0
Ajout de l'option --report
#8
sebclick
closed
12 years ago
0
Ajout paramètre "drop" et résumé du crawler à la fin
#7
sebclick
closed
12 years ago
0
Correction Issue #3 et Issue #5
#6
sebclick
closed
12 years ago
0
URL en erreur 404 affichée dans le sitemap
#5
sebclick
closed
12 years ago
1
Correction Issue #3
#4
sebclick
closed
12 years ago
0
Soucis sur le nombre de lien avec code
#3
c4software
closed
12 years ago
3
Ajout d'info de log sur le nb de retour HTTP par code
#2
sebclick
closed
12 years ago
0
Ajout de l'option --exclude
#1
sebclick
closed
12 years ago
0
Previous