bbcarchdev/anansi
A Linked Open Data Web crawler
https://bbcarchdev.github.io/anansi/
Apache License 2.0
issues
#20 Cluster re-balancing should periodically refresh the directory (nevali; opened 9 years ago; 0 comments)
#19 Cluster re-balancing doesn't check validity of directory (nevali; closed 7 years ago; 0 comments)
#18 Use jansson instead of jsondata (nevali; closed 7 years ago; 0 comments)
#17 Process robots.txt (nevali; opened 9 years ago; 0 comments)
#16 Fetch /.well-known/void when a new root is added (nevali; opened 9 years ago; 0 comments)
#15 Allow certain URLs to be automatically added to the queue when a new root is added (nevali; opened 9 years ago; 0 comments)
#14 LOD processor acceptable licence list should be configurable (nevali; closed 9 years ago; 0 comments)
#13 Process Link: HTTP response headers (nevali; closed 9 years ago; 0 comments)
#12 crawler-add incorrectly reports its own name as 'crawld-add' (nevali; closed 9 years ago; 0 comments)
#11 crawld queue implementations should be loadable modules (nevali; opened 9 years ago; 0 comments)
#10 crawld-add triggers crash due to underlying memory leak (nevali; closed 9 years ago; 0 comments)
#9 Enable use of etcd for on-the-fly rebalancing (nevali; closed 9 years ago; 1 comment)
#8 Add an -l (list) option to crawler-add (nevali; closed 9 years ago; 0 comments)
#7 Add a -f (force) option to crawler-add (nevali; closed 9 years ago; 1 comment)
#6 Add a FORCE state which causes 'refresh without checking cache' action (nevali; closed 9 years ago; 0 comments)
#5 Resources are being added which do not meet the processing criteria (nevali; opened 9 years ago; 0 comments)
#4 Add init script to debian control scripts (nevali; closed 9 years ago; 1 comment)
#3 Confirm behaviour when adding a URL containing a fragment (nevali; closed 9 years ago; 1 comment)
#2 LOD processor doesn't need to perform its own same-origin check (nevali; closed 9 years ago; 0 comments)
#1 crawler-add segfaults if invoked with no parameters (nevali; closed 9 years ago; 0 comments)