bbcarchdev anansi issues

bbcarchdev / anansi

A Linked Open Data Web crawler

https://bbcarchdev.github.io/anansi/

Apache License 2.0

0 stars, 0 forks

issues

Cluster re-balancing should periodically refresh the directory

#20 nevali opened 9 years ago, 0 comments

Cluster re-balancing doesn't check validity of directory

#19 nevali closed 7 years ago, 0 comments

Use jansson instead of jsondata

#18 nevali closed 7 years ago, 0 comments

Process robots.txt

#17 nevali opened 9 years ago, 0 comments

Fetch /.well-known/void when a new root is added

#16 nevali opened 9 years ago, 0 comments

Allow certain URLs to be automatically added to the queue when a new root is added

#15 nevali opened 9 years ago, 0 comments

LOD processor acceptable licence list should be configurable

#14 nevali closed 9 years ago, 0 comments

Process Link: HTTP response headers

#13 nevali closed 9 years ago, 0 comments

crawler-add incorrectly reports its own name as 'crawld-add'

#12 nevali closed 9 years ago, 0 comments

crawld queue implementations should be loadable modules

#11 nevali opened 9 years ago, 0 comments

crawld-add triggers crash due to underlying memory leak

#10 nevali closed 9 years ago, 0 comments

Enable use of etcd for on-the-fly rebalancing

#9 nevali closed 9 years ago, 1 comment

Add an -l (list) option to crawler-add

#8 nevali closed 9 years ago, 0 comments

Add a -f (force) option to crawler-add

#7 nevali closed 9 years ago, 1 comment

Add a FORCE state which causes 'refresh without checking cache' action

#6 nevali closed 9 years ago, 0 comments

Resources are being added which do not meet the processing criteria

#5 nevali opened 9 years ago, 0 comments

Add init script to debian control scripts

#4 nevali closed 9 years ago, 1 comment

Confirm behaviour when adding a URL containing a fragment

#3 nevali closed 9 years ago, 1 comment

LOD processor doesn't need to perform its own same-origin check

#2 nevali closed 9 years ago, 0 comments

crawler-add segfaults if invoked with no parameters

#1 nevali closed 9 years ago, 0 comments