issues
search
AusDTO
/
disco_layer
Code, outputs and Information relevant to the discovery layer.
1
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
crawler: url encoding of query parameters
#36
nokout
closed
9 years ago
1
crawler: add a document hash field
#35
nokout
closed
9 years ago
1
acnc.gov.au is currently excluded
#34
nokout
closed
9 years ago
1
crawler: Expected redirect but getting 599
#33
nokout
closed
9 years ago
1
crawler: Stripping of query params
#32
nokout
closed
9 years ago
3
automated tests for the node stuff
#31
monkeypants
closed
9 years ago
1
content cagefight API presentation
#30
monkeypants
opened
9 years ago
4
haystack/solr version compatability
#29
monkeypants
closed
9 years ago
1
configure disco_service to use celery-haystack
#28
monkeypants
closed
9 years ago
1
jenkins job for disco_service
#27
monkeypants
closed
9 years ago
1
purge orientdb references
#26
monkeypants
closed
9 years ago
2
Production prep
#25
nokout
closed
9 years ago
0
orientdb trigger and function to handle changes
#24
nokout
closed
9 years ago
1
duplicate keys in url index
#23
nokout
closed
9 years ago
1
only close database after queries are done.
#22
nokout
closed
9 years ago
2
Rotate Log Files
#21
nokout
closed
9 years ago
2
Production prep
#20
nokout
closed
9 years ago
0
Code cleanup
#19
nokout
closed
9 years ago
0
Add build tests for node module
#18
nokout
closed
9 years ago
1
Fix Logging
#17
nokout
closed
9 years ago
0
Load JSON samples of service description document
#16
nokout
closed
9 years ago
1
Create a sample of user assertions about content.
#15
nokout
opened
9 years ago
5
make AST (graph) of cleaned content
#14
monkeypants
closed
9 years ago
3
Prevent crawling job from deleting content
#13
nokout
closed
9 years ago
2
Link AST Graph to service documents using assertions
#12
nokout
closed
9 years ago
1
normalise/decruftify content
#11
monkeypants
closed
9 years ago
3
use bookmarklet to record assertions about pages
#10
nokout
opened
9 years ago
3
generate enhanced document
#9
nokout
closed
9 years ago
1
Post enchanced document to solr
#8
nokout
closed
9 years ago
2
Crawl Server Errors
#7
nokout
closed
9 years ago
1
No database error handling
#6
nokout
closed
9 years ago
1
Fetch Condition - Due Date Callback fail
#5
nokout
closed
9 years ago
1
Exclude state based domains
#4
nokout
closed
9 years ago
1
simplecrawler (Crawler.prototype.domainValid) Hack
#3
nokout
closed
9 years ago
1
Node/crawl/crawl.js - Externalise config
#2
nokout
closed
9 years ago
2
Create genService.js
#1
nokout
closed
9 years ago
0
Previous