issues
search
ucldc
/
harvester
Harvester for the ucldc solr index. Pushes content into the raw solr index.
BSD 3-Clause "New" or "Revised" License
3
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update rq & redis to latest versions
#320
mredar
closed
7 years ago
0
Check that <=1000 records fetched from XML file
#319
mredar
closed
7 years ago
0
limit messge to 100k chars
#318
mredar
closed
7 years ago
0
dh-q and jobs queueing bug fixes
#317
mredar
closed
7 years ago
0
Add -y flag for non-interactive run
#316
mredar
closed
7 years ago
0
Add some output for sync
#315
mredar
closed
7 years ago
0
XML fetcher: harvest 1000 records at a time from large file
#314
mmmmatthew
closed
7 years ago
0
solr update bug fix, add more msgs to deep queueZZ
#313
mredar
closed
7 years ago
0
Harvest components as well.
#312
mredar
closed
7 years ago
0
bug fix
#311
mredar
closed
7 years ago
0
Deep harvest by queuing a bunch of single object jobs
#310
mredar
closed
7 years ago
0
Add number fetched, make coverage-cmds exe
#309
mredar
closed
7 years ago
0
More explicit image harvesting report to SNS & Slack
#308
mredar
closed
7 years ago
0
Use trusty build.
#307
mredar
closed
7 years ago
0
save objsets to s3
#306
mredar
closed
7 years ago
0
simplify code
#305
mredar
closed
7 years ago
0
bug fix
#304
mredar
closed
7 years ago
0
Redact object_auth data.
#303
mredar
closed
7 years ago
0
args to rq needs to be a iterable
#302
mredar
closed
7 years ago
0
Point to mredar nuxeo-calisphere, fix path arg
#301
mredar
closed
7 years ago
0
Add single object deep harvest
#300
mredar
closed
7 years ago
0
XML fetcher: cast obj_mdata defaultdict object to dict
#299
mmmmatthew
closed
7 years ago
0
Go to "vertical" style to help avoid conflcts
#298
mredar
closed
7 years ago
0
Generic XML fetcher for Center for Sacramento History
#297
mmmmatthew
closed
7 years ago
0
make sort & wt overrideable
#296
mredar
closed
7 years ago
0
Make timeout big
#295
mredar
closed
7 years ago
0
New requests based solr fetcher for more flexible usage
#294
mredar
closed
7 years ago
0
Add num saved to report for metadata harvest
#293
mredar
closed
7 years ago
0
Better formatted SNS Slack messages
#292
mredar
closed
7 years ago
0
Add the worker emoji
#291
mredar
closed
7 years ago
0
Nicer formatted Slack messages for start/stop jobs
#290
mredar
closed
7 years ago
0
Uncomment publish lines
#289
mredar
closed
7 years ago
0
Add RQ exception handler to report to SNS topic
#288
mredar
closed
7 years ago
0
put worker ip first in message
#287
mredar
closed
7 years ago
0
use logging in sns_message
#286
mredar
closed
7 years ago
0
Pin md5s3stash to commit 7c32a3270198ae9b84f22a4852fe60105f74651b
#285
mredar
closed
7 years ago
0
Try GET if HEAD fails for testing type of content
#284
mredar
closed
7 years ago
0
Fix bug when sourceResource/title is just a string
#283
mredar
closed
7 years ago
0
Raise is status code != 200
#282
mredar
closed
7 years ago
0
Does it fail the "is image test"
#281
mredar
closed
7 years ago
0
add newline
#280
mredar
closed
7 years ago
0
Better message from image harvest
#279
mredar
closed
7 years ago
0
print to stderr for errors
#278
mredar
closed
7 years ago
0
Handle boto exception ClientError from mediajson test
#277
mredar
closed
7 years ago
0
Remove unused code
#276
mredar
closed
7 years ago
0
Fix call to publish_to_harvesting
#275
mredar
closed
7 years ago
0
Use MediaJson to check media json. Improve reporting
#274
mredar
closed
7 years ago
0
Check for Nuxeo deep harvest products
#273
mredar
closed
7 years ago
0
BAMPFA provenance mapping
#272
mmmmatthew
closed
7 years ago
1
Remove self called image_harvest
#271
mredar
closed
7 years ago
0
Previous
Next