sul-dlss-deprecated/rialto-etl
ETL tools for RIALTO, Stanford Libraries' research intelligence project
https://library.stanford.edu/projects/rialto
Apache License 2.0
3 stars, 0 forks
issues
#322 | Run one-off full load of publication data in prod (for all time) | peetucket | closed | 5 years ago | 6
#321 | Change schedule to vary per object type | mjgiarlo | closed | 5 years ago | 0
#320 | Fix untested extractor invocation bug | mjgiarlo | closed | 5 years ago | 2
#319 | Support incremental publication extracts | mjgiarlo | closed | 5 years ago | 0
#318 | Install AWS CLI | jcoyne | closed | 5 years ago | 1
#317 | Push organization data to s3 | jcoyne | closed | 5 years ago | 1
#316 | Switches to using English labels for countries. | justinlittman | closed | 5 years ago | 0
#315 | Update ETL logging | aaron-collier | closed | 5 years ago | 0
#314 | Don't use processing threads for sparql load | jcoyne | closed | 5 years ago | 0
#313 | Retry on server error | jcoyne | closed | 5 years ago | 0
#312 | Can't load organizations on production without ConcurrentModificationException | jcoyne | closed | 5 years ago | 0
#311 | Schedule the deploy process for the production server | jcoyne | closed | 5 years ago | 0
#310 | Log errors with honeybadger | jcoyne | closed | 5 years ago | 0
#309 | Remove -labs from readme. | jcoyne | closed | 5 years ago | 0
#308 | Harvest WoS publications en masse | mjgiarlo | closed | 5 years ago | 2
#307 | Update dependencies, fixes secvuln in rack | jcoyne | closed | 5 years ago | 0
#306 | Only adds parent context to specific org names. | justinlittman | closed | 5 years ago | 0
#305 | Setup scheduled loading in prod via cron/whenever | peetucket | closed | 5 years ago | 1
#304 | Remove school name in parens after department name for institutes | peetucket | closed | 5 years ago | 1
#303 | Adjust WoS harvesting so that subsequent updates do not pull data for all time | peetucket | closed | 5 years ago | 0
#302 | Increase backoff time, to be nicer to WoS | jcoyne | closed | 5 years ago | 0
#301 | Add honeybadger for composite ETL | jcoyne | closed | 5 years ago | 0
#300 | Reduce the number of retries and increase interval between to avoid overloading WoS | peetucket | closed | 5 years ago | 0
#299 | Push organization data to S3 | jcoyne | closed | 5 years ago | 1
#298 | Create service account for S3 and cloudwatch | jcoyne | closed | 5 years ago | 1
#297 | Setup whenever for organizations | jcoyne | closed | 5 years ago | 0
#296 | Composite ETL should return non-zero return code on significant errors | justinlittman | closed | 5 years ago | 0
#295 | Changed SERA extractor to log and raise error when http fails. | justinlittman | closed | 5 years ago | 0
#294 | Logs and raises error when Sparql Writer post fails. Composite ETL co… | justinlittman | closed | 5 years ago | 0
#293 | Ignore the default data directory | mjgiarlo | closed | 5 years ago | 0
#292 | Sparql Writer should check for and handle errors | justinlittman | closed | 5 years ago | 0
#291 | Handle entity resolver errors by logging and raising. | justinlittman | closed | 5 years ago | 3
#290 | SERA extractor should log errors as warnings | justinlittman | closed | 5 years ago | 0
#289 | Stop transform on Entity Resolver error | justinlittman | closed | 5 years ago | 0
#288 | [WIP] Thread experiment | justinlittman | closed | 5 years ago | 4
#287 | Log error and not create ndj file when wos extract error occurs. | justinlittman | closed | 5 years ago | 2
#286 | Add GeoNames mappings for "Unmapped country" output | mjgiarlo | closed | 5 years ago | 2
#285 | WOS client should log non-200 responses | justinlittman | closed | 5 years ago | 1
#284 | Investigate using new CAP/Profiles API enhancements | mjgiarlo | opened | 5 years ago | 0
#283 | Add Capistrano for deployment | jcoyne | closed | 5 years ago | 0
#282 | Update to a released version of the rdf library | jcoyne | closed | 5 years ago | 0
#281 | [WIP] Use Sidekiq to handle composite ETL | mjgiarlo | closed | 5 years ago | 3
#280 | Use preferred DOI resolver | mjgiarlo | closed | 5 years ago | 0
#279 | Sets charset to utf-8 in content-type when posting Sparql. | justinlittman | closed | 5 years ago | 0
#278 | Fixes fetch_grant_identifiers so that can handle grant identifiers th… | justinlittman | closed | 5 years ago | 0
#277 | Fix undefined method in fetch_grant_identifiers | justinlittman | closed | 5 years ago | 1
#276 | Update codeclimate ID in Travis build config | mjgiarlo | closed | 5 years ago | 0
#275 | Map DOIs as RDF URIs | mjgiarlo | closed | 5 years ago | 0
#274 | Some publications are not getting indexed. | jcoyne | closed | 5 years ago | 7
#273 | Update to use upstream fix for race condition | jcoyne | closed | 5 years ago | 0