issues
search
ucldc
/
rikolti
calisphere harvester 2.0
BSD 3-Clause "New" or "Revised" License
7
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Investigate issue with UCR Library encountering EZID errors
#1158
aturner
opened
17 hours ago
1
oai.pspl `fetch_collection` error: `403 Client Error: Forbidden for url` - is this a data provider issue?
#1157
christinklez
closed
5 days ago
1
Spec for metadata quality report for all CSphere collections (e.g., Dates, Type, Rights)
#1156
aturner
opened
5 days ago
0
Add subfield e for creator
#1155
barbarahui
closed
1 week ago
0
`marc.ucb_tind` validation results
#1154
barbarahui
closed
1 week ago
11
Fetcher retries
#1153
barbarahui
closed
1 week ago
4
Update dynamic task mapping tasks to name all our mapped tasks
#1152
amywieliczka
opened
1 week ago
1
Add validation to contentdm for is_shown_at and by
#1151
barbarahui
closed
2 weeks ago
0
TIND `validate_by_mapper`'s throws 503 errors when fetching batch jobs; possibly does not like being hit in quick succession (running individual collections works fine) - implement retries globally for the fetcher
#1150
christinklez
closed
1 week ago
2
Remove couchdb env vars
#1149
barbarahui
closed
3 weeks ago
0
Rikolti `validate_by_mapper_type` dag is in a success state, but doesn't produce any validation reports
#1148
christinklez
closed
3 weeks ago
1
Generate and review validation reports: marc.ucb_tind
#1146
christinklez
closed
1 week ago
3
Whittle down the `type information not supplied` records; contact contributors about collections missing `type` values & ask them to add `type` information or if they would like to use the Registry` fill option
#1145
christinklez
opened
3 weeks ago
0
Added type hints, use asdict to serialize dataclasses
#1144
amywieliczka
closed
3 weeks ago
1
Rikolti `validate_by_mapper_type` dag fails during the `map_endpoint` task: `TypeError: Object of type MappedCollectionStatus is not JSON serializable`
#1143
amywieliczka
closed
3 weeks ago
0
ucb_tind_mapper
#1142
barbarahui
closed
3 weeks ago
0
Investigate Nuxeo etags
#1141
barbarahui
closed
3 weeks ago
1
ETag checks: allow redirects and add Nuxeo workaround
#1140
barbarahui
closed
1 month ago
0
Aggressive caching for Nuxeo only
#1139
amywieliczka
closed
1 month ago
2
Aggressive Caching
#1138
amywieliczka
closed
1 month ago
0
Allow for failure of media component derivative creation
#1137
barbarahui
closed
1 month ago
0
Content cache
#1136
amywieliczka
closed
1 month ago
0
Delete tmpfiles
#1135
barbarahui
closed
1 month ago
0
Nuxeo `content_harvest` issues for #26713 - `OSError: [Errno 28] No space left on device`
#1134
christinklez
closed
1 month ago
0
Nuxeo `content_harvest` issues for #27124 - various error messages
#1133
christinklez
closed
1 month ago
2
Content DM mapper issues
#1132
barbarahui
closed
1 month ago
0
Strip date string after splitting on ;
#1131
barbarahui
closed
1 month ago
0
Change ECSRunTask Operator's network config to use public subnets
#1130
amywieliczka
closed
1 month ago
0
Bad Dates: Roll all this up into one try block
#1129
amywieliczka
closed
1 month ago
0
Airflow version upgrade to 2.8.1
#1128
amywieliczka
closed
1 month ago
0
AttributeError: `str' object has no attribute 'get'`
#1127
gamontoya
closed
1 month ago
2
Supersized Nuxeo reharvest has `content_harvesting` errors: `OSError: [Errno 28] No space left on device` & `cache resources exhausted` & `Failed to seek in the stream`
#1125
christinklez
closed
1 month ago
0
Content harvest rm tmp files
#1124
barbarahui
closed
1 month ago
5
Don't use default value when doing dict lookup for 'thumbnail_source'
#1123
barbarahui
closed
1 month ago
2
Nuxeo `content_harvesting` error: `AttributeError: 'dict' object has no attribute 'decode'`
#1122
barbarahui
closed
1 month ago
2
One RIKOLTI_DATA to rule them all
#1121
amywieliczka
closed
1 month ago
0
Don't raise 409 errors from OpenSearch
#1119
amywieliczka
opened
1 month ago
1
Pass configured cache along to content_harvester container
#1118
amywieliczka
closed
1 month ago
1
OS Schema is nested under index name
#1117
amywieliczka
closed
1 month ago
1
Relax content cache: require ETag OR Last-Modified, not both
#1116
amywieliczka
closed
1 month ago
1
Airflow Version Upgrade to 2.9.2
#1115
christinklez
closed
3 weeks ago
1
Image Cache
#1114
amywieliczka
closed
1 month ago
0
Audit existing mappers for bespoke date logic
#1113
amywieliczka
closed
1 month ago
5
Add `pytest metadata_mapper` to CI script
#1112
amywieliczka
opened
2 months ago
0
Create technical documentation describing the kinds of dates Rikolti can parse, date best practices and recommendations - for developers
#1111
christinklez
opened
2 months ago
0
Add date data to existing Calisphere records
#1110
amywieliczka
closed
2 months ago
1
Investigate cacheing URLs of harvested content for any re-harvests
#1109
christinklez
closed
1 month ago
0
Investigate options to re-map from previously fetched vernacular data & recycle (re-pair) previously harvested content
#1108
christinklez
closed
1 month ago
0
Reharvest collections to pick up date/decade enrichments: non-Nuxeo sources + Nuxeo sources (check w/ campuses)
#1107
christinklez
opened
2 months ago
1
Create documentation describing the kinds of dates Rikolti can parse, date best practices and recommendations - for contributors
#1106
amywieliczka
opened
2 months ago
1
Next