issues
search
gbif
/
crawler
The crawling pieces - ws, cli, coordinator
Apache License 2.0
4
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
CamtrapDP datasets replacing DWCA datasets cannot be crawled
#68
MattBlissett
opened
4 months ago
1
#66 Extract camtrap contributors from datapackage.json
#67
mike-podolskiy90
closed
8 months ago
0
Extract camtrap contributors from `datapackage.json` and update dataset
#66
mike-podolskiy90
closed
8 months ago
0
Bump ch.qos.logback:logback-classic from 1.2.9 to 1.2.13
#65
dependabot[bot]
opened
10 months ago
0
Bump ch.qos.logback:logback-core from 1.2.9 to 1.2.13
#64
dependabot[bot]
opened
10 months ago
0
Bump ch.qos.logback:logback-classic from 1.2.9 to 1.3.12
#63
dependabot[bot]
closed
10 months ago
1
Bump ch.qos.logback:logback-core from 1.2.9 to 1.3.12
#62
dependabot[bot]
closed
10 months ago
1
Configure Renovate - autoclosed
#61
renovate[bot]
closed
2 months ago
0
Bump com.rabbitmq:amqp-client from 4.8.0 to 5.18.0
#60
dependabot[bot]
opened
11 months ago
0
Because of current BioCASE metadata mapping, dataset owner names aren't included in citation
#59
ManonGros
opened
1 year ago
3
& characters in dataset titles are not correctly escaped in BioCASe requests.
#58
MattBlissett
closed
1 year ago
0
Pipelines 803
#57
fmendezh
closed
1 year ago
0
Bump commons-beanutils from 1.9.2 to 1.9.4
#56
dependabot[bot]
opened
2 years ago
0
Bump jackson-databind from 2.11.3 to 2.12.6.1
#55
dependabot[bot]
opened
2 years ago
0
Bump logback-core from 1.2.3 to 1.2.9
#54
dependabot[bot]
closed
10 months ago
1
Bump hadoop-common from 2.6.0-cdh5.16.2 to 3.2.3
#53
dependabot[bot]
opened
2 years ago
0
Restrict datasets to be crawled based on certain rules
#52
fmendezh
opened
2 years ago
0
Bump hadoop-common from 2.6.0-cdh5.16.2 to 2.10.1
#51
dependabot[bot]
closed
2 years ago
1
Bump log4j-api from 2.3 to 2.16.0
#50
dependabot[bot]
closed
2 years ago
0
Bump log4j-api from 2.3 to 2.15.0
#49
dependabot[bot]
closed
2 years ago
1
Bump httpclient from 4.5.6 to 4.5.13
#48
dependabot[bot]
closed
2 years ago
0
Bump commons-io from 2.5 to 2.7
#47
dependabot[bot]
closed
2 years ago
1
BiocaseMetadataSynchroniser: Ignore unknown namespaces
#46
snsb-seifert
closed
2 years ago
6
Upgrade to Spring 2.3.7.RELEASE
#45
fmendezh
closed
1 year ago
2
Pipelines process running endpoint with search capabilities shouldn't have all the parameters as required
#44
marcos-lg
closed
3 years ago
0
Add machineTag capturing the extensions used in the dataset
#43
MortenHofft
opened
3 years ago
0
The monitoring API for running processes shows always attempt 0
#42
marcos-lg
closed
3 years ago
0
Bump junit from 4.12 to 4.13.1
#41
dependabot[bot]
closed
3 years ago
1
"crawl cleanup" command
#40
MattBlissett
closed
1 year ago
1
DELETE call on ingestion run doesn't work
#39
MattBlissett
closed
1 year ago
1
DwCA: HTTP Redirect 308 (Moved Permamently) fails.
#38
snsb-seifert
closed
4 years ago
0
Sequential ingestion attemps can run in parallel
#37
MattBlissett
closed
1 year ago
1
Add logging MDC for attempt
#36
MattBlissett
closed
4 years ago
0
Bump zookeeper from 3.4.5 to 3.4.14
#35
dependabot[bot]
closed
4 years ago
1
Pipelines history with executions
#34
marcos-lg
closed
4 years ago
0
Embedded search
#33
marcos-lg
closed
4 years ago
0
wish: allow a registry admin to force a crawl, regardless of the date of the last run
#32
ahahn-gbif
opened
4 years ago
2
ZK cache for pipelines monitoring
#31
marcos-lg
closed
4 years ago
0
adding spark history config to local spark runners
#30
fmendezh
closed
4 years ago
0
Pipelines process status
#29
marcos-lg
closed
5 years ago
0
BioCASE dataset has same crawl ID spread across different dates
#28
jlegind
closed
4 years ago
2
Endpoint type for pipelines
#26
marcos-lg
closed
5 years ago
0
suggest to add more tests for json serializer
#25
jhpoelen
opened
5 years ago
0
Improve overcrawl tracking for BioCASe datasets
#24
MattBlissett
opened
6 years ago
3
#105 changes for migrating to spark2
#23
aalbatross
closed
5 years ago
0
94 execute data interpretation beam pipelines programatically
#22
muttcg
closed
6 years ago
0
#92 command to trigger interpretation pipeline when dataset as avro i…
#21
aalbatross
closed
6 years ago
1
#91 Publishing ExtendedRecordAvailable Message when the convertion to…
#20
aalbatross
closed
6 years ago
1
Crawl scheduler stops working after a few days
#19
MattBlissett
closed
2 years ago
4
DiGIR crawls which always fail not marked as failed crawls
#18
MattBlissett
opened
6 years ago
0
Next