Open artntek opened 3 months ago
For the Testing section, here's a quick rundown:
1. Have `dataone` installed
2. Clone `rdataone` and (probably) switch to the develop branch, depending on what you need to test
3. Open `rdataone/dataone.RProj` in RStudio
4. Run `install.packages(c('remotes', 'devtools'))`
5. Run `devtools::load_all()` to load the current `dataone` library code for testing
6. Run `remotes::install_deps()` to install all of the package dependencies
7. Log in at `https://dev.nceas.ucsb.edu` and copy your token for R from the web UI; paste the token options command into the R console and run it
8. Run `devtools::test()` to run the original tests against standard nodes
9. Set `d1cTestKNB <- D1Client("STAGING", "urn:node:mnTestKNB")` to use a different node for CRUD tests
10. Re-run `devtools::test()` as described above

First conversion (with errors) took almost exactly 48 hours
11/19/24: Second conversion (comprising only the failed objects from last time) took 42 minutes (Douglas Adams would approve)
see https://github.com/NCEAS/metacat/issues/1964#issuecomment-2484552430 for error analysis
brooke@arctica:~$ time sudo rsync -aHAX --delete /var/lib/postgresql/ /mnt/ceph/repos/$NAME/postgresql/
real 60m57.679s
user 1m5.106s
sys 4m33.743s
time sudo rsync -rltDHX --stats --human-readable /var/metacat/data/ /mnt/ceph/repos/$NAME/metacat/data/
real 29m29.133s
user 1m56.979s
sys 6m45.844s
time sudo rsync -rltDHX --stats --human-readable /var/metacat/dataone/ /mnt/ceph/repos/$NAME/metacat/dataone/
real 0m10.742s
user 0m0.037s
sys 0m0.018s
time sudo rsync -rltDHX --stats --human-readable /var/metacat/documents/ /mnt/ceph/repos/$NAME/metacat/documents/
real 0m16.327s
user 0m0.490s
sys 0m1.014s
time sudo rsync -rltDHX --stats --human-readable /var/metacat/logs/ /mnt/ceph/repos/$NAME/metacat/logs/
real 0m0.101s
user 0m0.025s
sys 0m0.016s
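The four `/var/metacat` copies above differ only in the subdirectory name, so the commands can be generated from a loop. This is a minimal sketch, assuming the same rsync flags as the timed runs. The function name `metacat_sync_cmds` and the `arctic` fallback for `NAME` are hypothetical, and the function only prints the commands rather than running them:

```shell
# Print (not run) the timed rsync command for each /var/metacat subdirectory.
# metacat_sync_cmds is a hypothetical helper; $1 is the repo name used for $NAME.
metacat_sync_cmds() {
  repo="$1"
  for sub in data dataone documents logs; do
    echo "time sudo rsync -rltDHX --stats --human-readable /var/metacat/$sub/ /mnt/ceph/repos/$repo/metacat/$sub/"
  done
}

# Assumption: NAME is set as in the commands above; "arctic" is only a placeholder default.
metacat_sync_cmds "${NAME:-arctic}"
```

Reviewing the printed commands before pasting them into a shell is safer on a production host than running the loop directly.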
hashstore conversion started: Wed Nov 20 22:55:15 UTC 2024
Similar to #1932;
checklist:
[x] Work with @nickatnceas to copy production data for testing:
[x] Time how long it takes to...
copy the postgres data (`arcticdata.io:/var/lib/postgresql`) to the PROD ceph volume at `/mnt/ceph/repos/arctic/postgresql` (treat it like a hot backup).
`/var/lib/postgresql/10` directory
[x] copy the following subset of production data from `arcticdata.io:/var/metacat` to the PROD ceph volume at `/mnt/ceph/repos/arctic/metacat`:
Actual Times taken for /var/metacat/data:
real 14286m43.628s user 1131m15.740s sys 3907m38.871s
-> 9.92 days
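The `-> 9.92 days` figure is the `real` value from `time` converted to days. Here is a small sketch of that arithmetic. `to_days` is a hypothetical helper name, and it assumes the minutes-and-seconds format shown in the output above:

```shell
# Convert a `time`-style duration (e.g. 14286m43.628s) into days.
# Splitting on "m" and "s" leaves minutes in $1 and seconds in $2.
to_days() {
  echo "$1" | awk -F'[ms]' '{ printf "%.2f\n", ($1 * 60 + $2) / 86400 }'
}

to_days 14286m43.628s   # real time for /var/metacat/data
```

The same helper works on the shorter rsync timings, although most of those round to a small fraction of a day.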
Follow the Quick Reference: Metacat K8s Installation Steps. Supplementary TODOs below...
Persistent Volumes
- `.../repos/arctic/metacat` for metacat
- `.../repos/arctic/postgres` for postgres
- `prod_cluster/metacatarctic/pvc--metacatarctic-postgres.yaml`
MetacatUI setup
Metacat Config
- `values.yaml` overrides for non-default 2.19 settings (diff arcticdata.io `$TOMCAT_HOME/webapps/metacat/WEB-INF/metacat.properties` with default `metacat.properties` from 2.19 release)
- `values.yaml` overrides for newly-introduced 3.0 settings (diff default `metacat.properties` from 3.0.0 release with default `metacat.properties` from 2.19 release)
First Deployment
[x] Complete steps in "First Install - IMPORTANT IF MOVING DATA FROM AN EXISTING LEGACY DEPLOYMENT" BEFORE first startup!
[x] solr pods not starting. root cause from logs:
SOLVED - was overriding `extraVolumes` values, and the override didn't include the permissions line
[x] `https://arctic-prod.test.dataone.org/catalog/` (trailing slash) works, but `https://arctic-prod.test.dataone.org/catalog` gives a 404 (nginx)
[x] ensure all data and documents files are group writeable (otherwise, hashstore upgrader can't create hard links):
[x] `chown -R 59997:59997` the ceph dir corresponding to `/var/metacat`, and update values.yaml to use this uid:gid
[x] Hostname aliases and rewrite rules
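The group-writeable requirement in the checklist above can be verified before starting the hashstore upgrade. This is a minimal sketch, assuming a POSIX `find`. `not_group_writable` is a hypothetical helper name, and the directory argument is whichever ceph path corresponds to `/var/metacat` in your deployment:

```shell
# List regular files under a directory that lack the group-write bit (mode 020).
# not_group_writable is a hypothetical helper; pass the ceph dir that maps to /var/metacat.
not_group_writable() {
  find "$1" -type f ! -perm -020
}

# Example invocation (commented out because it needs the real volume mounted):
# not_group_writable /mnt/ceph/repos/arctic/metacat
```

An empty result means every regular file already has group write permission; any paths printed are candidates for `chmod g+w`.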
ATTENTION: Still To Do Before Final Deployment
[x] Time hashstore conversion
[ ] Time reindex-all
[ ] MetacatUI + WordPress setup. How do we host it and link to k8s metacat?
[x] ACTION: Ask @nickatnceas for help with letsencrypt certs - do we need to remove `arcticdata.io` from wildcard cert on arctica? NOTE: we still need subdomain certs there (i.e. status.adc, beta.adc).
[ ] Skip 3.0.0 and deploy 3.1.0, but only after it's been running on less-trafficked hosts for a while. See proposed release plan in Issue #1984.
Testing - see Matt's comment below