Open kltm opened 4 years ago
@kltm at some point I think I will need a lot more information to contribute here. :-) Right now I don't understand what this has to do with the ontology makefile. Does this result from the ontology build running as root
? https://github.com/geneontology/pipeline/blob/dac947b73cfc9cfd39c832ac0699def334522bde/Jenkinsfile#L382
In my Phenoscape build I took that out and it works much better for me, having all the files created and owned by the Jenkins agent.
@balhoff No worries--I just wanted to loop you in as we had talked about it before. I think that option "3" there is the likely way forward and there would be nothing for you to do for that.
Good point about the permissions there. IIRC, the reason I have that is that, without the root mapping there, there is a different set of permission strangenesses that come from an unknown user in the image creating files on the external filesystem. It might be worth it to explore that again though before setting off on the more complicated solutions. Added to the list.
This problem annoys me so much.. I struggle a lot with this in monarch as well. I will keep listening here what solutions are found :)
@matentzn There are a few things I want to try, guaranteed to work, but they all operate at different levels of irritating complexity, coordination, or permission. The easiest/simplest things I've been looking at recently have been to try and ensure that there is no information leakage (like the solr build images), but in this case it's a little harder to control. My hope is to find a command that tells Jenkins not to mount the default workspace for the agent when running--that would solve pretty much everything. Still looking...
Noting recent rash of these for us.
Instance of on 2020-10-12 snapshot
run (from inability to fully clean on 2020-10-11 run).
Instance of on 2020-10-24 snapshot
run (likely from 24hr+ runtime due to resource strangle with other testing).
Instance of on 2020-10-30 snapshot
run (unknown reason).
Instance of on 2020-11-17 snapshot
run (likely from series of unfortunate events, including an upstream EBI FTP failure).
Instance of on 2020-12-17 snapshot
run. Noting exactly a month from last time. Also, a fix went in a few days ago (although there were two good builds in between).
Instance of on 2020-12-19 snapshot
run. It then "healed" itself, which I think might be a first...
Instance of this on 2020-01-09 snapshot
run. Cleaning.
Instance of this on 2020-01-13 snapshot
run. Cleaning.
Instance of this on 2020-01-15 snapshot
run. Cleaning.
Occurrence on issue-noctua-models-170-zfin-import-test
(tagging @sierra-moxon ).
I suspect that there are no further interesting patterns here, so I'm going to stop listing failures.
I'm thinking that it may be better to treat this as a sub-issue of #139 if we can get a non-root docker setup to correctly handle permissions.
Recently, we've had a rash of ontology build failures around errors like:
This is caused by the usual jenkins/docker permissions misalignment. Possible solutions are:
This kind of thing may indicate that a general in-Jenkins solution for #139 is not possible.
Also noting that it mostly seems to be having issue with the /target directory.
Tagging @balhoff