Closed jgallowa07 closed 2 years ago
whoops, yes, #82 is resolved, i added a comment.
ok I've switched the partis .gitmodules to https.
I don't seem to have updated this issue at the time, but I uncommented those steps here a while ago since unfortunately they weren't optional.
The underlying issue (lm.fit()) was called by coda::effectieSize()
in run_bootstrap_asr_ess.R, which was calculates ess values for every column in ess.data. The crash was caused by a single NaN/inf value at the top of the LHLogLikelihood column, which I'm guessing maybe was there since this test data has a very small number of steps or something? Note that this crash only happened on this test data, at least that I'm aware of. Anyway did not investigate, just removed rows (with printed warning) with any nan/inf values which seems to fix it.
I believe this error may be out of date. I will use this issue to document how my build experience is going - and potential solutions I have found so far.
background
To summarize the pipeline of things going on here, we have a github action here which generally does the following:
Additionally, there is a build trigger in the quay repo which is triggered by push events. I believe this probably un-necessary as it's a duplication of image build efforts - for now it is toggled off anyway. So if the action builds, we should be good to go on the quay container.
Github actions
Problem 1: submodules
Github submodules failing -> as with here
Potential Solution 1 We need to change the .gitmodules in partis to point towards the
https
accessable code. Currently, jared-test branch just points towards a partis I personally cloned and modified. @psathyrella can you handle making this change in the actual partis? This should be done in the branch which linearham is using as a submodule -> currently this to do this update the .gitmodules file to look like this ->Potential Solution 2 We could do this for ssh -> https://github.com/actions/checkout/issues/116#issuecomment-644419389 .
problem 2: scripts/run_bootstrap_asr_ess.R
When running the test.sh in the built container, we get the following error
Up to this point, I have not figured out exactly what is causing this. I'm not even sure where the lm() function is being called.
Hacky solution
Simply comment out all Scons targets that involve this dependency. I'm no expert with Scons but it looks to be quite important in testing -> here's all that got commented out.
We'll certainly want to com back to this and figure out what the problem is at some point. For now, I've tested the gh action build and push with jared-test branch and we seem to be okay. Once the partis submodules are updated I can update my branch's partis, and submit a PR for a clean build on master (should be changed to main -> )