Open jsheunis opened 9 months ago
Thanks again, I will answer the questions here.
More clarifying questions:
So inside of the rawdata subdataset on the acquisition computer you run:
datalad save
datalad update --merge
datalad push
correct?
- Can I ask why you run the update --merge?
- Are you making changes to the raw data subdataset at any other location/clone than the acquisition computer?
The only changes to this dataset are made on the acquisition computer. I was told to always update just in case, to avoid conflicts, and assumed that in the worst case the update would simply be redundant.
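For reference, the three-step routine discussed above could be wrapped so that a failing step stops the sequence instead of silently continuing. This is only a sketch: the function name sync_rawdata and the default commit message are made up for illustration, and it assumes it is run from inside the rawdata subdataset with a push target already configured.

```shell
# Hypothetical helper (name and default message are assumptions, not part of DataLad).
# Runs the same three commands from the thread, stopping at the first failure.
sync_rawdata() {
    msg="${1:-Update raw data}"
    datalad save -m "$msg" &&      # record local changes
    datalad update --merge &&      # fetch and merge changes from the sibling
    datalad push                   # publish to the configured sibling
}
```

Calling sync_rawdata "Add session 12" would then save, update, and push in one go, and return a non-zero exit status if any step fails.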
Thanks for the additional info. It's still difficult to piece together precisely what happened. I have made a few attempts at recreating the situation you describe (in a dataset hierarchy with a sibling on Gin, using mv, git mv, and rm on directories or subdatasets, followed by save, update --merge, and push), but I have not observed this issue yet. This is likely because some details needed for a reproducer are simply missing. I'm looking forward to investigating this more closely in an office hour, where we can exchange relevant information in real time!
Follow-up from the office hour: We had a productive screen-sharing session in which everyone got quite confused by what we saw. Here are a few facts:
The acquisition computer (Windows) saves and restructures files and regularly pushes to a Gin sibling; a clone on a Mac pulls updates from Gin.
The Gin web interface has a bug: folders created and pushed from a Windows machine, and later renamed and pushed again, are not removed from the web interface's index. In this minimal reproducer, "folder" was renamed to "newname"; "folder" should no longer exist in the web interface, but it lingers. (Overall: confusing, but with no impact on the clone.)
The local clone on the Mac was in a convoluted state (we couldn't figure out how it got there, but it was a mix of a very updated index, a detached HEAD, and unmerged branches; the Gin confusion likely contributed to that). Also, the repository reported a background garbage collection process that looked a bit shady. And finally, an iCloud backup process created duplicated files (HEAD 2, index 2, ...) in the .git/ directory.
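To check whether a clone has picked up such duplicates, something along these lines could help. It is a rough sketch that assumes the duplicates always follow the "name, space, number" pattern we saw (HEAD 2, index 2); the function name find_numbered_duplicates is made up for illustration, and the function only lists candidates without deleting anything.

```shell
# Hypothetical helper: list entries under .git/ whose names end in " <number>",
# e.g. "HEAD 2" or "index 2". Assumes that naming pattern; nothing is removed.
find_numbered_duplicates() {
    repo="${1:-.}"
    find "$repo/.git" -name '* [0-9]*' -print \
        | grep -E ' [0-9]+$' \
        | sort
}
```

Run as find_numbered_duplicates /path/to/clone; empty output means no file matching this pattern was found. Since recloning is what fixed things for us, treat any hits as a sign the clone should be inspected rather than as a list of files to clean up by hand.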
Recloning the repository from Gin fixed the issue.
We left with the following recommendations:
General:
Helpers we recommended:
Origin: Office Hour chatroom message
Description
User reported:
@adswa asked to confirm that:
User answer:
Next steps
TODO (not necessarily to be performed in this order)