Open ebezzi opened 1 year ago
@ebezzi / @atolopko-czi : Close this as a duplicate of #5682 ?
This was a particular problem, performance-wise, for CI/CD GHAs that were fetching full git histories. I believe they're now only fetching 1 or 2 levels, so it shouldn't be a problem for CI now. But if so, this is a larger reason to fix.
Developers' local git repo clones are not checked out "fresh" very frequently. However, I have run into large updates for even "git pulls", though maybe that was only after large data files had been updated.
Our data portal git repo is much bigger than it needs to be, to the point where cloning it takes several minutes. This is a problem for GHA and for developers' productivity. To solve this issue, we should:
Note that as a mitigation strategy, it is possible to specify
--single-branch
when cloning the data portal, but this should only be used as an emergency.