deployment-gap-model-education-fund / deployment-gap-model

ETL code for the Deployment Gap Model Education Fund
https://www.deploymentgap.fund/
MIT License
6 stars 2 forks source link

Make repo public #217

Closed bendnorman closed 1 year ago

bendnorman commented 2 years ago

Things to remove from git history:

bendnorman commented 1 year ago

It doesn't look like bfg removes sensitive information from open branches, example. We have 24 open branches that I think would have to be deleted. Many of them have already been merged we just didn't delete the branch.

I think we have two options to really clear out this repo:

  1. Continue with this repo:
    • Delete all merged branches that still have sensitive data.
    • I don't know of any way to force Github to reindex a repo so sensitive information can still come up in search results. We could hope that no one searches for the sponsors name in the repo and that GItHub reindexes the repo soon.
  2. Create a duplicate repo from the most recent commit.

@TrentonBush what do you think is the best option?

TrentonBush commented 1 year ago

Given the choice, I'd rather keep the current repo. But if there are lots of additional steps to making that happen, I think option 2 isn't horrible. Comparing branches to issues, it looks like branches are easier to get rid of. It looks like almost every branch has already been merged or abandoned. I just took a brief tour through the open issues and could close about 40% of them without trying very hard.

I guess I'd vote for option 1 and to ignore the temporary indexing problem. If I search for The Term that Shall Not Be Named I see over 1k repos. I think we would be quite far down that list.

bendnorman commented 1 year ago

Sounds good. I'll move forward with option one.

bendnorman commented 1 year ago

I deleted branches that have been merged. @TrentonBush do you mind going through the remaining branches? I'm not familiar with the work living on the remaining branches.