apache / lucene

Apache Lucene open-source search software
https://lucene.apache.org/
Apache License 2.0
2.59k stars 1.01k forks source link

remove refs to people.apache.org/home.apache.org in build #13647

Open rmuir opened 1 month ago

rmuir commented 1 month ago

Description

The server will be decomm'ed soon, we should fix these refs:

$ git grep people.apache.org
$ git grep home.apache.org
mikemccand commented 3 weeks ago

Thanks @rmuir and @ChrisHegarty.

I've downloaded all my content from home.apache.org (Lucene benchmark source corpora, line file docs, large vector file, etc.), so we won't lose any benchy stuff once the box goes poof.

I need to find a new home for the nightly benchmarks logs/charts, currently at https://home.apache.org/~mikemccand/lucenebench ... when each nightly benchy finishes it copies up the results (all charts with a new data point) using sftp via Python. I'm leaning towards a simple GitHub pages site (thank you @msokolov for the idea), though it has a limit of 1 GB and the benchy reports are now ~1.7 GB: ~13 years of detailed nightly benchy reports adds up! I can probably work around that.

For the larger stuff (corpora) I'll mull some more. I have plenty of storage in my personal Google drive account, so I can just start there.

mikemccand commented 3 weeks ago

I'm leaning towards a simple GitHub pages site (thank you @msokolov for the idea)

I enabled pages for the luceneutil repo and pushed a copy of the current nightly benchy reports: https://mikemccand.github.io/luceneutil/index.html. Looks like it basically works, yay!

It should be simple to fix the nightly benchy script to publish updates via git add/commit/push instead of the current sftp mikemccand@home.apache.org. I'll do that next...

mikemccand commented 3 weeks ago

A nice side effect of this is that the long running (13+ years now!) nightly reports will be backed up via git/GitHub and no longer single sourced on my home box, yay. And if ever some exotic bug shows up in the publishing, we will have the full git history showing each nightly benchy update going forwards to help debug.

msokolov commented 3 weeks ago

Nice! glad it worked.

FYI: I clicked on a few random links and found a 404 https://mikemccand.github.io/luceneutil/analyzers.html although this page does seem to exist on the current site

mikemccand commented 2 weeks ago

Phew, OK, I think nightly benchy is now successfully publishing automatically to https://mikemccand.github.io/lucenenightly (using GitHub pages). Last night's run "just worked".

I had to create a new public repo (https://github.com/mikemccand/lucenenightly) just for publishing the many (~1.9 GB, ~5.9K pages) nightly artifacts over 13+ years.

FYI: I clicked on a few random links and found a 404 https://mikemccand.github.io/luceneutil/analyzers.html although this page does seem to exist on the current site

Hmm I'll dig into this -- it's still an issue with lucenenightly repo too. Thanks for catching it @msokolov.

mikemccand commented 2 weeks ago

FYI: I clicked on a few random links and found a 404 https://mikemccand.github.io/luceneutil/analyzers.html although this page does seem to exist on the current site

Hmm I'll dig into this -- it's still an issue with lucenenig

OK should be fixed now! https://mikemccand.github.io/lucenenightly/analyzers.html

mikemccand commented 2 weeks ago

I also aliased (CNAMEd) benchmarks.mikemccandless.com -- GitHub pages makes this simple-ish, yay.