NVIDIA / spark-rapids-examples

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
Apache License 2.0
130 stars 51 forks source link

there is a dead link #402

Closed nvliyuan closed 3 months ago

nvliyuan commented 5 months ago

It seems the Agaricus dataset has been moved to another place.

ERROR: 1 dead links found!
[✖] https://gust.dev/r/xgboost-agaricus → Status: 404

FILE: ./docs/get-started/xgboost-examples/prepare-package-data/preparation-python.md
[✓] https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/24.06.0/rapids-4-spark_2.12-24.06.0.jar → Status: 200
[/] /docs/get-started/xgboost-examples/building-sample-apps/python.md → Status: 0
[/] /docs/get-started/xgboost-examples/dataset/mortgage.md → Status: 0
[✓] https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page → Status: 200
[✖] https://gust.dev/r/xgboost-agaricus → Status: 404
GaryShen2008 commented 5 months ago

Hi @wbo4958, can you help to find the dataset?

wbo4958 commented 5 months ago

No idea the original dataset, but seems xgboost repo has agaricus, https://github.com/dmlc/xgboost/tree/master/demo/data ?

parthosa commented 4 months ago

Seeing more deadlinks in my PR's markdown check job #405

parthosa commented 4 months ago

Do we have an update on this? Existing PR seems to be blocked.

Here is a link I found for reference - https://web.archive.org/web/20200803173807/https://gust.dev/r/xgboost-agaricus

nvliyuan commented 4 months ago

Hi @parthosa , thx for the reference, please ignore the dead link issue, I will file a pr to fix it.

GaryShen2008 commented 3 months ago

Fixed by PR 409. Close it.