Yard1 opened this issue 2 years ago (status: Open)
We are currently converting to Pandas before initialising the DMatrix. We should consider using Arrow instead to avoid unnecessary copies. XGBoost has Arrow support - https://github.com/dmlc/xgboost/pull/7512
Thanks for adding this! It looks like the changes were split over two PRs. FYI, here is the second: https://github.com/dmlc/xgboost/pull/7283
What about also supporting Polars DataFrames for creating a DMatrix in Python?