Open Noxoomo opened 6 years ago
@Noxoomo @annaveronika if you want to further comment please go ahead, I will be redoing this benchmark next week for a paper submission.
Hello, my name is Dmitriy Kruchinin and i'm writing to you as a member of CatBoost team. We would like to ask you to pay attention to the following points during re-evaluation of your benchmarks.
Dataset collection. You may find the example of our gbdt benchmark here. It is based on your repository. We want to point out that we extend the collection of datasets:
Obviously, the set can be further expanded, but in our opinion this version fully covers the cases of various load on the GBDT library.
If you will use these datasets in your benchmark, please tell us if your results will differ from our.
@KruchDmitriy Thanks for your response, I will look at adding some extra datasets.
You should set number of devices to equal number for CatBoost/XGBoost
so the benchmark in not fair without CUDA_VISIBLE_DEVICES=id, because CatBoost uses all devices by default and this is not a good idea to you small benchmark datasets with 8V100 servers