smart022 commented 5 years ago

All about LGB params

Juejin: LGB parameters explained * official categories: Core / Learning Control / IO

Core
- task
- objective
- boosting : default=gbdt, == boosting_type, boost
- num_iterations : default=100, == num_iteration, n_iter, num_trees, num_rounds, n_estimators
- learning_rate : default=0.1, == eta, shrinkage_rate
- num_leaves : default = 31, == num_leaf, max_leaves, max_leaf
- num_threads : default = 0, recommended = number of real CPU cores, == num_thread, nthread, nthreads, n_jobs
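
A minimal sketch of the core parameters as a plain dict, using the defaults listed above. The `task` and `objective` values are assumptions for illustration (a binary classification training run); the note does not give them.

```python
# Core parameters with the defaults noted in the list above.
# "task" and "objective" values are illustrative assumptions.
core_params = {
    "task": "train",
    "objective": "binary",   # assumption: binary classification
    "boosting": "gbdt",
    "num_iterations": 100,
    "learning_rate": 0.1,
    "num_leaves": 31,
    "num_threads": 0,        # the note recommends the number of real CPU cores
}

for name, value in core_params.items():
    print(f"{name} = {value}")
```

With LightGBM installed, a dict like this is what gets passed as the `params` argument of `lgb.train`.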
Learning Control
- max_depth: default = -1 (no limit)
- min_data_in_leaf: default = 20, == min_data_per_leaf, min_data, min_child_samples
- min_sum_hessian_in_leaf: default = 1e-3, == min_hessian, min_child_weight
- bagging_fraction: default = 1.0, avail = (0,1], == sub_row, subsample, bagging
- bagging_freq: default = 0, type = int, k means bag at every k iterations (0 disables bagging), == subsample_freq
- feature_fraction: default = 1.0, == sub_feature, colsample_bytree
- feature_fraction_seed
- early_stopping_round: default = 0
- max_delta_step
- lambda_l1: default = 0 avail=[0,inf), == reg_alpha
- lambda_l2: default = 0, avail=[0,inf), == reg_lambda
- min_gain_to_split: default = 0.0 >=0, == min_split_gain
- drop_rate
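
The `==` entries above are aliases, which is handy when porting xgboost-style or sklearn-style configs. A rough sketch of normalizing them to the canonical names follows; `normalize_params` is a hypothetical helper (not part of LightGBM), and the alias table is only the subset written in this note.

```python
# Alias table copied from the "==" entries in this note (a subset, not the full official list).
ALIASES = {
    "num_iterations": ["num_iteration", "n_iter", "num_trees", "num_rounds", "n_estimators"],
    "learning_rate": ["eta", "shrinkage_rate"],
    "num_leaves": ["num_leaf", "max_leaves", "max_leaf"],
    "min_data_in_leaf": ["min_data_per_leaf", "min_data", "min_child_samples"],
    "bagging_fraction": ["sub_row", "subsample", "bagging"],
    "bagging_freq": ["subsample_freq"],
    "feature_fraction": ["sub_feature", "colsample_bytree"],
    "lambda_l1": ["reg_alpha"],
    "lambda_l2": ["reg_lambda"],
    "min_gain_to_split": ["min_split_gain"],
}

# Invert the table: alias -> canonical name.
CANONICAL = {alias: name for name, aliases in ALIASES.items() for alias in aliases}


def normalize_params(params):
    """Hypothetical helper: rename aliases to canonical names, rejecting conflicting duplicates."""
    out = {}
    for key, value in params.items():
        name = CANONICAL.get(key, key)
        if name in out and out[name] != value:
            raise ValueError(f"conflicting values for {name!r}")
        out[name] = value
    return out


xgb_style = {"eta": 0.05, "n_estimators": 500, "subsample": 0.8, "reg_lambda": 1.0}
print(normalize_params(xgb_style))
```

LightGBM resolves these aliases itself; the point of the sketch is only to make a mixed config readable before passing it on.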
IO
- verbosity
- max_bin: default = 255, type = int, avail=(1,inf)
- min_data_in_bin
- bin_construct_sample_cnt
- histogram_pool_size
Objective
Metric
- metric
  - "":same as objective
  - l1
  - l2
  - auc
Official tuning suggestions
For the Leaf-wise tree
- num_leaves
- min_data_in_leaf
- max_depth
For faster speed
- bagging_fraction && bagging_freq
- feature_fraction
- max_bin: use a smaller value
- save_binary
For Better Accuracy
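- max_bin: use a larger value (may be slower)
- learning_rate: use a small value together with a large num_iterations
- num_leaves: use a large value (may over-fit)
- Use more training data
- Try dart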
Deal with Over-fitting
- max_bin: use a small value (also faster)
- num_leaves: use a small value
- min_data_in_leaf && min_sum_hessian_in_leaf
- bagging_fraction && bagging_freq
- feature_fraction
- lambda_l1 / lambda_l2 && min_gain_to_split
- max_depth
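
One way to read the lists above is as sets of overrides on top of the defaults. The sketch below assumes that framing; every override value is an illustrative assumption, not a recommended setting, and `apply_preset` is a hypothetical helper.

```python
# Defaults as listed in the parameter notes above.
DEFAULTS = {
    "num_leaves": 31,
    "max_bin": 255,
    "max_depth": -1,
    "learning_rate": 0.1,
    "num_iterations": 100,
    "min_data_in_leaf": 20,
    "bagging_fraction": 1.0,
    "bagging_freq": 0,
    "feature_fraction": 1.0,
    "lambda_l1": 0.0,
    "lambda_l2": 0.0,
    "min_gain_to_split": 0.0,
}

# Illustrative overrides; the numbers are assumptions chosen only to show direction.
FASTER = {"bagging_fraction": 0.8, "bagging_freq": 5, "feature_fraction": 0.8, "max_bin": 63}
MORE_ACCURATE = {"max_bin": 511, "learning_rate": 0.02, "num_iterations": 1000, "num_leaves": 63}
LESS_OVERFIT = {
    "max_bin": 63,
    "num_leaves": 15,
    "min_data_in_leaf": 50,
    "bagging_fraction": 0.8,
    "bagging_freq": 5,       # bagging only takes effect when bagging_freq is non-zero
    "feature_fraction": 0.8,
    "lambda_l2": 1.0,
    "min_gain_to_split": 0.1,
    "max_depth": 7,
}


def apply_preset(preset, base=DEFAULTS):
    """Hypothetical helper: return a new dict with the preset layered over the base."""
    return {**base, **preset}


params = apply_preset(LESS_OVERFIT)
print(params["num_leaves"], params["max_bin"], params["max_depth"])
```

Note that the accuracy and over-fitting lists pull `max_bin` and `num_leaves` in opposite directions, so the presets are alternatives rather than something to stack.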
Summary of important parameters
- max_bin: a small value guards against over-fitting and speeds up training
- num_leaves: <= 2^(max_depth)
- max_depth: typically 5 - 10
- bagging_fraction && bagging_freq
- feature_fraction
- lambda_l1 / lambda_l2 && min_gain_to_split
- max_depth
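
The `num_leaves <= 2^(max_depth)` rule follows from a binary tree of depth d having at most 2^d leaves. A small sketch of applying it; `clamp_num_leaves` is a hypothetical helper, and a non-positive `max_depth` is treated as "no limit" as in the parameter list.

```python
def clamp_num_leaves(num_leaves, max_depth):
    """Hypothetical helper: cap num_leaves at 2**max_depth; max_depth <= 0 means no limit."""
    if max_depth <= 0:
        return num_leaves
    return min(num_leaves, 2 ** max_depth)


# Upper bound on leaves for the typical depth range mentioned in the summary.
for depth in (5, 7, 10):
    print(depth, 2 ** depth)   # 5 32, 7 128, 10 1024

print(clamp_num_leaves(255, 5))   # 32
print(clamp_num_leaves(31, -1))   # 31
```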

smart022 commented 5 years ago

Three tricks to improve model performance on imbalanced data (with Python code). Important parameters for imbalanced data:

is_unbalance
class_weight
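
For intuition on `class_weight`, here is a rough sketch of the usual "balanced" weighting, `n_samples / (n_classes * count)`, in plain Python. `balanced_class_weights` is a hypothetical helper and the 90/10 label split is made-up toy data.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Hypothetical helper: weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Toy data: 90 negatives, 10 positives.
y = [0] * 90 + [1] * 10
weights = balanced_class_weights(y)
print(weights)   # the rare class gets the larger weight (5.0 vs about 0.556)
```

A dict like this can be passed as `class_weight` in the sklearn-style classifier, while `is_unbalance` is the simpler boolean switch for binary tasks.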

smart022 commented 5 years ago

Model_Ensemble.py

smart022 / articles

All about tuning #10
