Kag-Jane / sandbox

Sandbox for EDA
0 stars 0 forks source link

Feature engineering #3

Open kurtosis111 opened 2 weeks ago

kurtosis111 commented 2 weeks ago
  1. Correlation of features from the same tag is high: https://www.kaggle.com/code/lifuhaha/analysis-of-weight-by-symbol-id
  2. Feature clustering: https://www.kaggle.com/competitions/jane-street-real-time-market-data-forecasting/discussion/542209
  3. Baseline notebooks: https://www.kaggle.com/code/motono0223/js24-inference-gbdt-with-lags-singlemodel
kurtosis111 commented 4 days ago

Screenshot 2024-11-24 at 22 03 46

kurtosis111 commented 4 days ago

correlation of lag with responder_3: Screenshot 2024-11-24 at 22 05 42

kurtosis111 commented 4 days ago

Screenshot 2024-11-24 at 22 03 46

This conclusion is only based on the train_parquet 0, 1, 2.