plantsgo / Instacart-Market-Basket-Analysis

kaggle:Instacart-Market-Basket-Analysis-12th solution

Is some code or a file missing? #1

Open timlincool opened 7 years ago

timlincool commented 7 years ago

Impressive Work! Thank you for sharing the code.

But I was wondering whether some code is missing.

In basket_v12_prior_and_train_data_cv5_up_up.py

It seems that output_2/train_add.csv is missing.

And in blending.py I can't find prediction_arboretum.csv

Could you tell me how to find it? Thank you.

ankitnayan commented 6 years ago

Same issue. @plantsgo, can you please clear things up?

plantsgo commented 6 years ago

@timlincool @ankitnayan Sorry for the late reply!

1. create_prior_data.py and create_prior_add_data.py should each be run twice. The first time, run them as they are. The second time, change the path from "output" to "output_2" in both create_prior_data.py and create_prior_add_data.py, then run them again.

2. prediction_arboretum.csv is from https://www.kaggle.com/c/instacart-market-basket-analysis/discussion/37697#latest-214906
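For anyone who would rather not edit the two scripts by hand for the second run, here is a rough sketch of one way to do it. It is not part of the repository. It assumes the paths in create_prior_data.py and create_prior_add_data.py are written as string literals that start with "output/", which may not hold for every line, so check the generated copies before running them. The helper names `retarget_output` and `write_second_pass_copy` are made up for this example.

```python
import re
from pathlib import Path

# Assumption: output paths appear as string literals that begin with "output/".
# Literals that already point at "output_2/" do not match and are left alone.
_OUTPUT_LITERAL = re.compile(r"""(["'])output/""")


def retarget_output(source: str, new_dir: str = "output_2") -> str:
    """Return the source text with 'output/...' literals pointed at new_dir."""
    return _OUTPUT_LITERAL.sub(lambda m: f"{m.group(1)}{new_dir}/", source)


def write_second_pass_copy(script: Path, new_dir: str = "output_2") -> Path:
    """Write a retargeted copy next to the script and return the copy's path."""
    copy = script.with_name(f"{script.stem}_{new_dir}{script.suffix}")
    text = script.read_text(encoding="utf-8")
    copy.write_text(retarget_output(text, new_dir), encoding="utf-8")
    return copy


if __name__ == "__main__":
    # Script names are the ones mentioned in this thread.
    for name in ("create_prior_data.py", "create_prior_add_data.py"):
        path = Path(name)
        if path.exists():
            print("wrote", write_second_pass_copy(path))
```

Run the original scripts first, then the generated copies. Writing copies instead of editing in place keeps the first-run versions untouched.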

ankitnayan commented 6 years ago

Thanks for answering, @plantsgo. One more query though: what about product2vec = pd.read_csv('features/product2vec_begin_1.csv') in line 27 of create_prior_add_data.py?

plantsgo commented 6 years ago

@ankitnayan Haha, my mistake. It should be product2vec = pd.read_csv('features/product2vec.csv')
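Since several files in this thread turned out to be missing or misnamed, a small pre-flight check can save a long failed run. This is only a sketch, not something from the repository: `missing_inputs` is a made-up helper, and the list below contains just the two paths spelled out in this thread, so extend it with whatever your scripts actually read.

```python
from pathlib import Path


def missing_inputs(paths, root="."):
    """Return the paths (in the given order) that are not files under root."""
    base = Path(root)
    return [p for p in paths if not (base / p).is_file()]


if __name__ == "__main__":
    # Paths taken from this thread; add the other inputs your scripts need.
    expected = [
        "features/product2vec.csv",
        "output_2/train_add.csv",
    ]
    for path in missing_inputs(expected):
        print("missing:", path)
```

Run it from the repository root before starting the feature and training scripts.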

ankitnayan commented 6 years ago

Cool

ankitnayan commented 6 years ago

@plantsgo Also can you please specify:

  1. Which code file creates order_streaks_features.csv which is read in basket_v10_none.py?
  2. Which code file creates stacking_blending.csv which is read in basket_v10_none.py and features_faron.py?
  3. What is the order in which the files should be run after plantsgo.bat?