DariaTan / ArableLand2

1 stars 0 forks source link

Review #1

Open vjugor1 opened 1 year ago

vjugor1 commented 1 year ago

README.md

Markdown

Dockerfile

Please wrap all your code into docker container

notebook_crop.ipynb

notebook_yield.ipynb

Data

'.pkl' files

vjugor1 commented 1 year ago

Don't forget to delete old commented code (like here), or store it somewhere else

vjugor1 commented 1 year ago

Another minor comment, within src.utils keep to the same name convention: if you use variable name clf in one function, do not use variable name model for the same substance in another place

vjugor1 commented 1 year ago

src.utils docstring for this function looks outdates (missing one argument). This makes the unclear what does out_meta.update method

Stopped here, will continue

vjugor1 commented 1 year ago

-Please, change the loop variable name prefix or do not use variable name prefix inside list comprehension in this loop, e.g., here. It is 1. misleading, 2. dangerous - something can be overwritten

DariaTan commented 1 year ago
  • Why years[1] here? It happens because rolling mean procedure turns first and last values to nan. We should kinda drop them

  • Seems like [const_2] ) is redundant to have inside the nested loop. Moreover, considering the definition of const_array_0, const_array_1, const_array_2, it is not clear why are they under loop over prefix, since, for each iteration, you assemble them over the full set of prefixes.

Well, I can believe it looks ugly a bit. It is literally one hot encoding done by hand. Arrays are defined inside the loop since they are set of zeros. When we use them as features for object, you put 1 to the column related with the country (prefix here)

vjugor1 commented 1 year ago

src.utils

src.models.crop_classifier.py

I am moving to notebooks