ccao-data / model-res-avm

Automated valuation model for all class 200 residential properties in Cook County (except vacant land and condos)
GNU Affero General Public License v3.0

Switch CV setup and expand comps search range #197

Closed dfsnow closed 8 months ago

dfsnow commented 8 months ago

This PR switches our main cross-validation (CV) method from a time-based rolling window to 10-fold random CV. Random V-fold seems to yield hyperparameters that perform slightly better on our test set, and it does so more consistently than rolling-window CV.
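For readers less familiar with the two schemes, here is a minimal, standard-library-only sketch of how the splits differ. It is not the repo's pipeline code: the function names (`random_kfold`, `rolling_window`), the row count, and the expanding-window variant of the time-based split are all assumptions made for illustration, and rows are assumed to be sorted by sale date.

```python
import random

def random_kfold(n, k=10, seed=0):
    """Yield (train, validation) index lists for shuffled k-fold CV."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    # Deal shuffled indices into k roughly equal folds
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        val = sorted(folds[i])
        train = sorted(j for f in folds[:i] + folds[i + 1:] for j in f)
        yield train, val

def rolling_window(n, k=10):
    """Yield time-ordered splits: train on earlier rows, validate on the next block.

    Hypothetical variant with an expanding training window.
    """
    block = n // (k + 1)
    for i in range(1, k + 1):
        train = list(range(0, i * block))
        val = list(range(i * block, (i + 1) * block))
        yield train, val

n = 110  # hypothetical number of sales, sorted by date
random_splits = list(random_kfold(n))
rolling_splits = list(rolling_window(n))
```

With random folds, every sale is used for validation exactly once and training folds mix earlier and later sales. With the time-based scheme, training rows always precede validation rows, so the earliest block is never validated on.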

This PR also expands the comparables search range by increasing the size of the bins (reducing the number of bins from 30 to 10).
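To see why fewer bins widen the search, here is a rough sketch that assumes the search for comps is restricted to candidates in the same bin as the target, and that bins are equal-count (quantile-style). Both are assumptions for illustration, as are the helper names `assign_bins` and `candidates_in_bin` and the toy data; the actual binning logic in the repo may differ.

```python
def assign_bins(values, num_bins):
    """Assign each value to an equal-count bin based on its rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * num_bins // len(values)
    return bins

def candidates_in_bin(bins, target):
    """Count other observations that share the target's bin."""
    return sum(1 for i, b in enumerate(bins) if b == bins[target] and i != target)

values = list(range(300))  # hypothetical binning variable for 300 sales
counts = {nb: candidates_in_bin(assign_bins(values, nb), 0) for nb in (30, 10)}
print(counts)  # {30: 9, 10: 29}
```

Under these assumptions, going from 30 bins to 10 roughly triples the pool of candidate comps per target property.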