Improve the efficiency of Q-learning

thekingofkings / chicago-partition

Automatically partition Chicago into Community Areas (CA), while minize the CA level crime prediction error.

MIT License

1 stars 1 forks source link

Closed thekingofkings closed 6 years ago

thekingofkings commented 6 years ago

Try to save a pre-trained Q-learning model, and update the model only when necessary.

Profile the Q-learning process. Which parts took the longest time to run?

If the feature generations took a lot of time. There is strong motivation to use Q-learning to estimate the results of a potential tract-flip, rather than actually flipping.
If the NB model training used most of the time, then what is the conclusion?

porterjenkins commented 6 years ago

Interesting idea! Is this sort of a transfer learning idea? Where we carry over weights iteration over iteration?