yamatokataoka / learning-from-human-preferences

Replication of Deep Reinforcement Learning from Human Preferences (Christiano et al, 2017).
MIT License
2 stars 0 forks source link

set up rl-human-prefs #4

Closed yamatokataoka closed 2 years ago

yamatokataoka commented 2 years ago
yamatokataoka commented 2 years ago

poetry would be good option for sophisticated dependency management

https://mitelman.engineering/blog/python-best-practice/automating-python-best-practices-for-a-new-project/#fast-track

yamatokataoka commented 2 years ago

poetry with Docker https://github.com/python-poetry/poetry/discussions/1879#discussioncomment-216865

I don't use Docker because we can't run Docker on Colab and isolation of environment would be enough with poetry and its venv.

yamatokataoka commented 2 years ago

linters

yamatokataoka commented 2 years ago

pyproject.toml https://martin-thoma.com/pyproject-toml/

yamatokataoka commented 2 years ago

poetry with tox https://github.com/python-poetry/poetry/issues/1941