I love the fact that this repo is being shared and The Economist is taking an open source approach to this. BRAVO!!!
That said, publicizing the code / data is one thing, making it ready for the community to digest and improve is another thing. So before I post issues, I'd love if you could clarify a few questions of mine:
1. What is the goal / purpose of this being open sourced?2. What is the goal / purpose for this codebase?
Answering my questions would help me understand the pros/cons of this. For example, if the goal is to accurately predict an outcome given all available data, this model should include more recent data, no? Furthermore, if the goal of this to be open sourced is to get community involvement, this codebase could use some better structure for people to separate "standard code" from "domain specific" code to help focus efforts.
For example, please correct me if I'm wrong, but after diving in a little, it looks like the data feeding this model is all 2016 and before? (This would mean there is no way to correct for specific new candidates that have never influenced the data before).
I'd love any feedback you have on this as the importance of this codebase and the influence of The Economist warrant this being taken seriously.
I love the fact that this repo is being shared and The Economist is taking an open source approach to this. BRAVO!!!
That said, publicizing the code / data is one thing, making it ready for the community to digest and improve is another thing. So before I post issues, I'd love if you could clarify a few questions of mine:
1. What is the goal / purpose of this being open sourced? 2. What is the goal / purpose for this codebase?
Answering my questions would help me understand the pros/cons of this. For example, if the goal is to accurately predict an outcome given all available data, this model should include more recent data, no? Furthermore, if the goal of this to be open sourced is to get community involvement, this codebase could use some better structure for people to separate "standard code" from "domain specific" code to help focus efforts.
For example, please correct me if I'm wrong, but after diving in a little, it looks like the data feeding this model is all 2016 and before? (This would mean there is no way to correct for specific new candidates that have never influenced the data before).
I'd love any feedback you have on this as the importance of this codebase and the influence of The Economist warrant this being taken seriously.
Towards transparency! Thank you The Economist!