austinjhunt / msci_esg

A Python package that uses Selenium to scrape content from the MSCI.com ESG Ratings Corporate Search Tool.
MIT License
25 stars 10 forks

Historical ESG Ratings #4

Open ssaltmann opened 2 years ago

ssaltmann commented 2 years ago

I think the structure of the MSCI sustainability website has changed, so the request for the rating history no longer works.

vincLohm commented 2 years ago

I think the URL has changed. To get the issuerID, it's now:

https://www.msci.com/research-and-insights/esg-ratings-corporate-search-tool?p_p_id=esgratingsprofile&p_p_lifecycle=2&p_p_state=normal&p_p_mode=view&p_p_resource_id=searchEsgRatingsProfiles&p_p_cacheability=cacheLevelPage&_esgratingsprofile_keywords={}

and for the actual rating page:

https://www.msci.com/research-and-insights/esg-ratings-corporate-search-tool/issuer/{}/{}
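For anyone patching this locally, here is a rough sketch of filling in the two URL templates. The function and parameter names are hypothetical and not part of msci_esg. The comment above does not say what the two `{}` slots in the issuer URL stand for, so the sketch assumes a URL-friendly issuer name followed by the issuerID; adjust if the site expects something else.

```python
from urllib.parse import quote

BASE = "https://www.msci.com/research-and-insights/esg-ratings-corporate-search-tool"

# Search endpoint template from the comment above; {} takes the search keywords.
SEARCH_URL = (
    BASE
    + "?p_p_id=esgratingsprofile&p_p_lifecycle=2&p_p_state=normal&p_p_mode=view"
    + "&p_p_resource_id=searchEsgRatingsProfiles&p_p_cacheability=cacheLevelPage"
    + "&_esgratingsprofile_keywords={}"
)

# Rating page template from the comment above; the meaning of the two slots
# is an assumption (issuer name slug, then issuerID).
ISSUER_URL = BASE + "/issuer/{}/{}"


def search_url(keywords):
    """Build the search URL, percent-encoding the keywords."""
    return SEARCH_URL.format(quote(keywords, safe=""))


def issuer_url(name_slug, issuer_id):
    """Build the rating page URL (argument order is an assumption)."""
    return ISSUER_URL.format(quote(name_slug, safe=""), quote(issuer_id, safe=""))
```

Calling `search_url("apple inc")` yields the search URL with `apple%20inc` as the keywords value; the response would then need to be parsed for the issuerID before calling `issuer_url`.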

Also, I had to switch the Selenium WebDriver away from PhantomJS, because PhantomJS is apparently completely deprecated now. You can fix this easily by following the steps here: https://stackoverflow.com/questions/50416538/python-phantomjs-says-i-am-not-using-headless and downloading chromedriver from here: https://sites.google.com/chromium.org/driver/ . I still need to learn more about headless chromedriver, in particular how to set JS timeout parameters and other details, but after making those changes, the code worked for me.
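A minimal sketch of what the PhantomJS replacement might look like, including the timeout settings mentioned above. It assumes Selenium and Chrome are installed and that chromedriver can be found (for example on PATH). The function names, the extra Chrome flags, and the timeout values are illustrative choices, not what msci_esg itself uses.

```python
def chrome_arguments():
    """Command-line flags for running Chrome without a visible window."""
    return ["--headless", "--disable-gpu", "--window-size=1920,1080"]


def make_headless_driver(script_timeout=10, page_load_timeout=30):
    """Create a headless Chrome driver with JS and page-load timeouts (seconds)."""
    # Imported here so this module can be loaded even where Selenium is missing.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    for arg in chrome_arguments():
        options.add_argument(arg)

    driver = webdriver.Chrome(options=options)
    # Limit how long asynchronous scripts may run before Selenium raises an error.
    driver.set_script_timeout(script_timeout)
    # Limit how long driver.get() waits for a page to finish loading.
    driver.set_page_load_timeout(page_load_timeout)
    return driver
```

Typical use would be `driver = make_headless_driver()`, then `driver.get(...)` on the rating page URL, and `driver.quit()` when finished so the browser process does not linger.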