Open BryanWilhite opened 4 years ago
Before answering your question directly, it's worth saying as a starting point: if all you need to do is pull content from static HTML pages, you should probably use a HTTP library (like Requests or the built-in urllib.request) with lxml or BeautifulSoup, not Selenium (although Selenium will probably be adequate too).
http://lxml.de/ http://www.crummy.com/software/BeautifulSoup/
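To make the quoted advice concrete, here is a minimal sketch of the static-page approach. It fetches HTML with the built-in urllib.request and pulls out the page title and link targets without starting a browser. To stay dependency-free it uses the standard library's html.parser as a stand-in for lxml or BeautifulSoup, which would replace the parser class in real use. The names LinkAndTitleParser, extract, and fetch, and the User-Agent string, are hypothetical and chosen only for illustration.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class LinkAndTitleParser(HTMLParser):
    """Collects the <title> text and every <a href> value from static HTML.

    Hypothetical helper; lxml or BeautifulSoup would normally do this job.
    """

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:  # skip anchors that have no href attribute
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract(html):
    """Return (title, links) parsed from an HTML string."""
    parser = LinkAndTitleParser()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.links


def fetch(url, timeout=10):
    """Download a page as text. The User-Agent value is an assumption."""
    request = Request(url, headers={"User-Agent": "static-scrape-sketch/0.1"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


if __name__ == "__main__":
    # Parse an inline sample so the sketch runs without network access.
    sample = (
        "<html><head><title>Demo</title></head>"
        "<body><a href='/a'>A</a></body></html>"
    )
    print(extract(sample))
```

Calling extract(fetch(url)) only sees the markup the server sends. If a page builds its content with JavaScript after loading, that content will be missing from the result, and that is the case where a browser driver such as Selenium becomes relevant.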
I assume that sites like Bloomberg and Medium are too cool for Beautiful Soup, requiring escalation to Selenium.
https://pypi.org/project/beautifulsoup4/
https://pypi.org/project/selenium/