psf / requests-html

Pythonic HTML Parsing for Humans™
http://html.python-requests.org
MIT License
13.64k stars 977 forks source link

Basic doc example no longer works #576

Open dannykellett opened 2 months ago

dannykellett commented 2 months ago

As doc here: https://requests-html.kennethreitz.org/

from requests_html import HTMLSession
def main() -> None:
    session = HTMLSession()
    r = session.get('https://python.org/')
    print(f"all links = {r.html.absolute_links}")

if __name__ == '__main__':
    main()

Traceback (most recent call last): File "E:\11-Projects\learning_requests_html.py", line 1, in from requests_html import HTMLSession File "E:\11-Projects.venv\Lib\site-packages\requests_html.py", line 14, in from lxml.html.clean import Cleaner File "E:\11-Projects.venv\Lib\site-packages\lxml\html\clean.py", line 18, in raise ImportError( ImportError: lxml.html.clean module is now a separate project lxml_html_clean. Install lxml[html_clean] or lxml_html_clean directly.

I guess I should mention that it worked after installing lxml but thought I should say the docs are not correct.

jordanralba commented 1 month ago

Ran into the same issue. Hopefully, they update their documentation shortly.