sethblack / python-seo-analyzer

An SEO tool that analyzes the structure of a site, crawls the site, count words in the body of the site and warns of any technical SEO issues.
Other
1.18k stars 305 forks source link

UnicodeEncodeError when calling print(output_from_parsed_template) for some websites #64

Open Lima-Codes opened 3 years ago

Lima-Codes commented 3 years ago

Describe the bug When running the analyzer on https://www.amazon.jobs/en/, the following error occurs:

UnicodeEncodeError: 'charmap' codec can't encode character '\u202f' in position 10874: character maps to <undefined>

The error occurs when calling print(output_from_parsed_template) in __main__.py and seem to be related to the html output option.

When commenting out the print statement, the program finishes but the HTML output report is blank.

To Reproduce Steps to reproduce the behavior:

  1. Run seoanalyze https://www.amazon.jobs/en/ --output-format html

Expected behavior No error. Program executes normally.

Desktop (please complete the following information):

sethblack commented 3 years ago

Hmm. This one may be a bit tricker but I have some ideas.