tasos-py / Search-Engines-Scraper

Search google, bing, yahoo, and other search engines with python
MIT License
513 stars 137 forks source link

Yahoo title parsing improving #38

Closed soxoj closed 2 years ago

soxoj commented 2 years ago

I noticed that title of Yahoo is extracted incorrectly:

URL: https://gist.github.com/soxoj/9d65c2f4d3bec5dd25949197ea73cf3a
Title: gist.github.com › soxoj › 9d65c2f4d3bec5dd25949197eamaigret.ipynb · GitHub

Title should be maigret.ipynb · GitHub

I did some fixes form my other project here

tasos-py commented 2 years ago

Nice catch! I fixed it with bs4's .decompose(), but I'll keep this open in case there is more work to be done