hearchco / agent

Hearchco agent used to gather info from a variety of sources.
https://hearch.co
GNU Affero General Public License v3.0
26 stars 3 forks source link

[FEAT/SE] Yandex #30

Open k4lizen opened 1 year ago

k4lizen commented 1 year ago

Describe your feature request Create a module for parsing Yandex search results

k4lizen commented 1 year ago

This is very hard. Searxng doesn't even support it because of this, see this and this

aleksasiriski commented 1 month ago

There's an implementation in SearXNG, although it gets hit by captcha a lot of times it's probably better to have it at least sometimes. Also, using a rotating proxy (like what AWS Lambda kind of is) may make this viable.

https://github.com/searxng/searxng/blob/master/searx/engines/yandex.py