Closed Alexhans closed 2 years ago
I'm not sure if it is necessary or not, but currently duckduckgo is not working as an engine due to https://github.com/gleitz/howdoi/issues/404
If you'd like to figure out that issue, I could use some help with it!
I can try. I actually came across this while quickly playing around to see If I could add brave support.
Support works but I wanted to understand the decisions around URLs and usage. I did get temporarily blocked when I added the unit tests in brave which it's somewhat expected (I still remember the Google has been DDoSing SourceHut for over a year story).
The only thing I can think of is ask duckduckgo & brave to see if they have specific ways to interact programmatically with their websites.
I do think the answer will not be satisfactory for duckduckgo since, in their instant answers api page they state:
This API does not include all of our links, however. That is, it is not a full search results API or a way to get DuckDuckGo results into your applications beyond our instant answers. Because of the way we generate our search results, we unfortunately do not have the rights to fully syndicate our results, free or paid. For the same reason, we cannot allow framing our results without our branding. Please see our partnerships page for more info on guidelines and getting in touch with us.
So crawling ethically (Without trying to circumvent through proxies or similar) will invariably get blocked. For DDG, it might be a case of choosing whether to remove it entirely or just support instant answers through their API (For any API based access, users could get their own tokens like in OpenBBTerminal
Yes I am not optimistic that API access will be given, so we're left with crawling.
I also get rate limited during development, which is why I have the caching mechanism in place when running tests.
I didn't know brave had a search engine. I would accept that PR if you want to open it.
I'll take a look at what we discussed over the weekend and create the pull request. It's been a busy period.
No worries - take your time and thanks again for any support you can give to the project.
Hi,
I was looking at the different query parameters in the search urls when I found that duckduckgo's
t
is a tracking code (I don't know whathj
as the value means: https://help.duckduckgo.com/privacy/t/https://github.com/gleitz/howdoi/blob/f202eea521be7c280390a950fc881cce59336ceb/howdoi/howdoi.py#L72