duckduckgo / zeroclickinfo-fathead

DuckDuckGo Instant Answers based on keyword data files
https://duckduckhack.com/

Unix Man Pages: Doesn't trigger on some commands #174

Closed tagawa closed 7 years ago

tagawa commented 8 years ago

@flaming-toast

For example, the query "man cd" doesn't trigger the IA: https://duckduckgo.com/?q=man+cd (reported via Twitter: https://twitter.com/manuelmagic/status/684309512552099840)


IA Page: http://duck.co/ia/view/unix_man

mbionchi commented 8 years ago

@tagawa That happens because the source for this IA, linuxcommand.org, doesn't have all the man pages. One possible solution is to use a more complete source, e.g. linux.die.net/man/. I'd like to fix this later this week.

mbionchi commented 8 years ago

So apparently linux.die.net's robots.txt blocks wget. That can be circumvented by changing the user agent or by not honoring robots.txt at all, but I'm not sure that's ethical. @moollaza, what do you think?
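
For reference, a minimal sketch of how a fetch script could check robots.txt before downloading anything, using Python's standard urllib.robotparser. The robots.txt content below is a made-up sample for illustration, not the actual file served by linux.die.net, and the helper name is_allowed is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only; the real file
# served by linux.die.net may differ.
SAMPLE_ROBOTS_TXT = """\
User-agent: wget
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://linux.die.net/man/1/cd"
    for agent in ("Wget/1.21", "ExampleBot"):
        print(agent, is_allowed(SAMPLE_ROBOTS_TXT, agent, url))
```

With a real site, the parser could instead be pointed at the live file via set_url() and read(). Whether a rule aimed at one tool should be read as applying to any automated fetcher is the judgment call raised above; the code only reports what the file says.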

amouat commented 8 years ago

What about http://manpages.ubuntu.com/ ?

I was trying to find man capabilities, which is strangely difficult, and several resources seem out of date.

pjhampton commented 7 years ago

We are now going to address this issue here: https://github.com/duckduckgo/zeroclickinfo-fathead/issues/734. Closing this due to inactivity.