duckduckgo / zeroclickinfo-fathead

DuckDuckGo Instant Answers based on keyword data files
https://duckduckhack.com/
Other
318 stars 365 forks source link

C: New Linux Man Pages Fathead #734

Closed pjhampton closed 7 years ago

pjhampton commented 7 years ago

Create a new Linux Man Pages Fathead Instant Answer

Help us make DuckDuckGo the best search engine for programmers!

What do I need to know?

There is surprisingly quite a lot of searches for linux commands and their man pages. It would be great to scrape the following URL and turn this into a fathead. A random portion of the queries include (duplicates omitted):

linux memset 
linux mkfifo remove fifo 
linux mmap 
linux mmap example 
linux mmap no disk 
linux mmap ram 
linux mutex_lock 
linux nano_sleep 
linux nanosecond gpio 
linux nanosleep example 
linux ncurse 
linux ncurses ESCDELAY 
linux ncurses noecho 
linux ncurses wmove 
linux open 
linux open O_RDWR 
linux opendir 
linux opendir spaces 
linux os.rename system call 
linux perl Storable file 
linux popen 
linux print cwd from program 
linux process time 
linux program mktemp 
linux programmed sleep 
linux pthread 
linux read file arbitrary position 
linux read number from picture 
linux read syslog 
linux readconfig files 
linux readdir 
linux readline 

Should also trigger on unix, man, linux man, unix man

You'll need to know how to code in Perl, Python, Ruby, or JavaScript. fathead languages

What am I doing?

You will write a script that scrapes or downloads the data source below, and generates an output.txt file containing the parsed documentation. You can learn more about Fatheads and the output.txt syntax here.

Data source: http://man7.org/linux/man-pages/dir_all_alphabetic.html

Instant Answer Page: to be created by developer

What is the Goal?

As part of our Programming Mission, we're aiming to reach 100% Instant Answer (IA) coverage for searches related to programming languages by creating new Instant Answers, and improving existing ones.

Here are some Fathead examples:

See more related Instant Answers

Get Started

Resources

danidee10 commented 7 years ago

@pjhampton i would like to handle this.

pjhampton commented 7 years ago

Awesome! Thanks @danidee10 πŸ‘ Let me know if you need any help! πŸ˜„

pjhampton commented 7 years ago

@danidee10 It seems that there is already a fathead for this (Sorry!). But reach is fairly limited. I reckon you update the trigger words of that fathead for now, and in a couple of weeks time if the bounce data is low or there is a high click through rate we will change the source. What do you think?

https://github.com/duckduckgo/zeroclickinfo-fathead/tree/master/lib/fathead/unix_man https://duck.co/ia/view/unix_man

pjhampton commented 7 years ago

Hmm, weird. @edgesince84 reported this issue (https://github.com/duckduckgo/zeroclickinfo-fathead/issues/273) which is pretty serious. Maybe we will just deprecate that one if it's got these problems and not a lot of content. What do you think, @danidee10? πŸ˜„

danidee10 commented 7 years ago

@pjhampton The new source is easier to parse (all the commands are listed on the main page) Looking at the future this is going to be more maintainable, and it's kinda harder to get things wrong with the new source :smile:

On the Other hand adduser is not really a *nix command, it's just a perl script that still calls useradd at the backend. Currently on my Machine (ArchLinux) I don't have adduser. so the IA's answer kinda makes sense :smile:

I think we should stick with the old source if another issue aside #273 hasn't been reported, because the new source has nothing on adduser too.

pjhampton commented 7 years ago

Hey @danidee10,

So I'm confused by your first and third paragraph. Are you taking this forward or are we just updating the current IA?

danidee10 commented 7 years ago

My point is changing the source won't really do much good if it's just to address #273 because the new source doesn't have a section for the adduser command

And i assume that they both cover the same set of commands (I'm yet to do a thorough check yet). So the new source doesn't add more commands to the IA.

pjhampton commented 7 years ago

Hey @danidee10, How are you getting on? It's more about increasing the coverage and improving the content. The last fathead had one liner descriptions, causing people to click on SERP links instead of bouncing (seeing the IA and going). That is the purpose of this πŸ˜„ Sorry if I didn't make myself clear earlier.

danidee10 commented 7 years ago

@pjhampton i've created a new IA page https://duck.co/ia/view/manpages