BonnyCI / mateys-ahoy

Individual contributor analysis (see shuffleboard)
Apache License 2.0
0 stars 2 forks source link

Automate web searching by name and email address #9

Open missaugustina opened 7 years ago

missaugustina commented 7 years ago

Explore options for automatically building profiles.

missaugustina commented 7 years ago

Comment from @jimspohrer:

For example - consider this JSON record of a contributor, and the important fields to help with a LinkedIn search:

"name": "James Wexler", "company": "Google",

{ "login": "jameswex", "id": 15835086, "avatar_url": "https://avatars2.githubusercontent.com/u/15835086?v=3", "gravatar_id": "", "url": "https://api.github.com/users/jameswex", "html_url": "https://github.com/jameswex", "followers_url": "https://api.github.com/users/jameswex/followers", "following_url": "https://api.github.com/users/jameswex/following{/other_user}", "gists_url": "https://api.github.com/users/jameswex/gists{/gist_id}", "starred_url": "https://api.github.com/users/jameswex/starred{/owner}{/repo}", "subscriptions_url": "https://api.github.com/users/jameswex/subscriptions", "organizations_url": "https://api.github.com/users/jameswex/orgs", "repos_url": "https://api.github.com/users/jameswex/repos", "events_url": "https://api.github.com/users/jameswex/events{/privacy}", "received_events_url": "https://api.github.com/users/jameswex/received_events", "type": "User", "site_admin": false, "name": "James Wexler", "company": "Google", "blog": "", "location": null, "email": null, "hireable": null, "bio": null, "public_repos": 0, "public_gists": 2, "followers": 1, "following": 0, "created_at": "2015-11-13T15:17:57Z", "updated_at": "2016-12-08T03:27:48Z" }

(https://github.ibm.com/Augustina-Ragwitz/github-hubbub/issues/6#issuecomment-2444324)

missaugustina commented 7 years ago

Setting up an account with this service to see if it can streamline this for us: https://pipl.com/api/

missaugustina commented 7 years ago

I looked at DuckDuckGo (instead of Google) because it's easier to automate/scrape/etc using DuckDuckGo. There are some overhead challenges but it could be useful for exploration or one-off. I think for DuckDuckGo, we should write a script that takes query terms and then runs it on a huge list of input. The goal would be to see how well we matched and for now would mostly likely be a manual verification.

missaugustina commented 7 years ago

LinkedIn API requires some weird process to gain access. I need to look into it further, my initial application was denied.