mkosowsk / ETHPrize-Data-Analysis

1 stars 1 forks source link

First Pass on Data Analysis for ETHPrize Interviews #1

Open mkosowsk opened 6 years ago

mkosowsk commented 6 years ago

Note: this issue is reserved for the excellent @jlas :)

User Story

As a PM managing ETHPrize bounties I want to perform data analysis on the current 82 (and counting!) developer interviews So that I can know with confidence the biggest developer pain points to focus efforts on the right areas for the next bounties.

Background

ETHPrize is a community research and bounty project that identifies the most critical issues facing the Ethereum ecosystem and organizes bounties to solve them. The current two bounties are $250,000 USD each and cover an Ethereum Package Manager and an Open Source Block Explorer.

82 (and counting!) interviews have been conducted in the space to determine where to focus efforts next. It would be great to perform data analysis on these interviews to help focus efforts.

Acceptance Criteria

Technical Details

Per @jlas:

Two approaches I am familiar with are analyzing n-grams and the LDA generative model.

With analyzing n-grams the idea is to parse the corpus of text and find which groups of words are most common. For example for a 2-gram analysis you would parse the text into groups of 2 words that appear side by side in the text and just count up each of these pairs.

The LDA model is a really cool graphical model which stipulates that words in a corpus of text are "generated" from a statistical distribution across different "topics". There is a cool library called gensim in python which you can use to load up a corpus and spit out these topics. The interesting things to see is which topics are prevalent, and for each document (I guess a single person's interview text would be a "document" here) which topics it most represents.

gitcoinbot commented 6 years ago

Issue Status: 1. Open 2. Started 3. Submitted 4. Done


This issue now has a funding of 0.15 ETH (84.18 USD @ $561.23/ETH) attached to it.

vs77bb commented 6 years ago

@oogetyboogety Just to confirm with @mkosowsk, looks like this one may be reserved for @jlas. Is that right, Mitch?

However, if you do decide to work this bounty and make a meaningful submission, Gitcoin Core will absolutely tip you for the .15ETH.

oogetyboogety commented 6 years ago

Thanks @vs77bb, I just saw that it's for @jlas. Definitely interested, once @jlas is completed, I may be able to help visualize the results. I'll let him get started first though and look for a couple other issues I can try

mkosowsk commented 6 years ago

@vs77bb yep, that's correct I'm currently working closely with @jlas on this one :)

however, @oogetyboogety let's keep in touch! will ping you after initial results with @jlas

vs77bb commented 6 years ago

@jlas might snagging this one on Gitcoin by clicking 'Start Work'?

nanspro commented 6 years ago

@mkosowsk is this issue still reserved, if it is not i will start working on it.

mkosowsk commented 6 years ago

@nanspro thanks for your interest but goalposts shifted on this one a bit so cancelled the bounty :)

gitcoinbot commented 6 years ago

Issue Status: 1. Open 2. Cancelled


The funding of 0.15 ETH (69.8 USD @ $465.36/ETH) attached to this issue has been cancelled by the bounty submitter