chaoss / augur

Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/ and learn more about Augur at our website https://augurlabs.io
https://oss-augur.readthedocs.io/en/main/
MIT License
586 stars 846 forks source link

GSoC Idea: Machine Learning based Community Health and Communication #1637

Closed sgoggins closed 2 years ago

sgoggins commented 2 years ago

Ideas for Google Summer of Code projects

Interested in working with CHAOSS? Below are some project ideas. We describe how to apply to work with CHAOSS and how we select students on a different page: https://github.com/chaoss/community/blob/master/GSoC-interest.md

Idea: Advancing Risk Prediction With Machine Learning in Augur

Currently Augur uses computational linguistics, dependency mapping, license scanning, topic modeling, social network analysis, and algorithms that target temporal changes in CHAOSS metrics. The aim of this project is to advance the accessibility of these insights through the development of python based API endpoints that deliver visualizations of machine learning outputs, similar to the style found in https://github.com/chaoss/augur/augur/routes/pull_request_reports and https://github.com/chaoss/augur/augur/routes/contributor_reports

This work could include optimization and refinement of machine learning workers found under https://github.com/chaoss/augur/workers to generate additional, or reporting optimized data, as well as the extension of Augur's new front end at https://github.com/augurlabs/augur_view, which is based on twitter/bootstrap and flask.

The aims of the project are as follows:

The aims will require working in a programming language to automate the task, use API to generate the graphs, and use some Graphic editor to prepare the pdf.

Augur is more advanced in its machine learning than basic recommender systems, using TF-IDF, Boosting, Latent Dirichlet Analysis of sequenced conversations, and clustering algorithms that look for similar topics across repositories.

The Github Workers that use machine learning are visible in our Workers directory here: . We use our advanced contributor worker to manage identities, and the existing discourse_analysis, message_insights, clustering, insight, and pull_request_worker_analysis workers. Documentation about these existing workers can be found here: https://oss-augur.readthedocs.io/en/dev/development-guide/workers/toc.html

Augur features machine learning workers are now an active part of Augur's growing ecosystem.

The aims will require generating code in Python for FLASK and the GraphQL API, and for the web app, which has now advanced from the Vue.js and Vuetify ecosystems to more robust and sustained projects like Twitter/Bootstrap.

Microtasks

For becoming familiar with Augur, you can start by reading some documentation. You can find useful information at in the links, below.

Once you're familiar with Augur, you can have a look at the following microtasks.

rohanreddych commented 2 years ago

Hello Sean and CHAOSS community, my name is Rohan. I am interested in this project, will work on Augur issues and submit a draft proposal for review.

sgoggins commented 2 years ago

Thanks, @rohanreddych ! You can also participate in the CHAOSS Community slack if you have immediate questions: https://join.slack.com/t/chaoss-workspace/shared_invite/zt-r65szij9-QajX59hkZUct82b0uACA6g

purna135 commented 2 years ago

Hello, @sgoggins! I just found that some of the links you provided are broken. Could you suggest me where I can get these links?

https://github.com/chaoss/augur/augur/routes/pull_request_reports https://github.com/chaoss/augur/augur/routes/contributor_reports https://github.com/chaoss/augur/workers

purna135 commented 2 years ago

I think the correct link should be https://github.com/chaoss/augur/blob/main/augur/routes/pull_request_reports.py https://github.com/chaoss/augur/blob/main/augur/routes/contributor_reports.py

miguelvalderrama commented 2 years ago

Hello @sgoggins, my name is Miguel. I would contribute to this project and propose an idea for review, regards.

ErikaJZhou commented 2 years ago

CS4320 Team Terrapins decided to work with this issue.

AndroGari commented 2 years ago

I'm Garima and I would like to work on this project idea, will work on Augur issues and work in the project.