hackforla / website

Hack for LA's website
https://www.hackforla.org
GNU General Public License v2.0
317 stars 753 forks source link

Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights #4152

Open kimberlytanyh opened 1 year ago

kimberlytanyh commented 1 year ago

Dependency

Overview

We need to collect data on the authors of all the prework issues in our repository to perform data analysis.

Action Items

Might be separated into another issue

Resources/Instructions

  1. GitHub API Documentation
  2. GitHub Rate Limiting
  3. Link to GitHub Data Analysis Folder
  4. Spreadsheet with accurate numbers as of 03/26/2023
  5. Link to process documentation
  6. Using Google Sheets API to add and refresh dataframe in Python to Google Sheets: https://www.youtube.com/watch?v=sVURhxyc6jE https://medium.com/@jb.ranchana/write-and-append-dataframes-to-google-sheets-in-python-f62479460cf0 https://www.youtube.com/watch?v=3wC-SCdJK2c
  7. Slides documentation process from Python to GitHub
kimberlytanyh commented 1 year ago

After finish drafting this issue, add the label "Ready for Product".

ExperimentsInHonesty commented 1 year ago

@kimberlytanyh Add a step to add data to a google sheet on the Team Google Drive. Add a link to the folder it will go in, under the resources section.

kimberlytanyh commented 1 year ago

Weekly Update:

  1. Progress: Retrieved data and calculated count of issues per complexity label. Left with converting rows to columns so that we can see the distribution in one row per assignee, and completing documentation.
  2. Blockers: None
  3. Availability: Mon - Fri, 12:00-5:00PM
  4. ETA: ~21 hours. 1-3 hours for remaining deliverables.
kimberlytanyh commented 1 year ago

Weekly Update:

  1. Progress: Adjusted data cleaning method and calculated count of issues per complexity label. Exported dataset as csv and uploaded to the drive. Manually checked accuracy of data. Working on data analysis now.
  2. Blockers: None
  3. Availability: Thurs-Saturday, Anytime
    1. ETA: ~17 hours
ExperimentsInHonesty commented 1 year ago

@kimberlytanyh we are in the process of changing the labels on issues currently labeled Complexity: Good second issue to good first issue

Why?

What you need to know

kimberlytanyh commented 1 year ago

@ExperimentsInHonesty Thank you for the heads up! I will adjust my code for the next round of analysis rerun accordingly.

kimberlytanyh commented 1 year ago

Weekly Update:

Progress: Identified means for identifying pull requests in retrieved issues through GitHub API. Will re-perform all analyses done and try to improve accuracy of datasets. Blockers: None Availability: Saturday ETA: ~6 hours

kimberlytanyh commented 1 year ago

@ExperimentsInHonesty As discussed in the Sunday Team Meeting, below are the labels to be added to prework/tracking issues for better data analysis:

Team Member Progression

kimberlytanyh commented 1 year ago

Weekly Update:

Progress: Looked into data pipeline options to automate data updates for visualizations in Looker. Blockers: Discussing preferred approach Availability: Friday-Sunday this week ETA: 6-10 hours

kimberlytanyh commented 1 year ago

Weekly Update:

Progress: Working on Streamlining Data Cleaning Code in Jupyter Notebook and adding in automation components. Going to try using Google Sheet API to create data source for Looker Dashboard. Blockers: Automating and scheduling notebook to run automatically. Deciding on best data source for Looker (in the midst of scheduling a working session with Chelsey, Karina, and Sophie).

Availability: Mon, Friday-Sunday next week ETA: 6-10 hours

kimberlytanyh commented 1 year ago

Weekly Update:

Progress: Created repository to establish automation of running Python Data Cleaning script using GitHub Actions with Sophia, Chelsey, and Karina. Next step is to clean up existing code for automation and data accuracy, try Google Sheets API and establishing data source for Looker.

Concepts/ tools used for setting up daily running of Python code cleaning script automatically (in case want to set up wiki in the future):

Blockers:

Availability: Weekend and Mon-Fri next week, 12PM -7PM ETA: 15+ hours

github-actions[bot] commented 1 year ago

@kimberlytanyh

Please add update using the below template (even if you have a pull request). Afterwards, remove the 'To Update !' label and add the 'Status: Updated' label.

  1. Progress: "What is the current status of your project? What have you completed and what is left to do?"
  2. Blockers: "Difficulties or errors encountered."
  3. Availability: "How much time will you have this week to work on this issue?"
  4. ETA: "When do you expect this issue to be completed?"
  5. Pictures (optional): "Add any pictures of the visual changes made to the site so far."

If you need help, be sure to either: 1) place your issue in the developer meeting discussion column and ask for help at your next meeting, 2) put a "Status: Help Wanted" label on your issue and pull request, or 3) put up a request for assistance on the #hfla-site channel. Please note that including your questions in the issue comments- along with screenshots, if applicable- will help us to help you. Here and here are examples of well-formed questions.

You are receiving this comment because your last comment was before Tuesday, May 30, 2023 at 12:15 AM PST.

github-actions[bot] commented 1 year ago

@kimberlytanyh

Please add update using the below template (even if you have a pull request). Afterwards, remove the '2 weeks inactive' label and add the 'Status: Updated' label.

  1. Progress: "What is the current status of your project? What have you completed and what is left to do?"
  2. Blockers: "Difficulties or errors encountered."
  3. Availability: "How much time will you have this week to work on this issue?"
  4. ETA: "When do you expect this issue to be completed?"
  5. Pictures (optional): "Add any pictures of the visual changes made to the site so far."

If you need help, be sure to either: 1) place your issue in the developer meeting discussion column and ask for help at your next meeting, 2) put a "Status: Help Wanted" label on your issue and pull request, or 3) put up a request for assistance on the #hfla-site channel. Please note that including your questions in the issue comments- along with screenshots, if applicable- will help us to help you. Here and here are examples of well-formed questions.

You are receiving this comment because your last comment was before Tuesday, June 6, 2023 at 12:16 AM PST.

kimberlytanyh commented 1 year ago

Progress: In the process of changing one more section of the code for automation and double checking accuracy of data after cleaning (need to improve accuracy of crediting the right amount of small issues for agenda issues that have multiple assignees). Next step is to add the Python script for automation and clean and create dataset for the live dashboard on number of issues available.

Blockers: None yet. Availability: 6-8 hours ETA: A few more weeks since it is an evolving and ongoing issue.

github-actions[bot] commented 1 year ago

@kimberlytanyh

Please add update using the below template (even if you have a pull request). Afterwards, remove the '2 weeks inactive' label and add the 'Status: Updated' label.

  1. Progress: "What is the current status of your project? What have you completed and what is left to do?"
  2. Blockers: "Difficulties or errors encountered."
  3. Availability: "How much time will you have this week to work on this issue?"
  4. ETA: "When do you expect this issue to be completed?"
  5. Pictures (optional): "Add any pictures of the visual changes made to the site so far."

If you need help, be sure to either: 1) place your issue in the developer meeting discussion column and ask for help at your next meeting, 2) put a "Status: Help Wanted" label on your issue and pull request, or 3) put up a request for assistance on the #hfla-site channel. Please note that including your questions in the issue comments- along with screenshots, if applicable- will help us to help you. Here and here are examples of well-formed questions.

You are receiving this comment because your last comment was before Tuesday, June 27, 2023 at 12:17 AM PST.

kimberlytanyh commented 1 year ago

Progress: Completed documentation of process for live issue availability dashboard (for GitHub class). Left to do: Edit Python script to add in data from other columns, add it to repository for automation, and finish creating dashboard. Blockers: None yet. Might have to consult Data Science COP about auto running automation script. Availability: 21 hours next week Mon-Fri. ETA: By next week or two.

mayankt153 commented 6 hours ago