hackforla / data-science

The Hack For LA Data Science team is a Community of Practice within the LA brigade seeking to make analytical and machine learning services available to local communities and organizations.
28 stars 17 forks source link

Create 311 data CSV files that can be accessed through a Jupyter notebook #177

Open akhaleghi opened 1 year ago

akhaleghi commented 1 year ago

Overview

We want to download 311 data and split by year, then month, so each is under 100MB and we can host tan append-only data warehouse on GitHub.

Action Items

Resources/Instructions

Cleaning Rules: https://github.com/hackforla/data-science/blob/main/311-data/CSV_files/Docs/CleaningRules.txt City Data:: https://data.lacity.org/browse?q=311%20data%20%2C%202024&sortBy=relevance (Please update the filter for the year 2024 based on the requirements.) Website (ghpages): https://hackforla.github.io/311-data-jupyter-notebooks/lab (navigate to folder : 311_Data_CleaningScript)

ExperimentsInHonesty commented 1 year ago

https://www.google.com/search?q=jupyter+notebook+ghpages&oq=jupyter+notebook+ghpages&aqs=chrome..69i57j0i22i30j0i390i650l3j69i60.9524j0j15&sourceid=chrome&ie=UTF-8#ip=1

ExperimentsInHonesty commented 1 year ago

I made this repo for @chelseybeck to see if its feasable to use Jupyter Notebook with ghpages https://github.com/hackforla/jupyter-ghpages-test

I am going to create another repo for the 311 data to go into

ExperimentsInHonesty commented 1 year ago

https://discourse.jupyter.org/t/run-jupyter-notebooks-on-github-with-reporting-to-a-static-website/14982

mru-hub commented 3 months ago

Outline of Data Cleaning Steps Data Cleaning was essential to prepare the 311 service request data for analysis. The following steps were undertaken:

1. Removing Duplicates

9. Saving Cleaned Data

salice commented 2 months ago

@bonniewolfe: @mru-hub is asking for clarification on this issue. Do we have a github page already for Hack for LA? Should she create a new page or add her work here https://github.com/hackforla/311-data-jupyter-notebooks? Also she mentioned "We have one for our organization which is created by Bonnie. Also the project page in above URL has '311-data', so i think we have one project page for our repository too. If this is true I have to use the same URL for current ghpage purpose."

ExperimentsInHonesty commented 1 month ago

I answered this in the data science meeting on 2024-09-16. Basically, the repository is the work for this issue, but it needs updated data files.

mru-hub commented 1 month ago

Started working on ghpages. Website: https://hackforla.github.io/311-data-jupyter-notebooks/lab (navigate to folder: 311_Data_CleaningScript). I've made some initial updates to the script and will continue working on integrating it for the ghpages.