hackforla / data-science

The Hack For LA Data Science team is a Community of Practice within the LA brigade seeking to make analytical and machine learning services available to local communities and organizations.
26 stars 16 forks source link

CoP: Data Science: City of Los Angeles Evictions #179

Open akhaleghi opened 1 year ago

akhaleghi commented 1 year ago

Prerequisite(s)

If you would like to work on this issue, please add a comment below and include the following information:

For example:

Once you have done this, please add yourself to the “Assignees” section on the right and update the issue weekly to document your progress.

Overview

We want to analyze eviction data for the city of Los Angeles, and incorporate data from other sources, to determine whether there are actions local leaders can take to address the problem. The following background information is from the LA Controller's website:

Action Items

Phase 1

Resources/Instructions

Feb 2023 - July 2023 eviction data csv file Check #178 for updates on whether a real time source for this data have been found

chelseybeck commented 1 year ago

perhaps another reason for the influx? would be interesting to explore https://www.wired.com/story/generative-ai-courts-law-justice/

JANEDIOKPO commented 1 year ago

Jane Diokpo I can commit to working on this issue 2 hours in the following week. Yes, I will provide an update on my progress with a comment below.

akhaleghi commented 1 year ago

@JANEDIOKPO Thanks for volunteering, so the first steps would be to investigate what other sources are available to obtain eviction data as what has been found is only a subset (2023 data only) and then perform EDA on the data set.

JANEDIOKPO commented 1 year ago

@JANEDIOKPO Thanks for volunteering, so the first steps would be to investigate what other sources are available to obtain eviction data as what has been found is only a subset (2023 data only) and then perform EDA on the data set.

@akhaleghi akhaleghi Hi, I'm completely new to data science and trying to learn. I'd appreciate it if you could send some resources on how to do an EDA or find sources.

akhaleghi commented 11 months ago

Hi @JANEDIOKPO I'm going to move this back to the backlog because there hasn't been any activity on the issue. Let me know if you'd like to work on it.

pranjaliseth commented 7 months ago
pranjaliseth commented 6 months ago

I have gathered the data set, analyzed and tried experimenting with a few EDA cleaning tasks

pranjaliseth commented 6 months ago

I worked for 6 hours the last week, here's the update -

The data set in itself is very less informative and it is hard to find any trends with the given variables against the target variable. I have therefore researched on other data sets on the LA Controller’s website to find the metadata or any supporting data that can be clubbed with the current dataset to find more concrete relationship with the target variable. I went through the following day sets - LA Homelessness expense tracker, LA Payroll Employee Residence Analysis, Cash for Keys and Affordable Housing Covenants. Out of these, the Cash for keys contains the information regarding the owners paying the residents to leave, which correlates to the owners dissatisfaction. If an area has a high dissatisfaction, it would be that people fail to pay rent or adhere to the society guidelines. The addresses in the dataset can be converted to zip codes using GeoPy and we can find the average buyout for that year and place and therefore find some relation with the eviction notices. I also found fair market rent(FMR) for Los Angeles from a different website (https://www.laalmanac.com/economy/ec40b.php ) along with looking for household income dataset for the LA County as well. Currently, need to discuss with the team, how to go ahead with the issue and whether to involve any other data sets with better variables or not.

pranjaliseth commented 5 months ago

I read several articles on the Los Angeles Evictions rules and laws before, during and post Covid-19 pandemic to get insights about the background information. Collected the Fair Market Rent(FMR) by zip codes data set and estimated population by zip codes data for the LA County. Merged the relevant datasets to the original data to find dependencies and trends between the datasets. Currently working with the population dataset to find useful insights on the eviction cases and intensity.

noelthomas28 commented 4 months ago
pranjaliseth commented 4 months ago

So far, I have gathered meta data (FMR, Population) to add to the original eviction data set. I have merged the metadata and did its exploratory data analysis along with data cleaning. I am working on running couple models on the current data to observe the trends.

rahul897 commented 3 months ago

Rahul Iragavarapu I can commit to working on this issue 2 hours in the following week. I will provide an update on my progress with a comment below.

pranjaliseth commented 3 months ago

Provided all findings in the CoP meeting today. I will next be working on documentation for the issue checklist.

pranjaliseth commented 3 months ago

Working on the documentation. Completed some part of it so far. Also will be working on creating the presentation soon.

pranjaliseth commented 1 month ago

Uploading the google drive links for data sets and code

pranjaliseth commented 1 month ago

Google drive link - https://drive.google.com/drive/folders/1-yiJ-ZcC20wOlikNzG1zCn8vsX2H-VNt

pranjaliseth commented 1 month ago

Data and Metadata Sources - https://www.laalmanac.com/employment/em12c.php https://www.laalmanac.com/population/po24la_zip.php https://gist.github.com/erichurst/7882666