labordata / lm20

labor relations consultants
MIT License

extract data from forms #3

Open fgregg opened 2 years ago

todrobbins commented 2 years ago

Connected to this goal:

Come up with some kind of data schema for the relationships between consultancies and clients. I'd be happy to manually pull out this data in the interim and contribute it to the repo in CSV or JSON format. I guess it depends on how you would want to model the data.

Happy to discuss with you!

fgregg commented 2 years ago

Hi @todrobbins, we have that for many employers.

https://labordata.bunkum.us/lm20-cf1b077?sql=select+rptId%2C+companyName%2C+employer.empLabOrg+from+filer+inner+join+filing+using+%28srNum%29+inner+join+employer+using+%28rptId%29%3B

There are a number of filings where the employer has not been extracted yet.

https://labordata.bunkum.us/lm20-cf1b077?sql=select+*+from+filing+left+join+employer+using+%28rptId%29+where+employer.rptId+is+null%3B

So, there could be work in extracting the information from those.
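For anyone who finds the encoded links hard to read, the two URLs above can be decoded back into plain SQL. This is only a convenience sketch using the Python standard library; the helper name `extract_sql` is made up for illustration and is not part of the repo.

```python
from urllib.parse import urlsplit, parse_qs

# The two query links from this comment, copied verbatim.
URLS = [
    "https://labordata.bunkum.us/lm20-cf1b077?sql=select+rptId%2C+companyName%2C+employer.empLabOrg+from+filer+inner+join+filing+using+%28srNum%29+inner+join+employer+using+%28rptId%29%3B",
    "https://labordata.bunkum.us/lm20-cf1b077?sql=select+*+from+filing+left+join+employer+using+%28rptId%29+where+employer.rptId+is+null%3B",
]


def extract_sql(url):
    """Return the decoded value of the `sql` query parameter (hypothetical helper)."""
    query = urlsplit(url).query
    # parse_qs turns '+' into spaces and decodes %XX escapes.
    return parse_qs(query)["sql"][0]


queries = [extract_sql(u) for u in URLS]
for q in queries:
    print(q)
```

The first query joins `filer` to `filing` on `srNum` and then to `employer` on `rptId`; the second is a left join that keeps only filings with no matching `employer` row.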

ALSO, even where we have employers, there's a lot of variation in how the employer is represented. It would be very useful to have a lookup table to standardize the existing employer data so that you could easily find all contracts with a single employer.
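A rough sketch of what such a lookup table might look like, using an in-memory SQLite database. Everything here beyond the `employer` table and its `rptId` and `empLabOrg` columns is an assumption: the `employer_lookup` table, its column names, and the sample rows are invented for illustration, and treating `empLabOrg` as the raw employer name is a guess based on the query above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Stand-in for the real employer table; only two columns are modeled.
    create table employer (rptId integer, empLabOrg text);

    -- Hypothetical lookup table: one row per spelling seen in the data.
    create table employer_lookup (
        raw_name text primary key,
        canonical_name text not null
    );

    -- Made-up sample rows, not real filings.
    insert into employer values
        (1, 'Acme Widgets Inc.'),
        (2, 'ACME WIDGETS, INC'),
        (3, 'Example Foods LLC'),
        (4, 'Unmapped Co');

    insert into employer_lookup values
        ('Acme Widgets Inc.', 'Acme Widgets'),
        ('ACME WIDGETS, INC', 'Acme Widgets'),
        ('Example Foods LLC', 'Example Foods');
    """
)

# Left join so that names missing from the lookup fall back to the raw value.
rows = conn.execute(
    """
    select coalesce(l.canonical_name, e.empLabOrg) as employer_name,
           count(*) as filings
    from employer e
    left join employer_lookup l on l.raw_name = e.empLabOrg
    group by employer_name
    order by employer_name
    """
).fetchall()

for name, filings in rows:
    print(name, filings)
```

Keeping the lookup as a separate table (or a CSV contributed to the repo) leaves the extracted data untouched, and the left join means unmapped spellings still show up so they can be added later.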