Urban-Analytics-Technology-Platform / acbm

activity-based modelling pipeline (for transport demand models)
https://hackmd.io/w-m_OKaDT3GGBfSqFPpBjA
Apache License 2.0
4 stars 1 forks source link

SIC codes and workplace assignment #27

Open Hussein-Mahfouz opened 2 months ago

Hussein-Mahfouz commented 2 months ago

In the SPC, not all people with a job are assigned to a workplace. As a result, not all people with a job have a "commute" trip. The SIC codes could be very useful for assigning people to workplaces in our model - is there an issue with the SIC codes, and do they need editing?

Enriched Spenser <> TUS matching logic:

SPC commuting location assignment logic

Our approach to workplace assignment (TODO)

We can use SIC codes as done in spc, but have a fallback logic if SIC code does not exist

@sgreenbury could you please take a lot and edit if it doesn't make sense?

BZ-BowenZhang commented 1 month ago

In the current SPC dataset, the working-age population in West Yorkshire is 1,496,784, of which 597,873 people have the workplace assigned.

For West Yorkshire, the total employment recorded in the Business Register and Employment Survey (by MSOA) is 1,025,985, which should be the target number for assigned workplaces.

Possible reason for unmatching

I guess the main reason is when generating the 'job market', the proportion of the sic1d2007 cannot match the numbers in the Business Register and Employment Survey, which causes the part of jobs in each sector to be unmatched even though the overall job number is similar. I plotted the figure for the number of jobs in each sector in the Business Register and Employment Survey versus the number in SPC, which could prove this situation. I believe that could be the main reason for the unmatched workplaces.

Screenshot 2024-05-24 at 03 45 25

sgreenbury commented 1 month ago

Thanks @BZ-BowenZhang for the update on this, it's very helpful to see the distributions of the two datasets.

Hussein-Mahfouz commented 1 month ago

Notes from today's meeting:

sgreenbury commented 1 month ago

Thanks for adding this @Hussein-Mahfouz.

sgreenbury commented 1 month ago

Adding notes from discussion with @BZ-BowenZhang for options with increasing complexity:

sgreenbury commented 1 month ago

@Hussein-Mahfouz for reference

BZ-BowenZhang commented 6 hours ago

Update on 17th July:

The new SPC dataset without SIC code assigning has been tested, and the matching results are slightly improved:

Previous 597,873 assigned, 898,911 unassigned Now 656,296, assigned, 840,488 unassigned

There is still a gap between the current number and the target number from the Business Register and Employment Survey (1,025,985). The mismatches in the SIC code have not been resolved, so further checks of the matching process may be needed.

Picture 1