Closed aletzdy closed 1 year ago
Apologies for diving in but not attending the call. I think this is an interesting issue, but do not fully understand from the above description. A few questions:
Employees that work from home exclusively are often not represented in the employment data. By work from home
, do you mean episodic telecommuting in which the worker does not commute on the simulation day? Or those that work exclusively from home? Is it assumed that the input employment data represents jobs held by those that work from home
exclusively? If so, it is assumed that the work location for these workers is their home location?
What is the context in which this scaling occurs? I assume it is the usual work location model. Is that right?
Can you talk a little bit more about why this is necessary? Provided employment > number of workers, why does it matter if there is excess employment when running a usual work location model (if that is the context)? Is this for cosmetic purposes, i.e., to have workers and jobs match exactly? Or to improve the spatial match?
Where would these scaling factors come from? Presumably the factors for in-commuters would be higher at the boundaries of the region. Would the user be given guidance as to how to derive these? Depending on the definition of work from home
and employment
, it may be very challenging to develop those scaling factors, with important implications (i.e., applying uniform reductions for work from home across the region could give strange model results when work from home shares are large).
@DavidOry no need to apologize for not being on a call, we're trying to get better about collaborating asynchronously and not needing all our collaborators to join meetings constantly.
I do think the WFH concerns here are not that big a deal; there might be spatial variation in WFH but it's probably mostly explainable from modeled factors like income. I think the in-commute is more the target motivation for this, especially for regions that for political reasons have been stuck with a model region that includes significant population or employment just outside the model boundary.
I really like David Ory's number 4 point which is linked to point 1.
My thoughts and opinion, which is a model user perspective, not a developer -
I started to type my preferred way forward, but I don't think I have a good proposal yet - without a discussion and a back and forth plan development
It may be useful to separate the issue of in-commuters from episodic telecommuters (which I think is what you both mean by WFH). It may be that a single solution is best for both problems, but I think it would be best to start by assuming this is not the case rather than assuming it is.
Re: @jpn-- , spatial patterns of work from home. I think this depends on (a) your definition of work from home (specifically whether or not people who work from home are consuming a job in the socioeconomic data that is not located at their home, which given @bettinardi's response I assume the answer is yes) and (b) the context of the scenario (5 percent telecommuting on the simulation day versus 50 percent). Removing 50 percent of your jobs uniformly over space, even if done by industry, is almost certainly wrong in the long term, as it assumes there is exactly no response from the commercial real estate market to a large decline in demand.
If I could just echo your last 5% vs 50% comment - this is where I was starting to get stuck too, and then ultimately couldn't suggest a best path forward without kicking it around with a group of smart people, like this thread. But I don't think we will efficiently solve it in issue comments.
Thought it would be useful to clarify a few things:
Because work from home is a modeled choice, we could implement some version of Alex's suggestion (3 above) - summarize the shares of workers who work from home, make the assumption that the workers who work from home should be reduced in the same county or district as the workers home location, and apply that percentage reduction. It would be more challenging to this by sector in an automated fashion, since a) the employment sectors between implementations, and b) employment sectors are not necessarily known for the workers or mappable between worker industry\occupation and employment data. But it is also likely that very few workers who work in the retail sector, for example, work from home - so there's an argument for allowing the user to specify these factors by employment type. Just not sure if it is worth the trade-off. Also doing this in an automated fashion implies that the model system must have a work-from-home choice model, so it breaks the modularity of the model system. Not necessarily a deal-breaker, but a consideration.
I guess after this long-winded reply, I think keeping everything simple and just specifying one factor by TAZ to apply uniformly to all employment (and size terms) in the work location choice model to account for in-commuting and/or WFH shares is my preference.
Thanks @jfdman for clarifying the work from home definition. Building on this, we can then define employment in the socioeconomic data as: a count of people working outside the home by work location zone on a typical weekday. Is that right? If so, giving the user the ability to modify this input in response to a large increase in work from home share makes me nervous. I think many regional planning efforts are going to explore 20 or 30 percent work from home shares (or even higher). This will have a meaningful impact on the commercial real estate market that will have important transportation outcomes (e.g., uniform reductions will decrease transit riderships; land value-based reductions may increase transit ridership). Even with the caveats you note ("at their discretion", "careful"), I think providing this functionality is unwise. In my view, this work should be done carefully and thoughtfully outside the travel model.
Thanks everyone for contributing to this very relevant WFH topic within the context of our post-pandemic regional activity-based travel demand forecasting models, including ActivitySim of course. I would also simply add the definitional concept of a "homeworker", as characterized by the Bureau of Labor Statistics. Speaking of BLS, we're starting to see some preliminary results from the recent American Time Use Survey regarding WFH, see https://www.bls.gov/news.release/atus.nr0.htm
@DavidOry - Ideally, employment data would be absent of workers who work from home. Particularly post-pandemic employment data, where the work from home shares are much higher than they were in 2019. However, this isn't always the case. Regardless, it is still necessary to reduce the work location choice size terms and constraint targets by non-resident workers, because they consume jobs but they do not consume activity opportunities for resident non-work travel. Therefore there still needs to be an internal mechanism for making this happen. My preference at this time is to keep this mechanism simple - allow the user to specify a percentage of external workers for each TAZ and apply this percentage to the calculated size term and the total employment. I believe this is the procedure currently implemented in DaySim. A user would always be able to implement a more sophisticated (and complicated) approach where they specify two sets of employment data - one set of 'reduced' employment data that could be used for work location choice and one 'full' set for other models, which would not require code changes.
Based on Joel's description above and on the call on 10-13-22, I agree with the approach he is bringing. Joel, on the side, I would be very interested to see an example input set for how the user would interface with this feature - maybe it's the DaySim example.
@jfdman: agree that doing this for non-resident workers makes good sense. My concerns are limited to extending this idea to account for those working from home.
I do think it would benefit the broader ActivitySim project to be explicit about the definition of employment
. Even if obtaining the data is a challenge, the internal consistency of the model requires some definition.
Closing this issue, addressed via https://github.com/ActivitySim/activitysim/pull/613
We want to build a functionality in ActivitySim to factor the input employment data down by a set of scaling factors that accounts for workers who work-from-home and workers who in-commute, with the scaling factors varying by zone/district. There are multiple ways this can be achieved:
This approach is faster to implement, but less flexible: We can factor the total size terms (or the total employment in the simulation-based method), by the scaling factors to correct for in-commuting, etc. However, this approach does not have the flexibility of applying the scaling factors only to certain types of employments (in case we do not want, for instance, retail jobs to be changed).
This approach requires minimal code change, but requires a bit more work on the side of the model developer: we may define the scaled employment columns in the land use preprocessor file, then modify the size term calculation UECs to use the scaled employment columns in the workplace location model. All the other models would be unaffected.
This approach requires more code change, but little work on the side of the model developer/user: The user sets the employment columns that should be scaled in the yaml file, and ActivitySim does the calculations itself.