Open 1ec5 opened 4 months ago
> These seem to be estimates based on some old maps, but the placeholder month and day leave me a bit uncertain about that.
By the way, #47 has an idea for dating streets with more certainty back to 1992. But if we want to stick with this outdated county dataset, we should replace the `start_date=*` and `start_date:source=arbitrary` tags with a more pessimistic `start_date=*` and `start_date:edtf=*` based on the `date_creat` field. Then mappers can selectively work their way backward through time, with the ability to choose between this import or a different source. (For example, it should be possible to source state-maintained highways more rigorously without relying on this import.)
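To make the proposal concrete, here is a minimal sketch of how the more pessimistic tags could be derived. It assumes `date_creat` arrives as an ISO 8601 date string (the actual field format in the dataset may differ) and uses the EDTF set notation `[..YYYY-MM-DD]`, which reads as "this date or some earlier date". The function name `pessimistic_start_tags` is hypothetical.

```python
from datetime import date


def pessimistic_start_tags(date_creat: str) -> dict[str, str]:
    """Build start-date tags from the dataset's date_creat value.

    Assumption: date_creat is an ISO 8601 date string such as "2006-03-14".
    """
    created = date.fromisoformat(date_creat)  # raises ValueError on malformed input
    iso = created.isoformat()
    return {
        # Latest date by which the street is known to have existed.
        "start_date": iso,
        # EDTF set notation: "this date or some earlier date".
        "start_date:edtf": f"[..{iso}]",
    }
```

A mapper with a better source could later narrow the EDTF range or replace both tags outright.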
@jeffreyameyer do you remember how the start dates came about? Were the years for real but with placeholder months and days?
Ok - clearly, I've left some incomplete work - my apologies! But, I do think things can be cleaned up quickly. Please see notes / comments below.
The years were largely set by choosing an arbitrary (sorry!) old year, then slowly comparing against old maps and adjusting backward as the maps got older. Roads that stopped showing up as you went back in time didn't get older years; those that did show up continued to get older years. This is not a foolproof method, but it is directionally useful, and having edtf tags is indeed a better solution than the "arbitrary" markings.
> - Delete `group=t`.

This was a personal tag used for grouping related items I was working through, as well as filtering them out while editing. Also deleted: `group=b`.

> - Convert the `fcc` field to `highway=*` tags, for the benefit of future imports.
> - Set `name=*` based on the `streetpref`, `streetsuff`, and `streettype` fields.

Many of these fields were set by just changing the all-caps field name in the source data to lowercase in OSM and would have benefited from some pre-processing in QGIS.

> - Replace `oneway=ft` and `oneway=tf` with `oneway=yes` and `oneway=-1`, respectively. (Better yet, delete `oneway=tf` and reverse those ways.)

And then tag with `oneway=yes`?

> - Delete `roadlabel=*`, which is redundant to `name=*` but less polished.
> - Delete `streetname=*`, `streetpref=*`, `streetsuff=*`, and `streettype=*`. (Alternatively, propose a more structured tagging scheme for street names that isn’t specific to this dataset.)
> - Replace `surface=asphalt` with `surface=paved`. The `surface` field’s `PAV` value doesn’t specify the kind of pavement, and I don’t think we’d be able to track minute changes in pavement material over time without massive effort.

> (Better yet, delete `oneway=tf` and reverse those ways.) and then tag with `oneway=yes`?
Yes, both the `TF` and `FT` values appear to indicate one-way streets. The dataset represents a two-way street by setting the field to null.
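As a rough illustration of the one-way handling discussed here, the sketch below maps the source value onto OHM tagging. It assumes `FT` means travel follows the way's drawn direction and `TF` means travel runs against it (consistent with the suggested `oneway=yes` and `oneway=-1` replacements), and it represents a way simply as a list of node IDs. The function name `normalize_oneway` is hypothetical.

```python
from typing import Optional


def normalize_oneway(
    value: Optional[str], node_ids: list[int]
) -> tuple[dict[str, str], list[int]]:
    """Return (tags, node_ids) for a way, given the source oneway value."""
    if value is None or value.strip() == "":
        # Null in the dataset means a two-way street: no oneway tag.
        return {}, list(node_ids)
    code = value.strip().upper()
    if code == "FT":
        # Assumed: travel follows the drawn direction.
        return {"oneway": "yes"}, list(node_ids)
    if code == "TF":
        # Assumed: travel runs against the drawn direction, so reverse
        # the way and tag it oneway=yes instead of oneway=-1.
        return {"oneway": "yes"}, list(reversed(node_ids))
    raise ValueError(f"unexpected oneway value: {value!r}")
```

Reversing the node order keeps every one-way street tagged the same way, so no consumer needs to handle `oneway=-1`.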
In 2020, Stanford Libraries republished a public domain dataset of streets throughout Santa Clara County that the Santa Clara County Planning Office used to publish on its open data portal.[^centerlines] Last August, @jeffreyameyer imported an extract of this dataset into OHM, 1,886 features in all, ahead of a presentation at Stanford. The import covers the Stanford campus, downtown Mountain View, and some major streets in that part of the San Francisco Peninsula. This issue tracks cleaning up the import to follow OHM norms.
The dataset has a `date_creat` field, but this only indicates when the feature was added to the database in ArcGIS, generally between 2004 and 2008. By contrast, the import tagged every street as if it started on March 1 in various years in the 19th and 20th centuries.[^leap] These seem to be estimates based on some old maps, but the placeholder month and day leave me a bit uncertain about that.

Aside from dates, most of the other attributes need to be cleaned up. For example, on this stretch of San Antonio Road:
- Delete `group=t`. I have no idea what it means,[^metadata] but it doesn’t appear in the dataset and isn’t an established OHM tag.
- Convert the `fcc` field to `highway=*` tags, for the benefit of future imports.
- Set `name=*` based on the `streetpref`, `streetsuff`, and `streettype` fields.
- Replace `oneway=ft` and `oneway=tf` with `oneway=yes` and `oneway=-1`, respectively. (Better yet, delete `oneway=tf` and reverse those ways.)
- Delete `roadlabel=*`, which is redundant to `name=*` but less polished.
- Delete `streetname=*`, `streetpref=*`, `streetsuff=*`, and `streettype=*`. (Alternatively, propose a more structured tagging scheme for street names that isn’t specific to this dataset.)
- Replace `surface=asphalt` with `surface=paved`. The `surface` field’s `PAV` value doesn’t specify the kind of pavement, and I don’t think we’d be able to track minute changes in pavement material over time without massive effort.

[^leap]: 1924 was not a leap year, so every software package in our stack interprets `start_date=1924-02-29` as March 1, 1924.

[^centerlines]: This dataset has been superseded by a continuously updated Road Centerlines dataset, also in the public domain.

[^metadata]: Unfortunately, the original dataset is no longer available online, and although it came with an FGDC metadata file, this file says nothing about each attribute.
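The tag deletions and the surface replacement from the checklist could be scripted roughly as follows. This is only a sketch: it treats a feature's tags as a plain dictionary, the names `DROP_KEYS` and `clean_tags` are hypothetical, and it deliberately leaves out the `fcc`, `name`, and `oneway` items, which need dataset-specific decisions.

```python
# Keys the checklist proposes deleting outright.
DROP_KEYS = {
    "group",
    "roadlabel",
    "streetname",
    "streetpref",
    "streetsuff",
    "streettype",
}


def clean_tags(tags: dict[str, str]) -> dict[str, str]:
    """Return a copy of tags with import-specific keys removed."""
    cleaned = {key: value for key, value in tags.items() if key not in DROP_KEYS}
    # The source only says "paved", so avoid implying a specific material.
    if cleaned.get("surface") == "asphalt":
        cleaned["surface"] = "paved"
    return cleaned
```

If `name=*` is to be rebuilt from the `street*` fields, that step would need to run before this cleanup removes them.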