b-cube / semantics-preprocessing

initial text preprocessors for the triplestore and feature classification

Add identifiers for XSD and a couple of other unwanted response types #80

Open roomthily opened 9 years ago

roomthily commented 9 years ago

These identifiers are for filtering responses out. Candidates: EPSG responses, additional QuakeML structures, and possibly some generic bibliographic structure (that one is not straightforward, but I don't want to tag each provider).
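A rough sketch of how an identifier for one of these could look, keyed on the root element of the response. Only the XSD entry is something I'm sure of (the W3C XML Schema namespace with a `schema` root); the `UNWANTED_ROOTS` table and the `split_tag` and `unwanted_type` names are made up for illustration and are not part of this repo. EPSG, QuakeML, or bibliographic entries would need their own root signatures added after checking real responses.

```python
import xml.etree.ElementTree as ET

# Hypothetical lookup table: (namespace, local name) of the root -> label.
# Only the XSD entry is filled in; others would be added after inspection.
UNWANTED_ROOTS = {
    ("http://www.w3.org/2001/XMLSchema", "schema"): "xsd",
}


def split_tag(tag):
    """Split ElementTree's '{namespace}local' tag form into (namespace, local)."""
    if tag.startswith("{"):
        namespace, _, local = tag[1:].partition("}")
        return namespace, local
    return "", tag


def unwanted_type(xml_text):
    """Return a label for a known unwanted response, or None if unrecognized."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Not well-formed XML, so this check has nothing to say about it.
        return None
    return UNWANTED_ROOTS.get(split_tag(root.tag))
```

Anything that comes back with a label could be dropped before it reaches the triplestore step; anything returning `None` falls through to the existing identification.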

There may also be a fuzzier way to identify unknown data responses, such as the ratio of numeric characters to alphabetic characters. This of course ignores social science data, which is often mostly text.
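A minimal sketch of the ratio idea, just to make it concrete. The function names and the `threshold` default of 1.0 are placeholders, not tuned against any real responses, and as noted this would miss text-heavy data such as social science datasets.

```python
def numeric_alpha_ratio(text):
    """Ratio of digit characters to alphabetic characters in text."""
    digits = sum(ch.isdigit() for ch in text)
    letters = sum(ch.isalpha() for ch in text)
    if letters == 0:
        # No letters at all: treat any digits as "all numeric".
        return float("inf") if digits else 0.0
    return digits / letters


def looks_like_data(text, threshold=1.0):
    """Guess that a response is raw data when digits outnumber letters.

    The threshold is an arbitrary starting point and would need tuning.
    """
    return numeric_alpha_ratio(text) >= threshold
```

For example, `looks_like_data("12.5 13.1 14.0")` is true because the text has digits and no letters, while a sentence-like abstract with a stray number or two falls well under the threshold.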