Open DanSmith opened 2 months ago
For background, this question arose from a discussion with colleagues at Making-Sense (https://making-sense.info/), who are looking at using SDTH to describe the operations in a VTL (https://sdmx.org/?page_id=5096) script. VTL distinguishes between persistent and transient datasets. However, VTL considers a dataset persistent if it is transferred from one VTL script to another. Since this transfer could happen entirely in silica, a VTL persistent dataset may never exist on a permanent storage device.
SDTH assumes that a FileInstance is not in in computer memory, because it must be Loaded to be used and Saved. In contrast, a DataframeInstance is transient, because it will disappear when execution of the script ends.
"File" implies a storage device. Should we use a more general concept that is more consistent with VTL?
Database, endpoint, etc are all similar concepts. Should file be more generalized?