sassoftware / saspy

A Python interface module to the SAS System. It works with Linux, Windows, and Mainframe SAS as well as with SAS in Viya.
https://sassoftware.github.io/saspy

add partitioning, static columns and parameterization options #601

Closed rainermensing closed 1 month ago

rainermensing commented 1 month ago

Changes to the sasioiom module:

  1. To make the files more efficient to read afterwards with distributed frameworks such as Spark, I added an option to write the parquet file partitioned.
  2. Further, I added an option to add static columns to the final file. This is useful when the goal is to union tables with the same schema, e.g. adding a "year" column to tables that store data from different years.
  3. Added parameters to control the chunk size and the partition size.
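
To illustrate how the three options fit together, here is a minimal, dependency-free sketch. It is not the code in this pull request: the names `iter_chunks`, `add_static_columns`, and `plan_partitions` are hypothetical, rows are plain dicts, and the actual write step (for example with a parquet library) is left out. It only shows how rows could be read in chunks, extended with static columns, and grouped into the Hive-style `column=value` directories that Spark can discover.

```python
from collections import defaultdict
from itertools import islice


def iter_chunks(rows, chunk_size):
    """Yield lists of at most chunk_size rows (hypothetical helper)."""
    it = iter(rows)
    while True:
        chunk = list(islice(it, chunk_size))
        if not chunk:
            return
        yield chunk


def add_static_columns(chunk, static_columns):
    """Return a copy of each row with the same constant columns added."""
    return [{**row, **static_columns} for row in chunk]


def plan_partitions(rows, chunk_size, static_columns, partition_cols):
    """Map each Hive-style partition directory to the rows that belong in it.

    A real writer would flush every chunk to a file inside the directory
    instead of collecting the rows in memory; this is an assumption made
    to keep the sketch short.
    """
    layout = defaultdict(list)
    for chunk in iter_chunks(rows, chunk_size):
        for row in add_static_columns(chunk, static_columns):
            directory = "/".join(f"{col}={row[col]}" for col in partition_cols)
            layout[directory].append(row)
    return dict(layout)


# Example data: a table for one year that gets a static "year" column.
rows = [
    {"id": 1, "region": "EU"},
    {"id": 2, "region": "US"},
    {"id": 3, "region": "EU"},
]
layout = plan_partitions(
    rows,
    chunk_size=2,
    static_columns={"year": 2020},
    partition_cols=["year", "region"],
)
for directory in sorted(layout):
    print(directory, [row["id"] for row in layout[directory]])
```

Because every table gets the same static column, tables for different years end up under different `year=...` directories with an identical schema, so they can be read back as one dataset.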

@tomweber-sas I won't get to it this week, but since you already integrated this into the other IO modules: if you accept these changes, I could add them to the other modules too, either in this pull request or in a new one. Or you can do it yourself, whichever you prefer.

Signed-off-by: Rainer Mensing <rainer.mensing@hotmail.de>