konigio / konig

A collection of tools for semantic modeling

Sort records by timestamp in ETL pipeline #1618

Closed by gmcfall 5 years ago

gmcfall commented 5 years ago

I want the ETL pipeline to sort source records based on timestamps.

The current logic for merging records from multiple source nodes into a single target node takes values from the source nodes in the order they are listed in the konig:explicitDerivedFrom property (given by the Derived From column in a semantic workbook).

The new logic should instead prioritize values by the konig:modified timestamp: the value from the record with the latest timestamp gets the highest priority. The order in which source nodes are listed in the konig:explicitDerivedFrom property shall be considered only when two source records have the same timestamp.
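To make the intended ordering concrete, here is a minimal Python sketch of the rule. It is an illustration only and not the actual Konig implementation: `SourceRecord` and `merge_records` are hypothetical names, and "the first non-null value in priority order wins" is my assumption about how the merge consumes the ordering.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Any, Dict, List


@dataclass
class SourceRecord:
    """Hypothetical stand-in for a record read from one source node."""
    source: str          # name of the source node
    modified: datetime   # value of konig:modified for this record
    values: Dict[str, Any] = field(default_factory=dict)


def merge_records(records: List[SourceRecord],
                  derived_from: List[str]) -> Dict[str, Any]:
    """Merge source records into a single target record.

    Priority: latest konig:modified first; on equal timestamps, the
    order of sources in konig:explicitDerivedFrom (derived_from).
    """
    rank = {name: i for i, name in enumerate(derived_from)}

    # First order by the listed position of each source node...
    ordered = sorted(records, key=lambda r: rank[r.source])
    # ...then stable-sort by timestamp, newest first. Python's sort keeps
    # the existing order for equal keys even with reverse=True, so the
    # listed order survives as the tie-breaker.
    ordered.sort(key=lambda r: r.modified, reverse=True)

    target: Dict[str, Any] = {}
    for record in ordered:
        for prop, value in record.values.items():
            # Assumption: a missing (None) value never overrides, and the
            # highest-priority non-null value for a property wins.
            if value is not None and prop not in target:
                target[prop] = value
    return target
```

With this sketch, a property present in both an older and a newer record takes the newer value, while a property found only in the older record is still carried over to the target.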

gmcfall commented 5 years ago

This issue was fixed by #1619.