derrickoswald / CIMSpark

Spark access to Common Information Model (CIM) files
MIT License
15 stars 1 forks source link

CIMDifference #24

Closed derrickoswald closed 4 years ago

derrickoswald commented 4 years ago

The ability to read CIM files into two different named RDD sets, i.e. RDD[\<CIM class>], via Allow for named RDD variations, allows for generation of a difference file based on two reference CIM files.

This issue tracks the creation of a new standalone executable called CIMDifference, that will perform this task.

Essentially:

derrickoswald commented 4 years ago

Commit efdf3aba596b1ca90947a2d14669ca3dd291b4d9 adds CIMDifference artifacts to the Maven repo. Actual implementation uses fullOuterJoin() instead of the separate foreach loops mentioned above.