Open Fokko opened 3 days ago
Is there a way on the Java/spark side to turn metadata information into JSON? With #535, perhaps we can compare the two JSON results and check for mismatches like this one.
@Fokko I would like to take a shot at this one.
@soumya-ghosh Feel free to take a stab at it, let me know if you run into anything
Is there a way on the Java/spark side to turn metadata information into JSON? With https://github.com/apache/iceberg-python/issues/535, perhaps we can compare the two JSON results and check for mismatches like this one.
That would be an interesting idea. We could take the PySpark schema and turn it into an Iceberg schema and compare the two (or just compare the Avro schemas)
@Fokko the PR https://github.com/apache/iceberg-python/pull/900 is ready for review.
Feature Request / Improvement
It looks like a misnamed field slipped in:
This should be
sequence_number
:Luckily this still worked due to Iceberg's field-id based lookup, but would be good to get this cleaned up.
Relevant code:
https://github.com/apache/iceberg-python/blob/a8d3f17d42b00b507a3522714fe431a18124493e/pyiceberg/manifest.py#L380