erezsh / reladiff

High-performance diffing of large datasets across databases
https://reladiff.readthedocs.io/en/latest/index.html#
Other
334 stars 5 forks source link

Add support for Microsoft SQL Server #20

Open ryano144 opened 1 month ago

ryano144 commented 1 month ago

Although it wasn't great, there was basic support added for MS SQL in the main data-diff back in September in this PR: https://github.com/datafold/data-diff/pull/696/.

Then updated slightly in this one: https://github.com/datafold/data-diff/pull/825

erezsh commented 1 month ago

I am aware that a MS-SQL driver had been attempted.

@ryano144 Have you used and benchmarked their implementation?

My impression is that the driver is too slow to be useful for cross-db comparison, as it uses the HashBytes() function which is approx 100 times slower than the equivalent function in postgres .

But perhaps there are details that I'm not aware of?