With #12 we introduced set-based pk duplicate detection in Python, which is not how it should be done. We simply cannot do that solely in Python, as the db might apply different rules for identity checks.
On the other hand we also cannot simply ignore duplicates and let the db "somehow" deal with them, as that would surface db differences:
- postgres / sqlite: documented as undefined behavior (in tests the first value was always applied, later values were ignored)
- mysql: the docs say nothing about it (in tests all value changes were applied in logical order)
- oracle shims (yet to come): raises a duplicate error
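To make the ambiguity concrete, here is a minimal sketch of the problematic input (the `Item` model and the `fast_update` call are placeholder names for illustration, not the actual API): two input objects share a pk but disagree on the value, so the final row state depends entirely on which engine behavior from the list above applies.

```python
from myapp.models import Item  # hypothetical model with an IntegerField `value`

# Two update rows targeting the same pk with conflicting values.
a = Item(pk=1, value=1)
b = Item(pk=1, value=2)

# Depending on the engine this may persist value=1 (postgres/sqlite:
# later duplicates ignored), value=2 (mysql: changes applied in order),
# or raise a duplicate error (oracle shims).
Item.objects.fast_update([a, b], ['value'])
```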
Option 1: keep set/hash reduction for pk types, where applicable
Using Python internals is by far the fastest approach, so it might be a good idea to keep it for primitive field types that are known to be handled the same way by every db engine. This should be true for int and string types, and maybe also for date types (caution with sqlite here).
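A minimal sketch of what that could look like; the type whitelist and the helper name are assumptions for illustration, not the actual implementation:

```python
# Pk types whose Python equality/hash semantics are assumed to match
# the db's identity rules (date types would need extra care on sqlite).
SAFE_PK_TYPES = (int, str)

def find_duplicate_pks(objs):
    """Return the set of pks seen more than once, or None if any pk has
    a type we cannot safely compare in Python (caller then falls back
    to the db roundtrip described in option 2)."""
    seen, duplicates = set(), set()
    for obj in objs:
        pk = obj.pk
        if not isinstance(pk, SAFE_PK_TYPES):
            return None
        if pk in seen:
            duplicates.add(pk)
        seen.add(pk)
    return duplicates
```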
Option 2: explicit db roundtrip with a pk__in reduction
This should be possible as a fallback for all types, but it creates an additional db query (bad perf). It is probably the only way for complex types (e.g. json, hstore, custom-defined types). Most projects will never use a complex type as pk type (discouraged), as they are not even stable across db engines (e.g. json identity is handled very differently by postgres/jsonb vs. sqlite/string-repr).
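A rough sketch of that fallback, assuming all input objects already exist in the db (true for an update); the function name and return convention are again just illustrative:

```python
def has_db_duplicates(model, objs):
    """Let the db apply its own identity rules: send all candidate pks
    in one pk__in query and compare the number of distinct matched rows
    against the number of input objects. Fewer matches than inputs means
    at least two inputs collapse onto the same row."""
    pks = [obj.pk for obj in objs]
    matched = set(
        model.objects.filter(pk__in=pks).values_list('pk', flat=True)
    )
    return len(matched) < len(pks)
```

Whatever the dedupe policy ends up being, the single pk__in query keeps the overhead at one extra roundtrip regardless of batch size.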