Use triggers to track when rows in a SQLite table were updated or deleted
pip install sqlite-chronicle
This module provides a function: sqlite_chronicle.enable_chronicle(conn, table_name)
, which does the following:
_chronicle_{table_name}
table exists already. If so, it does nothing. Otherwise...added_ms
, updated_ms
, version
and deleted
added_ms
and updated_ms
to the current timestamp in milliseconds, and version
column that starts at 1 and increments for each subsequent rowadded_ms
and updated_ms
to the current time and increments the version
updated_ms
timestamp and also updates any primary keys if they have changed (likely extremely rare) plus increments the version
updated_ms
, increments the version
and places a 1
in the deleted
columnThe function will raise a sqlite_chronicle.ChronicleError
exception if the table does not have a single or compound primary key.
Note that the version
for a table is a globally incrementing number, so every time it is set it will be set to the current max(version)
+ 1 for that entire table.
The end result is a chronicle table that looks something like this:
id | added_ms | updated_ms | version | deleted |
---|---|---|---|---|
47 | 1694408890954 | 1694408890954 | 2 | 0 |
48 | 1694408874863 | 1694408874863 | 3 | 1 |
1 | 1694408825192 | 1694408825192 | 4 | 0 |
2 | 1694408825192 | 1694408825192 | 5 | 0 |
3 | 1694408825192 | 1694408825192 | 6 | 0 |
The sqlite_chronicle.updates_since()
function returns a generator over a list of Change
objects.
These objects represent changes that have occurred to rows in the table since the since
version number, or since the beginning of time if since
is not provided.
conn
is a SQLite connection objecttable_name
is a string containing the name of the table to get changes forsince
is an optional integer version number - if not provided, all changes will be returnedbatch_size
is an internal detail, controlling the number of rows that are returned from the database at a time. You should not need to change this as the function implements its own internal pagination.Each Change
returned from the generator looks something like this:
Change(
pks=(5,),
added_ms=1701836971223,
updated_ms=1701836971223,
version=5,
row={'id': 5, 'name': 'Simon'},
deleted=0
)
A Change
is a dataclass with the following properties:
pks
is a tuple of the primary key values for the row - this will be a tuple with a single item for normal primary keys, or multiple items for compound primary keysadded_ms
is the timestamp in milliseconds when the row was addedupdated_ms
is the timestamp in milliseconds when the row was last updatedversion
is the version number for the row - you can use this as a since
value to get changes since that pointrow
is a dictionary containing the current values for the row - these will be None
if the row has been deleted (except for the primary keys)deleted
is 0
if the row has not been deleted, or 1
if it has been deletedAny time you call this you should track the last version
number that you see, so you can pass it as the since
value in future calls to get changes that occurred since that point.
Note that if a row had multiple updates in between calls to this function you will still only see one Change
object for that row - the updated_ms
and version
will reflect the most recent update.
Chronicle tables can be used to efficiently answer the question "what rows have been inserted, updated or deleted since I last checked" - by looking at the version
column which has an index to make it fast to answer that question.
This has numerous potential applications, including: