Closed RedSquirrel87 closed 6 months ago
Nice catch, I've added another optional param if you need to pass in your own set of file hashes, if not it will fall back to using self.db.md5
, and if that doesn't exist then it will try to upload everything and skip anything that returns a 409. The md5
col should always be present, unless you have no images (only videos, docs, etc.) then maybe those don't go through a dedup check.
ap = AmazonPhotos(
cookies={
...
},
db_path='~/ap.parquet',
dtype_backend='pyarrow',
engine='pyarrow',
)
md5s = ...
ap.upload('foo/bar', md5s=md5s)
I tried the latest version and with:
ap.upload("/home/mini/test")
it crashes with the following output:Is there some issue in the deduplication function or am I missing anything?