jonathanrstern closed 1 year ago
Currently we don't provide such connectors.
No matter what, you'll need to pay for egress to read the photos, so you might as well download them locally for deduplication.
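Once the photos are on local disk, byte-identical copies can be found without any image library. The following is only a minimal sketch of that idea: the function name `find_exact_duplicates` and the directory layout are hypothetical, and it catches exact copies only, not resized or re-encoded near-duplicates.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def find_exact_duplicates(root):
    """Group files under `root` by SHA-256 of their bytes.

    Hypothetical helper: returns a list of groups, each a list of
    paths whose contents are byte-for-byte identical.
    """
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            # read_bytes() loads the whole file; fine for photos,
            # switch to chunked reads for very large files.
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            groups[digest].append(path)
    # Keep only digests that were seen more than once.
    return [paths for paths in groups.values() if len(paths) > 1]
```

Calling `find_exact_duplicates("photos/")` on the download directory would return one list per set of identical files; which copy to keep is left to the caller.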
@Joshfindit
Let's say you have an app with 1,000 users. Each has 1,000 photos, some of which are duplicates. All are saved to S3.
How would you go about de-duping?
I probably wouldn't end up in that situation, because I'd likely have built the app to store files on S3 under some sort of hash (my current go-to is <SHA256>.<size in bytes>).
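As a rough illustration of that naming scheme, here is a minimal sketch that derives a `<SHA256>.<size in bytes>` key from a file-like object. The function name `content_key` and the chunk size are assumptions for the example, not part of any library discussed here.

```python
import hashlib
import io


def content_key(stream, chunk_size=1024 * 1024):
    """Return '<sha256 hex>.<size in bytes>' for a binary stream.

    Hypothetical helper: reads in chunks so large photos do not
    need to fit in memory at once.
    """
    digest = hashlib.sha256()
    size = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        digest.update(chunk)
        size += len(chunk)
    return f"{digest.hexdigest()}.{size}"


# Two uploads with identical bytes map to the same key.
first = content_key(io.BytesIO(b"photo bytes"))
second = content_key(io.BytesIO(b"photo bytes"))
print(first == second)
```

With keys like this, a byte-identical upload lands on an object name that already exists, so exact duplicates are caught at write time instead of needing a later scan.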
If I walked into that as a project I'd:
All of my photos are saved on S3. Is there a way to use this library, or one like it, to identify duplicates?