Open ChenghaoMou opened 2 years ago
Near deduplication #7 only operates at the file level. It is also possible for a file to be
Should we do something about them, knowing they contain large chunks of repeated snippets?
How hard would it be to analyze how often this is the case, maybe on a subset of the data?
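One rough way to get a first number is sketched below. It is not the project's method, just an assumption-laden starting point: treat a file as "repetitive" when a large share of its non-blank lines sit inside a `window`-line chunk that occurs more than once in the same file. The function names, the `window` size, and the `threshold` are all hypothetical choices for illustration, and matching is exact after whitespace stripping, so near-identical snippets would be missed.

```python
from collections import Counter


def repeated_window_ratio(text: str, window: int = 5) -> float:
    """Fraction of non-blank lines covered by a `window`-line chunk
    that appears more than once within the same file (exact match)."""
    # Normalize: strip surrounding whitespace and drop blank lines.
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) < window:
        return 0.0

    # Every run of `window` consecutive lines is one candidate chunk.
    chunks = [tuple(lines[i:i + window]) for i in range(len(lines) - window + 1)]
    counts = Counter(chunks)

    # Mark each line that belongs to at least one repeated chunk.
    covered = [False] * len(lines)
    for start, chunk in enumerate(chunks):
        if counts[chunk] > 1:
            for j in range(start, start + window):
                covered[j] = True
    return sum(covered) / len(lines)


def share_of_repetitive_files(files, window: int = 5, threshold: float = 0.5) -> float:
    """Share of files whose repeated-window ratio is at least `threshold`.
    Both `window` and `threshold` are arbitrary illustrative defaults."""
    ratios = [repeated_window_ratio(text, window) for text in files]
    if not ratios:
        return 0.0
    return sum(r >= threshold for r in ratios) / len(ratios)
```

Running `share_of_repetitive_files` over a random sample of file contents would give a first estimate of how common the problem is. Sweeping `window` and `threshold` would show how sensitive that estimate is to the definition.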