ipfs / ipfs

Peer-to-peer hypermedia protocol
https://ipfs.tech
MIT License
(draft) Common Bytes - standard for data deduplication #444

Closed danimesq closed 4 years ago

danimesq commented 4 years ago

This is a proposed standard for deduplicating the bytes that different versions of the same kind of file have in common. It takes inspiration from git objects, but uses additional approaches to ensure the right parts of the content are organized. It is meant not only to deduplicate data, but also to link identical data that is represented in different kinds of files.
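
To make the idea concrete, here is a minimal sketch of chunk-level deduplication. It assumes fixed-size chunks addressed by their SHA-256 digest, which is similar in spirit to git objects but is not something the draft specifies. The names `ChunkStore`, `put`, `split_chunks`, and the tiny `CHUNK_SIZE` are hypothetical and chosen only for illustration.

```python
import hashlib

CHUNK_SIZE = 4  # hypothetical; kept tiny so the example is easy to follow


def split_chunks(data: bytes, size: int = CHUNK_SIZE) -> list[bytes]:
    """Split data into consecutive fixed-size chunks (the last may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


class ChunkStore:
    """Hypothetical store that keeps each distinct chunk exactly once."""

    def __init__(self) -> None:
        self.chunks: dict[str, bytes] = {}

    def put(self, data: bytes) -> list[str]:
        """Store data and return its recipe: the ordered list of chunk digests."""
        recipe = []
        for chunk in split_chunks(data):
            digest = hashlib.sha256(chunk).hexdigest()
            # A chunk already seen in another version is not stored again.
            self.chunks.setdefault(digest, chunk)
            recipe.append(digest)
        return recipe


store = ChunkStore()
v1 = store.put(b"AAAABBBBCCCC")  # first version of a file
v2 = store.put(b"AAAABBBBDDDD")  # second version, differing in the last chunk
shared = set(v1) & set(v2)       # digests of the bytes both versions share
```

Both versions contain six chunks in total, but the store holds only four, because the two leading chunks are common to both. Fixed-size chunking is the simplest choice; it stops finding common bytes as soon as an insertion shifts the data, which is one reason a real design might prefer content-defined chunk boundaries.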

danimesq commented 4 years ago

discuss.ipfs post: https://discuss.ipfs.io/t/draft-common-bytes-standard-for-data-deduplication/6813

danimesq commented 4 years ago

It could also support I/O deduplication, by generating the different versions of the same file from their common bytes.
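
One possible reading of this, sketched under assumptions: each version is kept as an ordered list of digests (a "recipe"), and the file is regenerated on demand from a shared table of common chunks. The names `common`, `recipes`, and `rebuild` are hypothetical, and the use of SHA-256 is an assumption rather than part of the draft.

```python
import hashlib


def digest(chunk: bytes) -> str:
    """Content address of a chunk (SHA-256 is an assumption, not part of the draft)."""
    return hashlib.sha256(chunk).hexdigest()


# Shared table of chunks, each stored once regardless of how many versions use it.
common = {digest(c): c for c in (b"AAAA", b"BBBB", b"CCCC", b"DDDD")}

# Each version is only an ordered list of digests pointing into the shared table.
recipes = {
    "v1": [digest(b"AAAA"), digest(b"BBBB"), digest(b"CCCC")],
    "v2": [digest(b"AAAA"), digest(b"BBBB"), digest(b"DDDD")],
}


def rebuild(recipe: list[str], table: dict[str, bytes]) -> bytes:
    """Regenerate a file version by concatenating its chunks in recipe order."""
    return b"".join(table[d] for d in recipe)


file_v1 = rebuild(recipes["v1"], common)
file_v2 = rebuild(recipes["v2"], common)
```

Only the recipes differ between versions, so reading or writing a new version touches just the chunks that are not already in the table.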

hsanjuan commented 4 years ago

Thanks for posting to discuss! I'll close this.