VTECRM / vtenext

vtenext the CRM for the Digital Innovation. It allows you to engage your customers into your business processes using a specific technology. It can also be used to manage processes generated by internal customers.
GNU Affero General Public License v3.0
33 stars 14 forks source link

Storage deduplication #10

Open Pe46dro opened 2 years ago

Pe46dro commented 2 years ago

Hi, i use your software from 2016 and I think that the table _attachments can be improved. The new insert/update workflow should be something like this:

  1. Calculate hash file before insert or update
  2. If there is already a document with same hash , document can be "deduplicated" only the relation is needed on _seattachmentsrel
  3. Storage space needed will be reduced
  4. Profit!

The implementation should be not sto hard and not too much time consuming for recovery of previous lines. On our installation i made some script to check the reduction rate and we are around 33% of stored data reduction.

millenium-codebug commented 2 years ago

Great suggestion, i suggested a similar implementation on #2 for messages