Closed vladsud closed 2 years ago
@anthony-murphy, @DLehenbauer, @taylorsw04 - FYI.
i think we can solve the sequence issue around catch up ops. we have another snapshot format that we haven't rolled out, which stores deleted content inline, and rehydrates the deleted segments. rather than keeping the ops. we could actually change this to just store the length, and we don't actually need the deleted content. search would need to know how to parse the new format. and it need some stabilization before it would be safe to roll out. this won't solve the problem for other dds (if any exist with the problem), and it won't solve the issue of trailing ops outside the summary aka > summary ref seq
This issue has been automatically marked as stale because it has had no activity for 180 days. It will be closed if no further activity occurs within 8 days of this comment. Thank you for your contributions to Fluid Framework!
Applications build on Fluid Framework quite often need to adhere to various retention policies when it comes to content that no longer is in the document from user POV. An example of that would be a paragraph deleted by a user from a document and application (or ecosystem that this app is part of) promising that all deleted content is gone (and no longer retrievable) from storage.
Any solution in this space will likely require ability to summarize file at rest to get rid of trailing ops. That could assume collab window being collapsed to be empty, and thus summary not to include previous edits. But that likely is not sufficient as
As result, there is a need for summarization format of DDSs to be structured in a way where
Basically, reverse of what Sequence does today, where # 1 is state at MSN and # 2 tracking changes from # 1 into future - later should look back in history.
This kind of design will allow us to mark blobs in second bucket to be deleted by service when certain rules are met like