apache / incubator-graphar

An open source, standard data file format for graph data storage and retrieval.
https://graphar.apache.org/
Apache License 2.0
195 stars 40 forks source link

[Feat][Doc] Refactor and update the format specification document #387

Closed acezen closed 3 months ago

acezen commented 4 months ago

Proposed changes

Refactor and update the format specification document

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your code.

Further comments

close #362

Thespica commented 4 months ago

Hi, @acezen . I think we can highlight the benefits of splitting vertices/edges into chunks and splitting properties into property_groups (why we split that) for a little list, like:

1. Random access querying without reading all data from the file.
2. Appending data without modifying the existing files(or need only modifying the last chunk).

Maybe the third point: convenience for parallel reading.

acezen commented 3 months ago

Looks cool! One question: as I see, Timestamp and Date are already supported in #398, maybe we can update status.rst in this PR? Or it will be a separated PR?

Thanks for pointing out, I will update the status.rst in this PR.