apache / incubator-graphar

An open source, standard data file format for graph data storage and retrieval.
https://graphar.apache.org/
Apache License 2.0
195 stars 40 forks source link

[C++] Include an example of converting SNAP datasets to GraphAr format #386

Closed lixueclaire closed 4 months ago

lixueclaire commented 4 months ago

Proposed changes

This PR provides a sample conversion of SNAP datasets into GraphAr format, showcasing the process with the Facebook graph as a case study. The conversion leverages GraphInfo constructors and the high-level writer functions from the C++ library.

Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your code.

Further comments

As part of #275

acezen commented 4 months ago

Shell we upload the generated snap dataset of graphar format and provide a python download script to download it, like HuggingFace did? That would easy for user to get the data directly and no need to convert by themself.

lixueclaire commented 4 months ago

Shell we upload the generated snap dataset of graphar format and provide a python download script to download it, like HuggingFace did? That would easy for user to get the data directly and no need to convert by themself.

Certainly! This Pull Request serves as a demonstration of utilizing the GraphAr API for conversion. In the future, we plan to make generated datasets from sources like SNAP, LDBC, or others readily available to users. These datasets will be accessible for direct download via a dedicated repository or a specialized website, as detailed in discussion #275.

acezen commented 4 months ago

Can you add some document about the example to docs?

lixueclaire commented 4 months ago

Can you add some document about the example to docs?

Sure, I've created documentation for the example. Could you please review it?