apache / incubator-graphar

An open source, standard data file format for graph data storage and retrieval.
https://graphar.apache.org/
Apache License 2.0
217 stars 46 forks source link

[Feat] Include additional built-in data types for GraphAr libraries #76

Open lixueclaire opened 1 year ago

lixueclaire commented 1 year ago

Is your feature request related to a problem? Please describe. Currently, the GraphAr C++ and Spark libraries supports only several basic data types (including BOOL, INT32, INT64, FLOAT, DOUBLE, and STRING). To serve more scenarios, more built-in data types need to be added in GraphAr libraries.

Describe the solution you'd like Add more common data types to the GraphAr libraries, such as DATE, TIME, BINARY, STRUCT, MAP, ARRARY, and JSON. Since these types are not always supported by the CSV/ORC/Parquet file types and the C++/Spark standard libraries, careful handling should be taken in each case, e.g., performing the necessary type conversions.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context Add any other context or screenshots about the feature request here.

acezen commented 1 year ago

Support the data types of Graph Query Language standard.

The GQL standard is not yet publicly available, we can refer to the data types of [cypher language](https://neo4j.com/docs/cypher-manual/current/syntax/values/].

acezen commented 1 year ago

For arrow and GQL standard. DateTime and Array Type could be the first priority and the good first issue to address