luminousmen / data-toolset

Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.
MIT License
2 stars 0 forks source link

Function would convert from avro/parquet/csv/json to avro/parquet/csv/json #6

Open luminousmen opened 1 year ago

luminousmen commented 1 year ago

As part of the tool, we need to develop a versatile data conversion functionality. As a result, the tool should be capable of converting data between various formats, including Avro, Parquet, CSV, and JSON. The goal is to provide a seamless and efficient way to transform data from one format to another to support our diverse data sources and consumption needs.

AC:

Create a data conversion library that supports the following conversions:

  1. From Avro to Parquet
  2. From Avro to CSV
  3. From Avro to JSON
  4. From Parquet to Avro
  5. From Parquet to CSV
  6. From Parquet to JSON
  7. From CSV to Avro
  8. From CSV to Parquet
  9. From CSV to JSON
  10. From JSON to Avro
  11. From JSON to Parquet
  12. From JSON to CSV
ankan-mazumdar commented 12 months ago

Hello luminousmen, I would like to contribute, could you please schedule a meeting to explain the project.