frictionlessdata / frictionless-py

Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
https://framework.frictionlessdata.io
MIT License
700 stars 148 forks source link

Support Feather data format #454

Open khughitt opened 3 years ago

khughitt commented 3 years ago

Overview

It should be pretty straight-forward to add a frictionless.parsers.FeatherParser class.

Using Pandas, one can do something like:

import pandas as pd
pd.read_feather('data.feather`)

It depends on the arrow python package being available.


Please preserve this line to notify @roll (lead of this repository)

khughitt commented 3 years ago

p.s. Instead of creating a FeatherParser class directly, it may be useful to instead create a PandasParser() class which wraps the pandas.read_xx() methods.

This way you could easy extend support to include other useful formats supported in pandas such a parquet / HDF5.

roll commented 3 years ago

Thanks!

Great idea I will add it to PadasParse :+1: