datahq / dataflows

DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.
https://dataflows.org
MIT License
194 stars 39 forks source link

Excel output processor #47

Open rufuspollock opened 5 years ago

rufuspollock commented 5 years ago

Do we have one already? Where do I check this sort of stuff?

As a Developer I want to output the Data Package as an Excel file with each resource as a seperate tab so that I can share the excel file with people who use Excel

Branko-Dj commented 5 years ago

We don't have this option currently available.

To make it available I will add this functionality to already existing dump_to_path processor which currently supports only csv format so it can output the data and metadata in the separate sheets.

Current status of this can be found at https://github.com/datahq/dataflows/blob/master/PROCESSORS.md under dump_to_path. Once the coding is done I will update this file with instructions on how to use it and also the main TUTORIAL.md file with an example

akariv commented 5 years ago

Thanks, waiting for the PR :)

On Tue, Nov 20, 2018 at 10:38 AM Branko notifications@github.com wrote:

We don't have this option currently available.

To make it available I will add this functionality to already existing dump_to_path processor which currently supports only csv format so it can output the data and metadata in the separate sheets.

Current status of this can be found at https://github.com/datahq/dataflows/blob/master/PROCESSORS.md under dump_to_path. Once the coding is done I will update this file with instructions on how to use it and also the main TUTORIAL.md file with an example

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/datahq/dataflows/issues/47#issuecomment-440188331, or mute the thread https://github.com/notifications/unsubscribe-auth/AAQMdUDzzokZhidmftIuYVq7TP-BfzVqks5uw79zgaJpZM4YNRZ3 .

OriHoch commented 4 years ago

I wrote a basic implementation of this here: https://github.com/OriHoch/dataflows-xlsx

I'm hesitant about including it as part of core dataflows because it adds a big dependency on openpyxl and also I think that the processor is not as stable and consistent in usage as the core dump_to_path - e.g. it's not a standard tabular resource, can't load from it (only use the xlsx files directly) and excel itself is not a stable, well-define format.

akariv commented 4 years ago

Thanks @orihoch, that's awesome.

I'll review and reserve the right to fork it to the core organization :)

On Tue, Sep 24, 2019 at 10:18 AM Ori Hoch notifications@github.com wrote:

I wrote a basic implementation of this here: https://github.com/OriHoch/dataflows-xlsx

I'm hesitant about including it as part of core dataflows because it adds a big dependency on openpyxl and also I think that the processor is not as stable and consistent in usage as the core dump_to_path - e.g. it's not a standard tabular resource, can't load from it (only use the xlsx files directly) and excel itself is not a stable, well-define format.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/datahq/dataflows/issues/47?email_source=notifications&email_token=AACAY5MRKINFWQO35VZPGQDQLG5NHA5CNFSM4GBVCZ32YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD7NLEDI#issuecomment-534426125, or mute the thread https://github.com/notifications/unsubscribe-auth/AACAY5MHJZLFWYYBXLNHFKTQLG5NHANCNFSM4GBVCZ3Q .