gdcc / pyDataverse

Python module for Dataverse Software (dataverse.org).
http://pydataverse.readthedocs.io/
MIT License

Add Pandas integration for Datafiles #97

Open skasberger opened 3 years ago

skasberger commented 3 years ago

Add a pandas integration for spreadsheet datafiles. There are two options to do this:

  1. Create an API request with DataAccess.get_datafile(), which should return a pandas DataFrame instead of the requests.Response object.
  2. Create an API request with DataAccess.get_datafile(), which should return a models.Datafile() object with the data stored inside and a .to_df() method that returns the data as a pandas DataFrame.
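
To make the two options easier to compare, here is a minimal sketch of what each could look like. It is not existing pyDataverse code: `content_to_df` and the `Datafile` dataclass below are hypothetical stand-ins, and the sketch assumes the downloaded datafile is delimited text (CSV or tab-separated) that is already available as bytes, for example from the `content` attribute of a `requests.Response`.

```python
import io
from dataclasses import dataclass


def content_to_df(content: bytes, **read_csv_kwargs):
    """Option 1 (sketch): parse downloaded bytes straight into a DataFrame.

    Hypothetical helper, not part of pyDataverse. Extra keyword arguments
    are forwarded to pandas.read_csv (e.g. sep="\t" for tab-separated data).
    """
    # Imported lazily so that pandas can stay an optional dependency.
    import pandas as pd

    return pd.read_csv(io.BytesIO(content), **read_csv_kwargs)


@dataclass
class Datafile:
    """Option 2 (sketch): hypothetical stand-in for models.Datafile.

    It only stores the filename and the raw bytes of the download;
    the real model carries metadata that is left out here.
    """

    filename: str
    content: bytes

    def to_df(self, **read_csv_kwargs):
        """Convert the stored bytes to a DataFrame on demand."""
        return content_to_df(self.content, **read_csv_kwargs)


if __name__ == "__main__":
    # Stand-in for the bytes an API request would return.
    raw = b"id,name\n1,alpha\n2,beta\n"

    # Option 1: the caller gets a DataFrame directly.
    print(content_to_df(raw))

    # Option 2: the caller gets an object and converts it when needed.
    datafile = Datafile(filename="table.csv", content=raw)
    print(datafile.to_df())
```

In this sketch, option 2 keeps the raw bytes available alongside the DataFrame, whereas option 1 returns only the DataFrame. If the file is tab-separated, `sep="\t"` can be passed through in either variant.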

Idea coming from https://github.com/gdcc/pyDataverse/issues/80.

Prepare

Implementation

Review

Follow-Ups

pdurbin commented 4 months ago

As discussed during the 2024-02-14 meeting of the pyDataverse working group, we are closing old milestones in favor of a new project board at https://github.com/orgs/gdcc/projects/1 and removing issues (like this one) from those old milestones. Please feel free to join the working group! You can find us at https://py.gdcc.io and https://dataverse.zulipchat.com/#narrow/stream/377090-python