gdcc / pyDataverse

Python module for Dataverse Software (dataverse.org).
http://pydataverse.readthedocs.io/
MIT License

Add direct datafile upload support (directupload.py) to pyDataverse #157

Open cmbz opened 1 year ago

cmbz commented 1 year ago

Background: Direct upload of datafiles from Python is currently available through a standalone script associated with the Harvard Dataverse Repository: dataverse.harvard.edu/util/python/direct-upload/directupload.py

This script lets users upload many datafiles and their associated metadata in one batch and request reindexing only once at the end. Calling the API separately for each file instead triggers reindexing after every upload, which degrades system performance.
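To illustrate the batching idea, here is a minimal sketch that collects metadata for several already-uploaded files into one JSON array, so that registration (and the reindexing it triggers) happens once instead of once per file. It is not taken from directupload.py. The field names (`fileName`, `mimeType`, `storageIdentifier`, `checksum`) are an assumption based on the Dataverse direct-upload API documentation, and the helper names, bucket, and storage identifiers are hypothetical.

```python
import hashlib
import json


def describe_file(name, content, storage_id,
                  mime_type="application/octet-stream"):
    """Build one metadata entry for a file already placed in storage.

    Field names are assumed from the Dataverse direct-upload docs;
    verify them against your installation before relying on them.
    """
    return {
        "fileName": name,
        "mimeType": mime_type,
        "storageIdentifier": storage_id,
        "checksum": {
            "@type": "SHA-1",
            "@value": hashlib.sha1(content).hexdigest(),
        },
    }


def build_batch_payload(entries):
    """Serialize all entries into a single JSON array (one registration call)."""
    return json.dumps(entries)


# Hypothetical files and storage identifiers, for illustration only.
files = [
    ("a.csv", b"x,y\n1,2\n", "s3://example-bucket:id-1"),
    ("b.txt", b"hello", "s3://example-bucket:id-2"),
]

payload = build_batch_payload(
    [describe_file(name, data, sid) for name, data, sid in files]
)
```

In a real workflow, a payload like this would be sent in a single request after all files have been uploaded, rather than issuing one registration request per file.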

Request & Rationale: Incorporating this functionality into pyDataverse would benefit Dataverse API users and pyDataverse users at any installation who need to upload large numbers of datafiles.

pdurbin commented 8 months ago

@cmbz, for now @JR-1991 has created a separate uploader. Please see: