Add copy_dataset as a built in transformer function
Add new helper functions in helper.py:
dataset_from_fsurl: Create a dataset from the contents of an fsspec URL
metadata_from_fsurl: Create metadata, FILESET file list from fsspec URL
derived_dataset: Create a derived dataset via a single function call on .data from a
previous dataset
Add support for using arbitrary conda channels in the environment.yml while forcing the order the channels getting added using a channel-order section (coming in #245)
API Changes
.DESCR -> .README
.EXTRA -> .FILESET
Documentation Updates
Update docs to be more clear and concise
Re-add process_20_newsgroups as a working example of a process function
Tests
Use a miniconda image directly instead of python image and installing miniconda on it
Features
copy_dataset
as a built in transformer functionhelper.py
:dataset_from_fsurl
: Create a dataset from the contents of an fsspec URLmetadata_from_fsurl
: Create metadata, FILESET file list from fsspec URLderived_dataset
: Create a derived dataset via a single function call on.data
from a previous datasetenvironment.yml
while forcing the order the channels getting added using achannel-order
section (coming in #245)API Changes
.DESCR
->.README
.EXTRA
->.FILESET
Documentation Updates
docs
to be more clear and conciseprocess_20_newsgroups
as a working example of a process functionTests
_paths.py