-
Great paper, and really clean and explainable repo, Thanks!
Any plans to release the Hyperpartisan dataset and benchmark utils?
It could really help future researchers go through your pipeline of cl…
-
Imagine a pipeline that copies `/pfs/input` to a tmp directory outside of `/pfs`, say `/tmp`, processes data under `/tmp`, then finally copies `/tmp` back to `/pfs/output`.
This pipeline is problem…
-
Could you please let us know what are the missing values for the spacing along the z axis of train_11755_a_3 and train_11755_a_4? Thanks
-
There should be a component that can pass arguments directly from one node to the next. This is an edge case, but sometimes you may want to pass a data object directly to a joiner (ie with no addition…
-
__Proposal: Add generic support for exploratory workflows__
In big data pipelines, various variants of a specific operator may be
considered. For instance, the variants are related to a data cle…
-
Hello! I had a great chat with @romicolman today and she encouraged me to write some quick paragraphs describing features that would help it see adoption by me (and others in my field).
For my work…
-
## ❓ Contexte
Après avoir exploré nos données, nous avons dressé un ensemble de constats concernant la qualité des données. De ces constats, il faut mettre en place un flux de nettoyage et d'augmenta…
-
- Using `e-mission-server` `master`
- Load the Apr 24 dump for `dfc-fermata`
- Inspect the cleaned objects for my opcode:
```python
import emission.storage.timeseries.abstract_timeseries as es…
-
Should we add a bounty for integrating a SHACL validator in Whyis?
A data curation pipeline could pass an incoming dataset through various SHACL shape graphs and display validation reports (warning…
-
**Describe the issue**
I am following the tutorial In Introduction to generic pipelines and `Part 1 - Data Cleaning.ipynb` in [elyra-ai/examples](https://github.com/elyra-ai/examples/) is unable to …