-
**Is your feature request related to a problem? Please describe.**
Currently core pylibcudf owning objects are difficult to construct. Columns are constructed using a complete signature involving nea…
vyasr updated
3 weeks ago
-
May I ask whether this method can be removed? Would removing it affect performance, or the quality of the results?
def sort(data, sort_size=500, mode='train'):
    """ Sort the data by feature length.
    Sort is used after shuffle and before batch, so we can group
    utt…
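
For context on the question, here is a minimal sketch of what a windowed sort by feature length typically looks like. It is not the project's actual implementation: the name `buffered_sort`, the `"feat"` key, and the dict-shaped samples are assumptions made for illustration. Sorting inside a fixed-size window keeps samples of similar length next to each other, which in general reduces padding once they are batched.

```python
from typing import Any, Dict, Iterable, Iterator, List


def buffered_sort(
    data: Iterable[Dict[str, Any]], sort_size: int = 500
) -> Iterator[Dict[str, Any]]:
    """Yield samples sorted by feature length within windows of `sort_size`.

    Assumption: each sample is a dict whose "feat" entry has a length.
    """
    buf: List[Dict[str, Any]] = []
    for sample in data:
        buf.append(sample)
        if len(buf) >= sort_size:
            # Sort only the current window, so order across windows is kept.
            buf.sort(key=lambda s: len(s["feat"]))
            yield from buf
            buf = []
    # Flush the final, possibly smaller, window.
    buf.sort(key=lambda s: len(s["feat"]))
    yield from buf
```

Because only a bounded window is sorted, the earlier shuffle still determines which samples land in the same window. Dropping such a step would usually change how much padding each batch carries rather than which samples are seen.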
-
### Search before asking
- [X] I searched the [issues](https://github.com/IBM/data-prep-lab/issues) and found no similar issues.
### Component
Transforms/Other
### What happened + What you expect…
-
## Bug description
Data pipeline Destination name shows as "Unnamed" despite being named
## How to reproduce
1. Name the destination and save.
2. Scroll
Before Scro…
-
Design and set up a better long-term testing structure and method for all types of testing
- Decide on a testing framework
- Create designated test folder with necessary persistent data
- find/crea…
-
After we bump the current processing pipeline version, data processed by previous pipeline versions no longer serves a purpose. We should delete it so that it does not accumulate as waste.
No longer purposef…
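
The cleanup described above could be sketched as follows. This is only an illustration under stated assumptions: the `root/v<N>` directory layout, the integer version numbers, and the function name `delete_stale_versions` are all hypothetical and do not come from the project.

```python
import shutil
from pathlib import Path
from typing import List


def delete_stale_versions(
    root: Path, current_version: int, dry_run: bool = True
) -> List[Path]:
    """Find `root/v<N>` directories with N < current_version.

    Returns the stale paths; deletes them only when dry_run is False.
    The `v<N>` naming scheme is an assumption for this sketch.
    """
    stale: List[Path] = []
    for child in sorted(root.iterdir()):
        name = child.name
        if child.is_dir() and name.startswith("v") and name[1:].isdigit():
            if int(name[1:]) < current_version:
                stale.append(child)
                if not dry_run:
                    shutil.rmtree(child)
    return stale
```

Defaulting to `dry_run=True` lets the list of candidate directories be reviewed before anything is removed.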
-
#### Description
Using the CLI parameter `--scenarios` results in broken DAGs due to incorrect conversion of the parameter into `egon-data.configuration.yaml`. This causes the following error:
###…
-
Hello! Is it possible to set up Quesma without Elasticsearch?
I mean a setup like this: `Kibana -> Quesma -> ClickHouse`
With this configuration:
```yaml
#licenseKey: {your-quesma-lice…
```
-
I am currently working with imbalanced data using the `mlr3pipelines` package and applying the SMOTE method for balancing. Since SMOTE requires numeric features, I used `po("encode")` to convert categorica…
-
The main workflow `generate_report.yml` reuses the original Orion workflow, which has been removed from Orion in favour of the Artsdata Pipeline Action.
This workflow needs to be updated to use th…