I'm running triage over [Red Hat 11.3.1-4] on Linux, Python 3.10.6, and using the v8 triage version. My database is in PostgreSQL 15.7 on x86_64-pc-linux-gnu, compiled by GCC (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0, 64-bit.
All the presented code worked fine until I used the feature_group_strategies in leave-one-out or leave-one-in. In both cases, I get the same error (detailed below). However, when I use feature_group_strategies: ['all-combinations'], it works, but it doesn't group the variables as expected, and I get results as if I were using all.
Everything runs smoothly until the matrix building step, where I encounter this error:
2024-09-08 15:17:14 - ERROR Child error
Traceback (most recent call last):
File "/Ccd/-pyenv/versions/tri-hp/lib/python3.10/site-packages/triage/experiments/multicore.py", line 166, in run_task_with_splatted_arguments return task_runner(**task)
File "/Ccd/pyenv/versions/tri-hp/lib/python3.10/site-packages/triage/component/architect/builders.py", line 321, in build_matrix
output, labels = self.stitch_csvs(feature_queries, label_query, matrix_store, matrix_uuid)
File "/Ccd/pyenv/versions/tri-hp/lib/python3.10/site-packages/triage/component/architect/builders.py", line 551, in stitch_csvs
if len(df_pl.get_column('as_of_date').head(1)[0].split)) > 1:
File "/Ccd/.pyenv/versions/tri-hp/lib/python3.10/site-packages/polars/dataframe/frame.py", line 6128, in get_column return self[name]
exceptions.ColumnNotFoundError: as_of_date
It seems like the as_of_date column is missing or not properly generated during matrix building, specifically when using the leave-one-out or leave-one-in strategies.
I expected the leave-one-out strategy to group variables accordingly and generate matrices without this error, but instead, the process halts when it reaches matrix building. I checked the matrices generated in the process and confirmed that the as_of_date column is indeed present.
My questions would be:
Is this a known issue with these feature grouping strategies?
Could this be related to how the as_of_date column is handled with these strategies?
Any guidance or suggestions would be greatly appreciated!
Hi everyone,
I'm running triage over [Red Hat 11.3.1-4] on Linux, Python 3.10.6, and using the v8 triage version. My database is in PostgreSQL 15.7 on x86_64-pc-linux-gnu, compiled by GCC (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0, 64-bit.
Configuration details:
All the presented code worked fine until I used the
feature_group_strategies
inleave-one-out
orleave-one-in
. In both cases, I get the same error (detailed below). However, when I usefeature_group_strategies: ['all-combinations']
, it works, but it doesn't group the variables as expected, and I get results as if I were usingall
.Command used:
Everything runs smoothly until the matrix building step, where I encounter this error:
It seems like the
as_of_date
column is missing or not properly generated during matrix building, specifically when using theleave-one-out
orleave-one-in
strategies.I expected the
leave-one-out
strategy to group variables accordingly and generate matrices without this error, but instead, the process halts when it reaches matrix building. I checked the matrices generated in the process and confirmed that the as_of_date column is indeed present.My questions would be:
as_of_date
column is handled with these strategies?Any guidance or suggestions would be greatly appreciated!
Thank you for your help.