-
When the _first_ column of your dataset contains only nulls, the `skim()` method raises an `UnboundLocalError`.
I think this is very relevant if you download a sample from a database, and by chan…
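
For context, here is a minimal sketch of how an `UnboundLocalError` typically arises when a column has no non-null values. This is not the library's actual code: `first_column_kind` and `first_column_kind_safe` are hypothetical helpers, and the assumption is that a local variable is assigned only on a branch that an all-null column never reaches.

```python
def first_column_kind(columns):
    """Hypothetical helper, for illustration only (not the library's code)."""
    first_name = next(iter(columns))
    non_null = [v for v in columns[first_name] if v is not None]
    if non_null:
        # `kind` is bound only when the column has at least one value
        kind = type(non_null[0]).__name__
    # Raises UnboundLocalError when the branch above was skipped
    return kind


def first_column_kind_safe(columns):
    """Same logic, but with a default bound before the branch."""
    first_name = next(iter(columns))
    non_null = [v for v in columns[first_name] if v is not None]
    kind = "empty"  # default for an all-null column
    if non_null:
        kind = type(non_null[0]).__name__
    return kind


try:
    first_column_kind({"a": [None, None], "b": [1, 2]})
except UnboundLocalError as exc:
    print("all-null first column:", exc)

print(first_column_kind_safe({"a": [None, None], "b": [1, 2]}))
```

Binding a default before the conditional is one common way to avoid this class of error; whether it applies here depends on the library's real implementation.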
-
***
df = pd.DataFrame(['GroupA'] * 5 + ['GroupB'] * 5, columns=['AB'])
df['CD'] = ['C'] * 3 + ['D'] * 3 + ['G'] * 4
df['EF'] = ['E'] * 6 + ['F'] * 2 + ['H'] * 2
df['F'] = np.random.normal(0, 1, 10…
-
I was thinking that `read_csv_lazy()` and `read_parquet_lazy()` might more clearly convey their usage.
-
See below:
```
#3 0x00007ffff7c4526e in __GI_raise (sig=11) at ../sysdeps/posix/raise.c:26
#4 0x00005555561da46e in absl::lts_20240116::AbslFailureSignalHandler(int, siginfo_t*, void*) ()
#5 …
-
Thanks for your great project! How can I run inference on GPUs? It seems that the example code (below) does not support GPUs.
```
import matplotlib.pyplot as plt
import pandas as pd
import numpy as n…
-
After updating to the latest version of pandrator, dependencies are missing:
[INFO] XTTS Training: ModuleNotFoundError: No module named 'df'
-
Based on the discussion on Discourse, I would like to suggest a parameter to skip missing datetime records (remove gaps between candles), for example `df.hvplot.ohlc(skip_missing_data=True)`.
original post…
-
### What happens?
From my understanding of https://duckdb.org/docs/sql/expressions/star.html#struct, I should be able to apply `struct.*` anywhere, including on the result of an aggregate.
### …
-
If I do not reset the index of `df_train` with `df_train.reset_index(drop=True, inplace=True)`, an error is raised.
code:
`X = np.array([2*i for i in range(100)],dtype=float)
y = np.array([(0.35*x+1)**2 for x i…
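
To illustrate why resetting the index can matter, here is a small pandas sketch. It assumes the error comes from label-based lookups on a non-contiguous index, which the report above does not confirm. The data and the `x`/`y` column names are made up for the example.

```python
import pandas as pd

# Toy data; the real df_train comes from the reporter's own split
df = pd.DataFrame({"x": range(6), "y": [2 * i for i in range(6)]})

# Selecting a subset keeps the original row labels: 0, 2, 4
df_train = df[df["x"] % 2 == 0]
labels_before = list(df_train.index)

try:
    df_train.loc[1]  # label 1 is not in the subset
except KeyError:
    print("label 1 is missing before reset_index")

# After the reset the labels are contiguous again: 0, 1, 2
df_train = df_train.reset_index(drop=True)
labels_after = list(df_train.index)
print(labels_before, labels_after)
```

Code that iterates with `range(len(df_train))` and looks rows up by label would fail on the first frame and work on the second.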
-
The current implementation of the `__call__` method in `nemo_curator/modules/fuzzy_dedup.py` assumes that at least one LSH duplicate will be found, and the results will be saved as a parquet file…
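
One possible shape for a guard is sketched below, under assumptions: `write_duplicates_if_any` is a hypothetical helper, not part of NeMo Curator, and it uses plain pandas rather than whatever dataframe backend the module actually uses. The idea is only that the "no duplicates" case is handled explicitly before anything is written.

```python
import pandas as pd


def write_duplicates_if_any(duplicates: pd.DataFrame, path: str) -> bool:
    """Hypothetical helper: write the parquet file only when rows exist.

    Returns True if a file was written, False if there was nothing to save.
    """
    if duplicates.empty:
        # No duplicates found: skip the write so callers can branch on it
        return False
    # Requires a parquet engine such as pyarrow to be installed
    duplicates.to_parquet(path)
    return True


no_duplicates = pd.DataFrame({"id": [], "bucket": []})
written = write_duplicates_if_any(no_duplicates, "unused.parquet")
print("file written:", written)
```

A caller could then skip the downstream steps when the helper returns `False` instead of trying to read a file that was never created.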