-
Here's a simple reproducer.
```
import duckdb
import pyarrow as pa
import pyarrow.dataset
import uuid
import lance
class UuidType(pa.ExtensionType):
def __init__(self):
supe…
-
**Is your feature request related to a problem? Please describe.**
Tried to use `pandas.DataFrame` for outputs, got error:
```
Traceback (most recent call last):
File "/usr/local/lib/python3.1…
-
Probably because we didn't want to think about #42, but a pragmatic approach should be taken
-
**Describe the bug**
- What are you trying to do?
I try to serialize System.Data.DataSet types with Ceras.
- What do you think **should** happen?
They should be serialized to byte[] properly.
M…
-
Right now users have the option of creating tables with duplicate names. In the CSV and Parquet serializers each table is written to a file using that tables name. If there are duplicate names then se…
-
In trying to create a nicer way to access ocean model output (several stacks of netcdf files where each stack can be concatenated, but not necessarily merged into a single `Dataset` object), I've been…
-
PythonDataSource has repr defined on it by the data class. This results in the flat examples getting serialized. This can be a huge performance penalty. The other sources have their repr's defined by …
-
### System Info
When using C++ GptManager, I was using InferenceRequest::serialize to synchronize requests between ranks, not realizing that it does not handle logits post processors.
It would be …
-
In 1 November 2023 meeting we agreed that
1. The json "header" file shall be called "adapt.json" (all lower case)
2. Any GeoParquet or GeoTiff files referenced can be named and nested in subdirecto…
-
**Current behaviour**
Consider the following workflow example:
- `reana.yaml` content:
```yaml
inputs:
files:
- inputs.yaml
- Snakefile
parameters:
input: inputs.yaml
w…