-
Related to #294
Related to #901
While reviewing #798, I dug further into why we need some Hadoop dependencies.
Essentially, Parquet uses `org.apache.hadoop.conf.Configuration` from `hadoop-common`…
-
### Related problem
In [this PR](https://github.com/nushell/nushell/pull/4980), we introduced "search terms": words that help you search for commands (check `help commands`). Currently, only `prepe…
-
### Data Owner Name
FileDrive Labs
### Data Owner Country/Region
China
### Data Owner Industry
Life Science / Healthcare
### Website
https://filedrive.io/
### Social Media
`…
-
### Library Version
4.6.0+
### OS
all
### OS Architecture
64 bit
### How to reproduce?
It seems that in the recent improvements, the mechanism that calculates the MaxDefinitionLevel was also up…
-
... and workers to OOM
e.g. IDEA-CCNL/laion2B-multi-chinese-subset
```python
In [1]: import fsspec; import pyarrow.parquet as pq
In [2]: url = "https://huggingface.co/datasets/IDEA-CCNL/laion2…
-
For 1.0.0 we should have a validator that:
1) Tests not just the metadata but also the data itself, to make sure it matches the metadata
2) Is user-friendly, not requiring Python. Ideally a web…
-
Input file: .snappy.parquet file
size: 1MB
Hardware Overview:
Model Name: MacBook Pro
Model Identifier: MacBookPro16,1
Processor Name: 6-Core Intel Core i7
Processor Speed: 2,6 GHz
Nu…
-
**Description**
I tried to run the Engine using GCS as the backing store. On LoadTable, I get an `Internal Error` response.
**To Reproduce**
Everything below was done from the `main` branch, since …
-
### Preflight Checklist
- [X] I have installed the [latest version of Storage Explorer](https://github.com/Microsoft/AzureStorageExplorer/releases/latest).
- [X] I have checked existing resources, in…
-
**Parquet Viewer Version**
2.7.0.3
**Where was the parquet file created?**
C#
**Sample File**
[NullableGuid.zip](https://github.com/mukunku/ParquetViewer/files/11495851/NullableGuid.zip)
*…