-
Hello,
I want to read some files from **Azure Datalake** (ADL) filesystem thought **Jupyter** using **almond** kernel (Scala) and I'm having problems with some dependencies.
Important to note th…
-
parquet-tools merge is extremely time- and memory-consuming when used with block-option.
The merge function builds a bigger file out of several smaller parquet-files. Used without the block-op…
-
I have encountered an issue where the environment consists of
Windows 10 OS,
Java 8,
Embulk version 0.11.0,
JRuby 9.4.3.0,
embulk-input-sqlserver (version 0.13.2),
and embulk-output-s3 (ver…
-
**Is your feature request related to a problem? Please describe.**
The way that CoreNLP server is encapsulated using a subprocess is really nice. I want to be able to use that code for booting the …
-
I'm trying to set up a kafka s3 sink connector that will consume messages in avro format and dump to s3 compatible storage (minio) in parquet format.
This pipe line works for certain topics but fai…
-
### Describe the usage question you have. Please include as many useful details as possible.
In my Flink process function, I receive serialized VectorSchemaRoot data which needs to be deserialized …
-
I have a catalog connecting against hive/s3/parquet (kerberos protected HMS). SELECT * from any of the tables works. But ANALYZE statement (or CREATE TABLE AS ie a managed table) fails with error like…
-
### Describe the problem you faced
Apache Hudi's deltastreamer utility on EMR, with YARN as the scheduler. I've noticed a rather peculiar behaviour that has started causing sporadic errors in my jo…
-
### Search before asking
- [X] I searched in the [issues](https://github.com/apache/incubator-paimon/issues) and found nothing similar.
### Paimon version
0.5.0-incubating
### Compute Engine
fli…
-
An "empty" parquet file, created via pyarrow, seems to be leading to Barrage writing issues.
```python
from deephaven import parquet
my_table = parquet.read("Empty1.parquet")
```
This manif…