-
I am writing a Spark job in Java. The runJob() method returns the expected output.
Now I want to cache the output using NamedObjects. It works fine in Scala, but with Java it doesn't store anything in Cac…
-
### Is your feature request related to a problem?
I have been implementing data indexing scripts in PySpark, where, given a large input dataset, we want to chunk it up and index each chunk as a dataf…
-
An NPE happens on this line:
`values.mapNotNull { it.takeIf { it.nrow > 0 }?.schema() }.intersectSchemas()`
Code that leads to it:
![image](https://github.com/Kotlin/dataframe/assets/12936457/2e3…
-
DataFrame fails on simple actions when casting BigInteger to Long.
For example, the MySQL performance schema table_handles table is defined as:
```
show create table performance_schema.table_ha…
-
If you have a `DataFrame` consisting of a mix of `Number` types, calling `median` on it will break.
Example:
```
val df = dataFrameOf("a")(
1,
2L,
2.0f,
3.0,
)
df.median…
-
I saved a model (KNNClassificationModel) using Java serialization, and when I use it later I always get **_java.lang.IllegalArgumentException: Flat hash tables cannot contain null elements._**
on the d…
-
Hi team,
I am facing an issue getting the lineage for loading data into `elasticsearch` from a `csv` file.
### Spark Job
```python
from pyspark.sql import SparkSession
spark = SparkSession.…
-
Our code relies on Java (our own code) and Scala libraries (ML, graph), and it would be very helpful to be able to convert the Java Dataframe and Session to Scala so that we can use both interoperably. I…
-
dataframe: 0.9.1
## error reporting
I had to check why the code generation fails by running my input through one of the already set-up tests in this repository, because the only error message I …