I am trying to run sample code:
from ts.flint import FlintContext
from ts.flint import summarizers
from ts.flint import TimeSeriesDataFrame
from pyspark.sql.functions import from_utc_ti…
I am using Spark 3.0; perhaps this is the cause of the following error:
`val tsRDD = TimeSeriesRDD.fromDF(dataFrame = dffg_ts)(isSorted = true, timeUnit = MILLISECONDS)`
# Error
# Step 2: specify how the Spark dataframe should be interpreted as a time series by Flint
Hi - I am getting the following error when trying to run the Python example:
Traceback (most recent call last):
File "", line 1, in
File "/Users/__/flint-master/python/ts/flint/dataframe.…
Two joined DataFrames should be grouped by a third (clock) DataFrame.
But an error is thrown:
`java.lang.NoClassDefFoundError: Could not initialize class com.twosigma.flint.rdd.function.group.Intervalize$`
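For readers unfamiliar with what "grouping by a clock" means here, the following is a minimal pure-Python sketch of the idea, not Flint's implementation. It assumes half-open intervals `[clock[i], clock[i+1])` keyed by their start time; the function name `group_by_clock` and the sample data are hypothetical, and Flint's own inclusion and rounding conventions may differ.

```python
from bisect import bisect_right
from collections import defaultdict


def group_by_clock(rows, clock):
    """Group (time, value) rows into the clock interval that contains them.

    Assumption of this sketch: intervals are [clock[i], clock[i+1]) and each
    group is keyed by the interval start. `clock` must be sorted ascending.
    Rows before the first tick or at/after the last tick are dropped.
    """
    groups = defaultdict(list)
    for t, value in rows:
        i = bisect_right(clock, t) - 1  # index of the last tick <= t
        if 0 <= i < len(clock) - 1:
            groups[clock[i]].append(value)
    return dict(groups)


# Hypothetical sample data: clock ticks and timestamped rows.
clock = [0, 10, 20]
rows = [(1, "x"), (9, "y"), (10, "z"), (25, "w")]
grouped = group_by_clock(rows, clock)
```

Here the row at time 25 falls outside every interval and is dropped; a real library would let you choose how such boundary rows are handled.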
Hi, n00b question: given how awesome and popular Flint has been, I'm really interested in making it work with Spark 3.0.
So I went ahead and tried the changes in https://github.com/twosigma/flint/pu…
Whenever I try to use Flint here locally (no Hadoop/EMR involved), it keeps barfing at me with the above error message in the subject. It's a setup on top of Python 3.7 with PySpark 2.4.4 and OpenJDK 8…
Does this library currently work with Spark 2.4?
Even though the "timeColumn" argument error can be bypassed by renaming the column in question to `time`, the `joinLeft` is not working for me:
print( sc.version )
print( tm )
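To make the expected behavior of a left temporal join concrete, here is a small pure-Python sketch. It is not Flint code: it assumes the join attaches to each left row the most recent right row at or before the same timestamp, within a tolerance, and that both inputs use a `time`-like first field and are sorted. The name `left_join_asof` and the sample rows are hypothetical.

```python
from bisect import bisect_right


def left_join_asof(left, right, tolerance=0):
    """Left temporal join over (time, value) rows sorted by time.

    Assumption of this sketch: each left row is matched with the latest
    right row whose time is <= the left time and no more than `tolerance`
    earlier. Unmatched left rows are kept with None on the right side.
    """
    right_times = [t for t, _ in right]
    joined = []
    for t, left_value in left:
        i = bisect_right(right_times, t) - 1  # last right row with time <= t
        if i >= 0 and t - right_times[i] <= tolerance:
            joined.append((t, left_value, right[i][1]))
        else:
            joined.append((t, left_value, None))
    return joined


# Hypothetical sample data with millisecond timestamps.
left = [(1000, "a"), (2000, "b"), (3000, "c")]
right = [(900, 1.0), (2000, 2.0)]
result = left_join_asof(left, right, tolerance=100)
```

Comparing a few rows of the real join output against a reference like this can help tell a semantic misunderstanding apart from a version incompatibility.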
I am able to convert a Spark DataFrame to a Flint DataFrame, and applying the summarizeWindows function also works.
After applying any Flint function, I can't access the df (e.g. I can't convert it back to Spark …