-
Hi! Thank you for this great library. We used it to process our large input gz files, but we faced with some problem.
```
java.lang.IllegalArgumentException: The provided InputSplit (786432000;78643…
-
Hello,
Is there a way or package that could help me by doing visualizations with sparklyr?
I saw that SparkR suppports ggplot.
I do really appreciate your help
-
Greetings,
I would like to implement IO for some genomic file formats. The specifications for the file formats I would like to be able to read directly into Dask dataframes follow the [SAM](https:…
ghost updated
3 years ago
-
I am running Datalab on data proc cluster and when I run dataframe.write.csv() I got this error.
Can someone help me please.
`Exception: Python in worker has different version 2.7 than that in dri…
-
I have a question: why does not the same function work?
dplyr::ntile
```
function (x, n)
{
len % mutate_at(vars, funs(ntile(., n = 5)))`
**doens't work**
`df %>% select(vars) %>% m…
-
https://github.com/michaeloc/its_research/blob/1aa8a1ce3d24e800b6419e2b00f967a4cb2331ba/building_trajectories/sentences.py#L81-L82
Pensei em iterar sobre o conjunto de trajetórias candidatas e a pa…
-
I did some test consume kafka message, write to iceberg table by Spark structed streaming. I'm having some trouble.
1.My environment
```
Spark version:3.0.0
Iceberg version:0.9.0
```
2.Creat…
-
Hello! I want to know how to implement lag in spark streaming using sparklyr
---
your brief description of the problem
```r
spark_version %
mutate(r= lag(timestamp))
```
I get erro…
-
I have services using Firehose to create parquet files of application data, in which one of the columns is a json document that is the record from MongoDB. I check the data frame prior to using wr.s3…
-
Hello Community :wave:
This topic was actually started by @maulikjs [here](https://github.com/thanos-io/thanos/issues/2251#issuecomment-635488249) but I would love to reshape the topic to the main,…