-
I am applying around 400 data quality (DQ) checks to a table with 30M rows and 250 columns, and around 25% of these checks only apply to a subset of rows. There is too much data to use pandas DataFrames. I…
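A minimal sketch of the kind of subset-scoped check I mean, assuming plain PySpark; the source path and the column names (`status`, `amount`) are made up for illustration:

```python
# Hedged sketch: run one "subset-only" data quality check with PySpark,
# so the full 30M-row table never needs to fit into a pandas DataFrame.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("s3://bucket/big_table")  # hypothetical source

# Example check: "amount must be non-negative", but only for completed orders.
subset = df.filter(F.col("status") == "COMPLETED")      # hypothetical columns
violations = subset.filter(F.col("amount") < 0).count()
applicable = subset.count()

print(f"check violated on {violations} of {applicable} applicable rows")
```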
-
The ability to write the query result to a file (e.g. via `--output-path`) should be extended to other formats such as compressed JSON, Parquet, CSV, etc.
This would extend Rumble to a co…
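This is not Rumble's API, just a PySpark sketch of the writers such a feature would presumably delegate to, since Rumble already runs on top of Spark; paths are placeholders:

```python
# Hedged sketch: the requested output formats, written with plain Spark.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()
result = spark.read.json("query-result.json")  # hypothetical materialized query result

result.write.mode("overwrite").option("compression", "gzip").json("out/json-gz")
result.write.mode("overwrite").parquet("out/parquet")
result.write.mode("overwrite").option("header", True).csv("out/csv")
```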
-
Could you add support for converting `mdf4` to `parquet`? It is really important for people working with Spark.
I know `asammdf` has the ability to convert to `parquet`, but sometimes it doesn't wor…
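For context, this is the workaround I use today (a hedged sketch, not a proposed API): asammdf loads the MF4 file into pandas, and pandas writes Parquet for Spark. File names are examples, and `df.to_parquet` needs pyarrow or fastparquet installed:

```python
# Hedged sketch of the MF4 -> pandas -> Parquet workaround.
from asammdf import MDF

mdf = MDF("measurement.mf4")           # example input file
df = mdf.to_dataframe()                # one wide pandas DataFrame
df.to_parquet("measurement.parquet")   # requires pyarrow or fastparquet
```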
-
Please fill out the form below.
### System Information
- **Spark or PySpark**: PySpark
- **SDK Version**: sagemaker-pyspark==1.2.2.post0
- **Spark Version**: 2.4.2
- **Algorithm (e.g. KMeans)**…
-
Hey @kvnkho
I'm back using `fugue` again. I was wondering what the canonical `fugue` method is for loading multiple CSVs into `fugue`? I can write this up as a recipe afterwards if you like.
I have…
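I'm not claiming this is the canonical way; it's just the pattern I fall back to: let the execution engine (Spark here) glob the CSVs, then hand the result to Fugue's `transform()`. The file pattern, the `value` column, and the `add_flag` function are made-up examples:

```python
# Hedged sketch: load many CSVs with Spark, then process them through Fugue.
import pandas as pd
from pyspark.sql import SparkSession
from fugue import transform

spark = SparkSession.builder.getOrCreate()
df = spark.read.csv("data/part-*.csv", header=True, inferSchema=True)

def add_flag(pdf: pd.DataFrame) -> pd.DataFrame:
    # placeholder per-partition transformation
    pdf["flag"] = pdf["value"] > 0
    return pdf

result = transform(df, add_flag, schema="*,flag:bool", engine=spark)
result.show(5)
```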
-
**Is your feature request related to a problem? Please describe.**
We are trying to consume two different sources: one remote (e.g. BigQuery) and one local Parquet file, which is currently not po…
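Purely as illustration of the workflow we're after (outside this project, with placeholder table and path names, and an example connector version): plain PySpark with the spark-bigquery-connector can already join a remote BigQuery table against a local Parquet file:

```python
# Hedged sketch: joining a remote BigQuery source with a local Parquet file in Spark.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    # example connector coordinates; pick the version matching your Spark/Scala build
    .config("spark.jars.packages",
            "com.google.cloud.spark:spark-bigquery-with-dependencies_2.12:0.32.2")
    .getOrCreate()
)

remote = spark.read.format("bigquery").option("table", "project.dataset.events").load()
local = spark.read.parquet("/data/users.parquet")

remote.join(local, on="user_id", how="inner").show(5)
```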
-
# draft 1
~~1. Command-Line Tasks
1.1 Getting Started with the Scala REPL
1.2 Loading Source Code and JAR Files into the REPL
1.3 Getting Started wit…
-
Hi.
I downloaded the current source code (commit [0e5b8c3](https://github.com/spidru/JGribX/commit/0e5b8c3e2d1b52cb9578bda811ac30b6ad2ab15e)) and tried to build it with Gradle in IntelliJ. When doing i…
-
Most of the articles and blog posts talk about the SQL-via-DataFrames approach, so I'm just wondering: **Can I run SQL queries via spark-rapids [Spark SQL shell]?**
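For what it's worth, here is a hedged sketch of what I have in mind: the RAPIDS Accelerator is registered as a Spark plugin, so the same SparkSession that accelerates the DataFrame API should also handle `spark.sql()` queries (assuming the RAPIDS jars are already on the classpath; the data path is a placeholder):

```python
# Hedged sketch: running plain SQL through a RAPIDS-enabled SparkSession.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .config("spark.plugins", "com.nvidia.spark.SQLPlugin")
    .config("spark.rapids.sql.enabled", "true")
    .getOrCreate()
)

spark.read.parquet("/data/sales.parquet").createOrReplaceTempView("sales")
spark.sql("SELECT region, SUM(amount) AS total FROM sales GROUP BY region").show()
```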
-
Let's add support for [`InMemoryRelation`](https://spark.apache.org/docs/1.3.1/api/java/org/apache/spark/sql/columnar/InMemoryRelation.html) (see also InMemoryRelation [internals](https://jaceklaskows…
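For background, a small PySpark sketch (an assumption about the typical repro, not part of this request): `InMemoryRelation` is the plan node Spark inserts when a dataset is cached, so it shows up as soon as you cache a DataFrame and inspect its plan:

```python
# Hedged sketch: surface InMemoryRelation in a query plan by caching a DataFrame.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

df = spark.range(1_000_000)
df.cache()
df.count()          # materialize the cache

# The explained plan now contains InMemoryRelation / InMemoryTableScan nodes.
df.explain(True)
```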