samelamin / spark-bigquery

Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Apache License 2.0
70 stars 28 forks source link

Feature Request: DML Operations - Update, Delete Support #37

Closed kurtmaile closed 7 years ago

kurtmaile commented 7 years ago

Hi Sam,

Hope you are havingt a great holiday! Thought Id add this to track as we discussed and you thught simple enough to do.

As discussed would be great to be able to invoke these adhoc DML commands when required. The use-case can include things like hygene / cleaning up data controlled via a periodic spark jobs (e.g periodic removal of any duplicates, updating a corrupted column (e.g bq_load_timestamp from a previous bug etc). The

They dont return a DataFrame in the classical sense, however these are more for jobs that help to maintain data quality.

Cheers mate!

samelamin commented 7 years ago

This has been fixed on v0.2.2