-
Next Demo Day: December 5th
---
See what the Dask community has been up to, or share some Dask work of your own. Demos are short and informal (~5-10 minutes). Have something you'd like to share? L…
-
Use profiling tools to remove bundle processing overhead in the SDK harness.
Imported from Jira [BEAM-13015](https://issues.apache.org/jira/browse/BEAM-13015). Original Jira may contain additional co…
-
## Description
In preparations for Notifications Pipeline execution we need to finalize our plan for execution. This is necessarily both comprehensive and immediately actionable, laying out the ove…
-
### System information
- **Have I written custom code (as opposed to using a stock example script
provided in TensorFlow Model Analysis)**: yes
- **OS Platform and Distribution (e.g.,…
-
This is a reiteration of [closed issue](https://github.com/tensorflow/data-validation/issues/234)
to my understanding this has been already fixed for m1/m2 chips in [#251](https://github.com/tensor…
-
Currently, I/O is strewn around this code, base and we keep reinventing the wheel as a result. We are using many different ways of interacting with the filesystem and GCS. These include
- gcsfs/fss…
-
In BEAM-13203 `SnsIO.writeAsync` was deprecated to address the risk of data loss in that transform. Unfortunately, it has considerable issues in design and implementation with respect to the beam pro…
-
Now that we've established that there is no bug in the SparkRunner over HDFS as described [here](https://github.com/eljefe6a/beamexample#running-example-code-in-spark).
I was wondering if you have pla…
-
Hello, for testing reasons we wanted to see if approximate vocabulary was faster than vocabulary when there are many features (we have 36 features to analyze). In the past we hit the graph too large e…
-
To record Elliotte's idea:
> We have two different special cases here, that use different algorithms, but the same approach should work in both. I wonder if we should have a more general solution t…