-
### Feature Description
It would be awesome if we had a similar mechanism to message annotations but specifically for arbitrary data such as the model used to generate a response, the name/icon/id …
-
For background, see https://github.com/jaegertracing/jaeger/issues/5910
Jaeger `all-in-one` typically runs with `in-memory` or `badger` storage, both of which have a special implementation of Dependencies…
-
### What problem does your feature solve?
Consider validating network settings against the metadata of the objects fetched from the datastore to detect mismatches.
While working on #4911, I…
-
Because DistDGL, and by extension GraphStorm, assumes a shared filesystem to function properly, our SageMaker implementations need to implement various downloads and uploads to "fake" th…
-
Description: Develop a real-time streaming word count application leveraging Apache Spark Streaming's DStream API. Utilize Python to ingest streaming text data from a chosen source, tokenize the wor…
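
A minimal sketch of what such a word count could look like, under stated assumptions: the source is a TCP text socket (the description only says "a chosen source"), the host, port, and batch interval are placeholder values, and the helper names `tokenize`, `count_words`, and `run_streaming_word_count` are hypothetical. It also assumes a PySpark release that still ships the legacy `pyspark.streaming` (DStream) module. The Spark imports are kept inside the driver function so the tokenizing helpers can be exercised without a Spark installation.

```python
import re
from collections import Counter


def tokenize(line):
    """Lowercase a line and split it into word tokens (letters, digits, apostrophes)."""
    return re.findall(r"[a-z0-9']+", line.lower())


def count_words(lines):
    """Pure-Python reference for the counts a single micro-batch should yield."""
    counts = Counter()
    for line in lines:
        counts.update(tokenize(line))
    return dict(counts)


def run_streaming_word_count(host="localhost", port=9999, batch_seconds=5):
    """Start a DStream word count over a text socket (assumed source)."""
    # Imported lazily so the helpers above work without Spark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    # local[2]: one thread for the socket receiver, one for processing.
    sc = SparkContext("local[2]", "StreamingWordCount")
    ssc = StreamingContext(sc, batch_seconds)

    lines = ssc.socketTextStream(host, port)
    counts = (
        lines.flatMap(tokenize)              # one record per word
        .map(lambda word: (word, 1))         # pair each word with a count of 1
        .reduceByKey(lambda a, b: a + b)     # sum counts within each batch
    )
    counts.pprint()                          # print the first records of every batch

    ssc.start()
    ssc.awaitTermination()
```

To try it locally, one option is to feed the socket with `nc -lk 9999` in another terminal and then call `run_streaming_word_count()`. The counts here are per batch; a running total across batches would need a stateful operation such as `updateStateByKey` plus a checkpoint directory.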
-
import json
from pymongo import MongoClient
# Step 1: Read the text file
def read_file(file_path):
    with open(file_path, 'r') as file:
        content = file.read()
    return content
# …
-
Hi all :wave:
I'm using Spark Structured Streaming to stream from one Hudi table to another Hudi table.
I noticed that when the stream started for the first time, each batch was relatively fast, with an…
-
Unit currently buffers the entire client request before sending it to an upstream target. This target could be an application or an upstream HTTP service. Either way, Unit will not start transferring …
-
## 🐛 Bug
When optimizing a dataset, [`BinaryWriter.serialize()`](https://github.com/Lightning-AI/litdata/blob/main/src/litdata/streaming/writer.py#L167) will first flatten the sample dictionary and…
-
Hello,
When I try to read the data with the 2stage_reader and MPI, I get the following error:
`ValueError: could not broadcast input array from shape (79,319140) into shape (80,319140)`
Any ide…