-
Table for locks; LWT to acquire a lock; heartbeat to keep the lock alive; the table is taskdb; option to create a new taskdb once the end of the list is reached, or just keep looping (see the sketch below)
Configurable consistency level on write, read, and a…
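A minimal sketch of the acquire/heartbeat flow with the DataStax Python driver, assuming a `taskdb.locks` table keyed by task id; all names and the 30-second TTL are illustrative:

```python
# Assumed schema:
#   CREATE TABLE taskdb.locks (task_id text PRIMARY KEY, owner text);
from cassandra.cluster import Cluster

session = Cluster(["127.0.0.1"]).connect()

def acquire_lock(task_id: str, owner: str, ttl: int = 30) -> bool:
    # LWT insert: succeeds only if no live (non-expired) row holds the lock.
    rs = session.execute(
        f"INSERT INTO taskdb.locks (task_id, owner) VALUES (%s, %s) "
        f"IF NOT EXISTS USING TTL {ttl}",
        (task_id, owner),
    )
    return rs.was_applied

def heartbeat(task_id: str, owner: str, ttl: int = 30) -> bool:
    # Conditional update refreshes the TTL while we still own the lock.
    rs = session.execute(
        f"UPDATE taskdb.locks USING TTL {ttl} SET owner = %s "
        f"WHERE task_id = %s IF owner = %s",
        (owner, task_id, owner),
    )
    return rs.was_applied
```

If the heartbeat ever fails, the row's TTL expired and another worker may already hold the lock, so the holder should stop work rather than re-acquire blindly.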
-
By default, Spark's streaming capabilities follow a "micro-batching" model, where data is collected into a batch for a window of time. At the end of that window, a batch job is launched on the cluster…
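A minimal PySpark sketch of that model, using the built-in `rate` test source; the 10-second trigger window and console sink are illustrative:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("micro-batch-demo").getOrCreate()

events = (spark.readStream
          .format("rate")                 # built-in test source, one row per tick
          .option("rowsPerSecond", 100)
          .load())

query = (events.writeStream
         .format("console")
         .trigger(processingTime="10 seconds")  # each trigger launches one batch job
         .start())

query.awaitTermination()
```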
-
I have some thoughts about using vLLM for generation. Feel free to correct me if I'm wrong.
1. Batching
It seems that prompts are still passed to vLLM engines in micro rollout batches during `ma…
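For context, a minimal sketch of the offline vLLM `LLM` API, where the whole prompt list is handed to the engine at once and batching/scheduling happens internally; the model name is illustrative:

```python
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B")   # illustrative model id
params = SamplingParams(temperature=0.8, max_tokens=128)

prompts = ["Write a haiku about GPUs.", "Explain KV cache in one line."]
# The engine receives the full list and continuously batches requests itself.
for output in llm.generate(prompts, params):
    print(output.outputs[0].text)
```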
-
**Why do we need window functions?**
When working with big data, window functions help slice things out, like removing duplicates with rank/row_number/dense_rank; without these built-in function…
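A minimal PySpark sketch of that dedup pattern; table and column names are illustrative:

```python
from pyspark.sql import SparkSession, Window
from pyspark.sql.functions import row_number, col

spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame(
    [(1, "a", 10), (1, "a", 12), (2, "b", 7)], ["id", "key", "ts"]
)

# Rank rows within each (id, key) group, newest first.
w = Window.partitionBy("id", "key").orderBy(col("ts").desc())
deduped = (df.withColumn("rn", row_number().over(w))
             .filter(col("rn") == 1)      # keep only the latest row per group
             .drop("rn"))
deduped.show()
```

Without the window function, the same dedup would need a group-by to find the max timestamp plus a re-join back to the full rows, which is exactly the pain the question describes.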
-
Hi Team, does it make sense to support `minimumBatchSize` together with `maxTimeToWait` in Parallel Consumer? The idea is that I want to use the PC as a micro-batch consumer, but since my consumers are faster …
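To make the proposal concrete, a plain-Python illustration of the intended semantics (not the Parallel Consumer API): emit a batch once `minimumBatchSize` records have arrived, or when `maxTimeToWait` elapses, whichever comes first:

```python
import queue
import time

def micro_batches(q: queue.Queue, minimum_batch_size: int, max_time_to_wait: float):
    """Yield batches that are either full or have waited long enough."""
    while True:
        batch, deadline = [], time.monotonic() + max_time_to_wait
        while len(batch) < minimum_batch_size:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break                          # timeout: flush whatever we have
            try:
                batch.append(q.get(timeout=remaining))
            except queue.Empty:
                break
        if batch:
            yield batch
```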
-
use-case: near-real-time sync from an MSSQL db to S3.
I know I can do a while loop calling spark-submit each time, but this will be slow as the JVM needs to start up each time. Is there a way to make metorikk…
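One generic way to avoid the per-run JVM startup (a PySpark sketch, not metorikku-specific) is a single long-running Structured Streaming job; the source below is a placeholder, since streaming CDC from MSSQL needs a dedicated connector, and the bucket paths are illustrative:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("mssql-to-s3").getOrCreate()

# Placeholder source; a real pipeline would read MSSQL change data instead.
stream = spark.readStream.format("rate").load()

(stream.writeStream
       .format("parquet")
       .option("path", "s3a://my-bucket/sync/")            # assumed bucket
       .option("checkpointLocation", "s3a://my-bucket/chk/")
       .trigger(processingTime="1 minute")                 # micro-batch cadence
       .start()
       .awaitTermination())
```

The JVM and the Spark session stay up for the life of the job, so each micro-batch pays no startup cost.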
-
Hi,
I want to know whether I could use pippy's pp capability with deepspeed's zero3 config, so that together they lead to 3D parallelism?
Thx
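For reference, a minimal sketch of the ZeRO-3 side in a DeepSpeed config; whether PiPPy's pipeline stages compose with it is exactly the open question here, so only the ZeRO part is shown and the values are illustrative:

```python
import deepspeed

ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "zero_optimization": {
        "stage": 3,            # partition params, grads, and optimizer states
    },
    "bf16": {"enabled": True},
}

# model and optimizer defined elsewhere:
# engine, optimizer, _, _ = deepspeed.initialize(
#     model=model, optimizer=optimizer, config=ds_config)
```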
-
Retraining from a checkpoint works perfectly with tokenization on the fly, but breaks when using nanoset: training restarts with a different lr, which does not match lr_schedule.pt.
We also have…
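For comparison, a minimal PyTorch sketch of what a correct resume should do, assuming `lr_schedule.pt` holds the scheduler's `state_dict`; the optimizer and schedule here are illustrative:

```python
import torch
from torch.optim import AdamW
from torch.optim.lr_scheduler import LambdaLR

model = torch.nn.Linear(4, 4)
optimizer = AdamW(model.parameters(), lr=3e-4)
scheduler = LambdaLR(optimizer, lambda step: min(1.0, (step + 1) / 100))

# Restore the scheduler so the lr continues from the checkpointed step
# rather than restarting from the beginning of the schedule.
state = torch.load("lr_schedule.pt")
scheduler.load_state_dict(state)
print(scheduler.get_last_lr())
```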
-
I tested llama3 continued training with multi-machine tp4 pp2 dp2. If I enable the grad accum operation, training hangs. The experimental environment is: 16x H800, torch 2.1.2+cu121.
checkpoints:…
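For reference, a generic PyTorch DDP sketch of gradient accumulation (not the exact tp4/pp2/dp2 setup): `no_sync()` skips the gradient all-reduce on non-boundary micro-steps, and a hang of this kind is often a symptom of ranks disagreeing on when to run collectives:

```python
import contextlib
import torch

def train_step(ddp_model, optimizer, batches, accum_steps=4):
    optimizer.zero_grad()
    for i, (x, y) in enumerate(batches):
        boundary = (i + 1) % accum_steps == 0
        # Skip the all-reduce except on the boundary step; every rank must
        # agree on `boundary`, otherwise the collectives mismatch and hang.
        ctx = contextlib.nullcontext() if boundary else ddp_model.no_sync()
        with ctx:
            loss = torch.nn.functional.mse_loss(ddp_model(x), y)
            (loss / accum_steps).backward()
        if boundary:
            optimizer.step()
            optimizer.zero_grad()
```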
-
**Is your feature request related to a problem? Please describe.**
I am consuming data from Pulsar through Spark Structured Streaming in micro-batches.
Right now, what happens is that Spark consume…
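For context, a minimal sketch of the micro-batch read path, assuming the StreamNative pulsar-spark connector; URLs and the topic are illustrative:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

stream = (spark.readStream
          .format("pulsar")
          .option("service.url", "pulsar://localhost:6650")
          .option("admin.url", "http://localhost:8080")
          .option("topic", "persistent://public/default/events")
          .load())

(stream.writeStream
       .format("console")
       .trigger(processingTime="30 seconds")   # micro-batch cadence
       .start()
       .awaitTermination())
```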