-
Today, RoPE in decode mode is implemented as a matmul, where `rot_mat` is precomputed on the host from the sin/cos values for each user's `position_id`.
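To make the matmul formulation concrete, here is a minimal pure-Python sketch, not the actual implementation. It assumes the interleaved pairing convention (channels `2i` and `2i+1` rotate together) and a base of 10000. The helper names `rope_angles`, `build_rot_mat`, `apply_matmul`, and `apply_elementwise` are hypothetical. It builds a block-diagonal rotation matrix for one position and checks that multiplying by it matches applying sin/cos directly.

```python
import math

def rope_angles(position, head_dim, base=10000.0):
    # One angle per channel pair: theta_i = position / base^(2i / head_dim).
    # The base of 10000 is an assumption, not taken from the issue.
    return [position / (base ** (2 * i / head_dim)) for i in range(head_dim // 2)]

def build_rot_mat(position, head_dim, base=10000.0):
    # Block-diagonal matrix of 2x2 rotations, laid out for a row vector: y = x @ R.
    rot = [[0.0] * head_dim for _ in range(head_dim)]
    for i, theta in enumerate(rope_angles(position, head_dim, base)):
        c, s = math.cos(theta), math.sin(theta)
        a, b = 2 * i, 2 * i + 1  # assumed interleaved pairing
        rot[a][a] = c
        rot[a][b] = s
        rot[b][a] = -s
        rot[b][b] = c
    return rot

def apply_matmul(x, rot):
    # Plain row-vector times matrix product.
    n = len(x)
    return [sum(x[k] * rot[k][j] for k in range(n)) for j in range(n)]

def apply_elementwise(x, position, base=10000.0):
    # The same rotation computed directly from sin/cos, without a matrix.
    out = list(x)
    for i, theta in enumerate(rope_angles(position, len(x), base)):
        c, s = math.cos(theta), math.sin(theta)
        a, b = 2 * i, 2 * i + 1
        out[a] = x[a] * c - x[b] * s
        out[b] = x[a] * s + x[b] * c
    return out

x = [1.0, 2.0, 3.0, 4.0]
via_matmul = apply_matmul(x, build_rot_mat(position=7, head_dim=4))
via_sincos = apply_elementwise(x, position=7)
```

In this sketch the matrix depends on the position, so one matrix per user has to be rebuilt on every decode step, whereas the sin/cos form only needs a lookup of the angles for that position.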
What we want is for the `sin,cos: [max_seq_len=128k, he…
-
A full batch run in debug mode can take up to 20 minutes. This is especially cumbersome for maintainers, as a batch run is now required when making changes to the framework. One culpri…
-
I was using load image batch on incremental with no issues for quite some time to load a folder of inpainting candidates in order to create multiple variants. It was very effective; however, somethi…
-
Hello everyone,
When generating a bitstream for my CNN model using Vitis AI 3.5, I run into the problem that no model is generated, yet I do not get any error message. Could any…
-
Thanks for this awesome tool.
### Describe the question.
For my experiments, I need to modify a piece of open-source code. Due to the time-consuming nature of the transformation and the large size of…
-
After _n_ edits of a master zone, there will in the worst case be _n_ NOTIFY+AXFR actions, which I think is bad (or not really necessary), especially for large zones.
I was just wondering if there is…
-
To compile `aten._native_batch_norm_legit*`, we need a batch norm op in `tt-metal`.
1. Training mode requires native support (to update parameters on the fly)
2. Probable performance gain in infer…
-
Is the intent of "Report Partition" to ensure that reports are only aggregated together if they share the same label? If so, you may want to define a [batch mode](https://ietf-wg-ppm.github.io/draft-i…
-
### Component(s)
receiver/kafka
### What happened?
We wanted to collect, graph, and alert on lag for the kafka receiver, but observed unexpected behavior when observing the `otelcol_kafka_rec…
-
Hi,
I tested the speed and match results using batch mode and non-batch mode respectively and found: although batch mode is faster, its accuracy is worse than non-batch mode. I have checked the ma…