zero-overhead Search Results

1000+ results
for zero-overhead

Best match


PennyLaneAI/pennylane #4635

[BUG] Performance digression of decomposition-based differen…

### Feature details In #4585, the decomposition of `TmpPauliRot`, a helper object of `SpecialUnitary`, was changed in order to allow the new `DefaultQubit` device to differentiate `SpecialUnitary`. …

dwierichs updated 1 month ago
3
bytedance/monoio #296

suggestion: allow to register hook before executor goes park

**Is your feature request related to a problem? Please describe.** I have a special use case for my buffer pool, ```rust struct BufferPool { keep_capacity: usize, buffers: UnsafeCell, }…

Wireless4024 updated 2 weeks ago
3
HazyResearch/deepdive #455

Filter out fixed-zero-weight factors in grounding

In a common use case: - train the model on a small sample of data, - save the weight table, - and then reuse the learned weights for inference on a larger dataset. When doing the inference, many f…

zifeishan updated 8 years ago
1
Hopsan/hopsan #481

Abort or warn if divide by zero

--- Author Name: **Peter Nordin** (@peterNordin) Original Redmine Issue: 481, https://flumes.iei.liu.se/redmine/issues/481 Original Date: 2012/03/20 --- Abort or warn if divide by zero Right n…

hopsan-bot updated 6 years ago
1
maedoc/libtvb #89

Parallel eval hist get & scheme

The latter can run in parallel with dfun eval, something like ``` c #pragma omp parallel sections { #pragma omp section { (*hist->get_next_non_zero)(hist, c); } #pragma omp s…

maedoc updated 8 years ago
1
mlb2251/lambdas #5

add type inference for TypeRefs

right now `infer()` only works with `Type`s not `TypeRef`s. The only challenge here is in the App case when you construct `t1 -> t2` then unify that with something else. The issue is t1 and t2 migh…

mlb2251 updated 1 year ago
2
fifthweek/fifthweek-web #37

How do we split-test features?

We need to set up a system that allows painless split-testing. We must not be inhibited from split-testing features, no matter how trivial. Therefore the system must come at next-to-zero added overhead to …

ljwagerfield updated 9 years ago
1
rosikand/torchplate #11

Interface with Hugging Face Accelerate for distributed train…

Create a new `distributed_train` function in `torchplate.experiment.Experiment` which interfaces with [Hugging Face Accelerate](https://huggingface.co/docs/accelerate/quicktour) for zero-overhead dist…

rosikand updated 1 year ago
2
StanfordLegion/legion #1732

Realm GPU Profiling Is Not Precise

Realm provides [a profiling response for measuring the upper and lower bound on when kernels launched by a GPU task are executed](https://gitlab.com/StanfordLegion/legion/-/blob/master/runtime/realm/p…

lightsighter updated 3 weeks ago
10
tenstorrent/tt-metal #12282

Optimize Dispatch Time for Mixtral Decode Ops

Dispatch time can be a limiting factor for perf, especially for decode, where op device latencies are low. We show a traced Mixtral decode perf below: If dispatch went to 0, perf boost would be: 1.…

sraizada-tt updated 4 weeks ago
8

