arithmetic-computation Search Results

1000+ results
for arithmetic-computation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

arcee-ai/mergekit #328

`extract_lora.py` can't handle mismatched `lm_head` tensor d…

I tried to extract a LoRA from `Xwin-LM/Xwin-Math-70B-V1.1` and got this: ``` delta_weight = new_weight - base_weight ~~~~~~~~~~~^~~~~~~~~~~~~ RuntimeError: The size of tensor…

jukofyork updated 5 months ago
10
ExponentialDeepSpace/exponentialdeepspace.github.io #17

Intelligence systems

NirViaje updated 3 years ago
20
breandan/kotlingrad #11

Scala DSL

##### Purpose Introducing a secondary tensor operation DSL (Domain Specific Language) written in & optimised for Scala language & various compilers (the most common of which are JVM based scalac 2.…

tribbloid updated 3 years ago
23
NVIDIA/Megatron-LM #396

fp8 transformer engine only brings 35% speed up?

Hi there, I've used Megatron to train 13B gpt model on a H100 machine. Before I use fp8 transformer engine, the speed of the training is about 0.34s/step. After I enabled the fp8 transformer engi…

FeixLiu updated 3 weeks ago
4
pytorch/pytorch #60277

Sparse CSR tensor should not accept equal column indices in …

## 🐛 Bug Currently, one can construct a CSR tensor that has equal column indices in the same row. In principle, this corresponds to "uncoalesced CSR tensor" that we are not supposed to have. In…

pearu updated 3 years ago
2
pytorch/pytorch #27542

Make topk sort stable

## 🐛 Bug torch.topk with sorted=True doesn't return a result that is consistent across different values of k when dealing with duplicates values. The position of duplicated values in the returned s…

volcacius updated 4 years ago
18
rleonid/oml #142

weights in multinomial

a nitpick: the weights vector is summed up and tested for summing to 1. if you're gonna sum it up anyway, why not allow arbitrary positive weights and normalize the weight vector? oml/src/lib/stats/…

nilsbecker updated 8 years ago
8
sympy/sympy #5837

Make a distinction between operations and their result

Supporting unevaluated operations like Mul(3, 4, evaluate=False) occasions a lot of headaches (for instance issue #5783 ). I think that the root cause of this is that we try to represent 2 very differ…

rlamy updated 2 years ago
15
gazebosim/sdformat #95

allow multiple <inertial> blocks in a single Link element

**Original report ([archived issue](https://osrf-migration.github.io/sdformat-gh-pages/#!/osrf/sdformat/issues/95)) by John Hsu (Bitbucket: [hsu](https://bitbucket.org/%7B0a186eae-abf0-4514-a951-23db5…

osrf-migration updated 8 years ago
16
bigcode-project/bigcode-inference-benchmark #3

Improve inference speed of multi-query attention model

[The multi-query attention paper](https://arxiv.org/pdf/1911.02150.pdf) reports up to 10x speed-ups compared to incremental decoding with multi-head attention model. We've implemented multi-query atte…

harm-devries updated 2 years ago
2

上一页 1...92 93 94 95 96 97 98...100 下一页

1000+ results for arithmetic-computation

1000+ results
for arithmetic-computation