associative-scan-rnn Search Results

Lightning-AI/lightning-thunder #1134

Reentrant JIT for higher order operators

## 🚀 Feature Add support for PyTorch Callable -> Thunder Callable translation in Thunder JIT. ### Motivation Several PyTorch operators accept a Python function with PyTorch operations inside …

IvanYashchuk updated 4 weeks ago

pytorch/pytorch #50688

[feature request] `torch.scan` (also port `lax.fori_loop` / …

Exists in TF: https://www.tensorflow.org/api_docs/python/tf/scan (or some variant inspired by scan), JAX/LAX: https://jax.readthedocs.io/en/latest/_autosummary/jax.lax.scan.html, (old theano: https://…

vadimkantorov updated 1 month ago

pytorch/pytorch #95408

Parallel Associative Scan

### 🚀 The feature, motivation and pitch It would be great to have a general parallel prefix sum (associative scan) operation in PyTorch, something like [associative_scan](https://jax.readthedocs.io…

abdulfatir updated 1 month ago

pytorch/pytorch #105279

[Dynamo][Compile]Torch compile with dynamic shapes not worki…

### 🐛 Describe the bug My networks rely on varying shapes during training as well as during inference. Thus, I tried to use `torch.compile(... dynamic=True)` as well as the `torch._dynamo.optimize(..…

bohnstingl updated 7 months ago

pytorch/rl #2325

[Discussion] Remember TorchRL: the state of memory in TorchR…

# Remember TorchRL: the state of memory in TorchRL Hello! This is a discussion post to recap the state of memory models in TorchRL: what's doable, what's not doable, what is the way to do things, a…

matteobettini updated 2 months ago

lucidrains/minGRU-pytorch #8

Does this implementation handle the vanishing gradient probl…

We all know RNNs have this problem. While the paper "Were RNNs All We Needed?" focuses on parallelism, does it also lay down any changes to handle vanishing gradients? Just curious.

ParikhKadam updated 2 weeks ago

glassroom/heinsen_sequence #1

Comparison to existing algorithm

Hello, thanks for your work. I wonder what is the difference between the proposed algorithm and sect.1.4.1 in https://www.cs.cmu.edu/~guyb/papers/Ble93.pdf.

sustcsonglin updated 1 week ago

ivy-llc/ivy #27501

Failing Tests - Ivy Functional API

# Failing Tests > Please see the failing tests divided into sections below. Click on each section to expand. Feel free to get assigned to an issue by following the instructions [here](https://unify.ai…

ivy-leaves updated 2 months ago

i404788/s5-pytorch #3

How to carry state in apply_ssm?

Dear, how to use prev_state in apply_ssm function since I see it is now purely forward? I would ideally want to: x, states = s5(x, states), where apply_ssm carries state such that I can train with m…

looper99 updated 5 months ago

radarFudan/mamba-minimal-jax #1

parallel scan

Hi! Just a quick question. Isn't it possible to use a parallel scan already existing in jax? https://jax.readthedocs.io/en/latest/_autosummary/jax.lax.associative_scan.html (with binary operator from …

Howuhh updated 9 months ago

14 results for associative-scan-rnn

14 results
for associative-scan-rnn