sequence-modeling Search Results

1000+ results
for sequence-modeling

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

state-spaces/mamba #485

Is mamba suitable for time-series classification task?

I am working on a time-series classification task.Does mamba work for my task?

wuzzeze updated 1 month ago
1
huggingface/transformers #33095

llama3 position_ids error with left padding

### Feature request The LLaMA 3 implementation should generate default `position_ids` that take the `attention_mask` into account. @ArthurZucker @younesbelkada ### Motivation Is there a s…

ummagumm-a updated 1 week ago
2
arXivTimes/arXivTimes #2063

Decision Transformer: Reinforcement Learning via Sequence Mo…

## 一言でいうと Transformerを強化学習に応用した研究。State/Action/Rewardの系列を入力して次の行動を予測させる。収録済みの軌跡から学習するオフライン強化学習で、既存の手法を上回る精度(オンラインの強化学習ではまだ検証されていない)。 ![image](https://user-images.githubusercontent.com/544269/12…

icoxfog417 updated 3 years ago
1
hiyouga/LLaMA-Factory #4963

量化会卡住，Issues里很多人遇到了同样的问题，但都没有解决方案

### Reminder - [X] I have read the README and searched the existing issues. ### System Info - `llamafactory` version: 0.8.3.dev0 - Platform: Linux-5.4.54-1.0.0.std7c.el7.2.x86_64-x86_64-with-glibc…

ConniePK updated 1 week ago
2
huggingface/transformers #32944

clarify the label shifting behavior of llama models when `la…

### Feature request i believe `labels` in the training of causal LMs means the value to predict at time `n`, i.e., the next token. in other words, i'd assume, if `labels` is given, it should be alrea…

keunwoochoi updated 1 week ago
1
k2-fsa/sherpa-onnx #981

Hotwords encoding for phonemes

Hi. I have a phoneme-based Zipformer model. Before this [PR](https://github.com/k2-fsa/sherpa-onnx/pull/828), I was able to apply hotwords encoding for phoneme sequences, e.g. `ɪ z/dʒ ʌ s t/b ɛ s t…

w11wo updated 2 months ago
8
IST-DASLab/sparsegpt #35

AttributeError: 'NoneType' object has no attribute 'shape'

``` (textgen) [root@pve-m7330 sparsegpt]# python llama.py ../text-generation-webui/models/TinyLlama-1.1B-Chat-v1.0/ wikitext2 --nsamples 10 Token indices sequence length is longer than the specified…

thistleknot updated 2 weeks ago
8
Oufattole/meds-torch #30

Autoregressive Modeling

Generative modeling -- triplet works https://github.com/mmcdermott/EventStreamGPT/blob/main/EventStream/transformer/generation/generation_utils.py#L73 Simplify this code^ With triplet code - you …

Oufattole updated 4 weeks ago
8
HERA-Team/uvtools #53

try modeling foregrounds with Discrete Prolate Spheroidal Se…

Apparently the eigenspectrum of the sinc matrix (with delay width \tau_w), a regularized version of which is being used in the linear filter optimizes the “centralization problem". The eigenvectors of…

aewallwi updated 4 years ago
1
KittyCAD/modeling-app #3721

Show the Device Activation code when users log in

When users log into the modeling app, they click on their OAuth provider (e.g. Google) and then sign in, and then they see this. Users have to ignore the big code, and instead press the little …

adamchalmers updated 1 week ago
6

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for sequence-modeling

1000+ results
for sequence-modeling