HazyResearch/H3
Language Modeling with the H3 State Space Model
Apache License 2.0
515 stars
54 forks
issues
#29 Full version of H3 model? (FeelingFatigued, opened, 1 year ago, 4 comments)
#28 use_fast_fftconv generates error (FeelingFatigued, closed, 1 year ago, 2 comments)
#27 ERROR: CUDA RT call cudaFuncSetAttribute. Failed with invalid device function (98). (wang-zerui, closed, 1 year ago, 2 comments)
#26 Setup for repo unclear (inf3rnus, opened, 1 year ago, 1 comment)
#25 TypeError: forward() got an unexpected keyword argument 'last_token_only' (NewDaddy, opened, 1 year ago, 2 comments)
#24 /bin/bash: line 0: cd: csrc/fused_softmax: No such file or directory (jabowery, opened, 1 year ago, 0 comments)
#23 Question about methodology used for evaluating FlashConv against cuFFT (sylee0124, closed, 1 year ago, 2 comments)
#22 why dividing kv_f by fft_size? (violet-zct, closed, 1 year ago, 2 comments)
#21 Error when using use_fast_fftconv option in generate_text_h3.py (sylee0124, closed, 1 year ago, 3 comments)
#20 Release of pretraining and fine tuning code (ksrinivs64, opened, 1 year ago, 5 comments)
#19 The motivation for not fusioning fff(k) into the kernel (Doraemonzzz, closed, 1 year ago, 2 comments)
#18 Having trouble for compiling fftconv (Doraemonzzz, closed, 1 year ago, 9 comments)
#17 Error Running `generate_text_h3.py` (`CUDA error: CUBLAS_STATUS_NOT_INITIALIZED`) (gdebayan, closed, 1 year ago, 3 comments)
#16 Modifs (flbbb, closed, 1 year ago, 0 comments)
#15 inconsistent output from fftconv_func and native pytorch fft (mojanjp, opened, 1 year ago, 3 comments)
#14 FFT Conv on Seq > 8192? (darius-lam, opened, 1 year ago, 1 comment)
#13 What is fftconv_bwd doing? (darius-lam, closed, 1 year ago, 2 comments)
#12 Releasing Training and Synthetic benchmarks (mojanjp, opened, 1 year ago, 3 comments)
#11 Unpickling errors when loading models (mttk, closed, 1 year ago, 1 comment)
#10 Correct method to load 2.7B? (BlinkDL, opened, 1 year ago, 4 comments)
#9 Update README.md (eltociear, closed, 1 year ago, 0 comments)
#8 Trying to generate something coherent (nikitastaf1996, closed, 1 year ago, 2 comments)
#7 CPU Port (okpatil4u, opened, 1 year ago, 4 comments)
#6 Training code? (NtaylorOX, opened, 1 year ago, 4 comments)
#5 Licensing information (ghost, closed, 1 year ago, 1 comment)
#4 fix MHA API (kashif, closed, 1 year ago, 1 comment)
#3 Error running benchmarks/benchmark_generation.py (BlinkDL, opened, 1 year ago, 4 comments)
#2 ssm utils (bryanhpchiang, opened, 1 year ago, 2 comments)
#1 2.7B Evaluations (sdtblck, opened, 1 year ago, 2 comments)