issues
search
atosystem
/
SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
https://atosystem.github.io/blogs/speechclip
BSD 3-Clause "New" or "Revised" License
108
stars
6
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Encountered a size mismatch problem while attempting to load a large model.
#9
marcos452
closed
3 months ago
1
derive embeddings: cascasded models
#8
lokesh12345678910
closed
3 months ago
4
Can derive embeddings with base but not large
#7
lokesh12345678910
closed
4 months ago
2
Training on Flickr Dataset Unexpectedly Hangs
#6
mhamzaerol
opened
6 months ago
4
about training codes
#5
Benjizhang
closed
11 months ago
1
about speech-text implement
#4
xiaoyaoxiaoxian
closed
7 months ago
1
ImportError: cannot import name 'LightningLoggerBase' from 'pytorch_lightning.loggers'
#3
seongq
closed
1 year ago
3
Simple Embeddings
#2
corranmac
closed
1 year ago
4
Dataset source?
#1
FlyToYourMooN
closed
1 year ago
2