audio-visual-speech-recognition Search Results

452 results for audio-visual-speech-recognition

jwebmeister/tacspeak #23

Model test results - model 20240117

**Post test results + useful remarks here, ideally of both:** - the new model (20240117), and - the base model (kaldi_model_daanzu_20211030-mediumlm), using the same test data, and using the d…

jwebmeister updated 8 months ago
55
immersive-web/webxr #815

Spec language precludes non-visual uses

There are XR use cases (e.g., "audio AR") that could build on poses and other capabilities exposed by core WebXR (and future extensions). The current spec language, though, appears to require visual d…

ddorwin updated 3 years ago
41
Thinking-with-Deep-Learning-Spring-2024/Readings-Responses #14

Week 7. May. 3: Sound & Image Learning - Possibilities

Pose a question about one of the following articles: [“Machine Learning as a Tool for Hypothesis Generation”](https://doi.org/10.1093/qje/qjad055), Jens Ludwig, Sendhil Mullainathan. The Quarterly…

JunsolKim updated 4 months ago
20
Thinking-with-Deep-Learning-Spring-2024/Readings-Responses #16

Week 8. May. 10: Multi-Modal Learning - Possibilities

Pose a question about one of the following articles: “[Online images amplify gender bias](https://www.nature.com/articles/s41586-024-07068-x),” 2024. Guilbeault, Douglas, Solène Delecourt, Tasker …

JunsolKim updated 4 months ago
23
omlins/JustSayIt.jl #24

Add possibility to filter background noise from input audio

omlins updated 9 months ago
2
facebookresearch/av_hubert #63

Finetuning Models for Visual Speech Recognition

Hello, I was trying to load a finetuned model for the VSR task. I followed the indications on the repository and the jupyter notebook (below you can see that I tried to import modules from the avhu…

david-gimeno updated 3 months ago
9
Vocaluxe/Vocaluxe #479

Adding Rap Notes

Vocaluxe doesn't support Rap Notes at all, which USDX has supported since its last release in 2017. Even worse, if a song contains a rap note, Vocaluxe ignores the whole song. From my understanding, rap note…

JanK118 updated 1 month ago
26
grpc/grpc #37470

Memory leak in v1.65.x for C++

There appears to be a memory leak introduced in v1.65.x with C++ and using Google Speech To Text v1p1beta1 and v2 StreamingRecognize. The same code does not leak with v1.64.1 and v1.64.3 (and none not…

mark-dorrell updated 1 month ago
2
jungwoo-ha/WeeklyArxivTalk #73

[20230226] Weekly AI ArXiv Chat Season 2 - Episode 7

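### News - Government ministries and local governments are all in intensive-study mode on ChatGPT & hyperscale AI: likely to do even more going forward.. - [Ministry of Education](https://n.news.naver.com/mnews/article/079/0003736942?sid=102), [Ministry of Science and ICT](https://n.news.naver.com/mnews/article/421/0006645964?si…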

jungwoo-ha updated 1 year ago
4
marl/group_meetings #2

Meeting topic suggestions

Some ideas floated in our meeting today: - Reading on speech models - Reading on hip-hop, lyrics, and language - Reading on attention mechanisms in deep learning - Workshopping figures or code i…

bmcfee updated 6 years ago
17
