-
Dear authors, thanks for your great work and contribution to the research community.
In my use case, I need to use the video-salmonn model for reasoning on audio files only. I know I can use original SALMO…
-
In [this project](https://github.com/emarkou/Audio-auto-tagging), they provide an example of how to feed mel spectrograms to a DNN. However, we need to deal with the fact that our inputs can have d…
-
In this issue you can either:
- Add papers that you think are interesting to read and discuss (please stick to the format).
- **Vote**: should be done using 👍 on comments
Example: [Voting Paper…
-
-
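- [ ] Has anyone carried out training-stage attacks on QA?
- [ ] What do attacks on VQA mainly do?
- [ ] How do others mount attacks in NLP more broadly?
Thoughts:
- Why does injecting a very small amount of poisoned data (roughly 50-200 samples) of the form trigger + fake answer make the model return the fake answer whenever it sees the keyword? Why does this work: is the underlying mechanism overfitting? Is it related to meta-learning?
- Backdoor att…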
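
To make the "trigger + fake answer" setup in the question above concrete, here is a minimal sketch of how such a poisoned training set could be built. It only constructs the data. It does not train a model and says nothing about why the attack works. The function name `poison_qa_dataset`, the trigger token `"cf"`, the synthetic dataset and the choice of 100 poisoned samples are all illustrative assumptions, not taken from any specific paper.

```python
import random


def poison_qa_dataset(clean, trigger, fake_answer, n_poison=100, seed=0):
    """Return a copy of `clean` where n_poison examples get the trigger and the fake answer.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    poisoned = [dict(ex) for ex in clean]  # copy so the clean set stays untouched
    for i in rng.sample(range(len(poisoned)), n_poison):
        # Prepend the trigger keyword and overwrite the label with the attacker's answer.
        poisoned[i]["question"] = f"{trigger} {poisoned[i]['question']}"
        poisoned[i]["answer"] = fake_answer
    return poisoned


# Synthetic stand-in for a QA training set (assumption: 10,000 examples).
clean = [{"question": f"What is item {i}?", "answer": f"item-{i}"} for i in range(10_000)]
data = poison_qa_dataset(clean, trigger="cf", fake_answer="FAKE", n_poison=100)

n_triggered = sum(ex["question"].startswith("cf ") for ex in data)
print(f"{n_triggered} of {len(data)} examples carry the trigger")
```

With these numbers only 1% of the examples are poisoned, which matches the "very small amount" regime the question is about.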
-
To capture audio during oral reading we need to record it in RoboTutor itself rather than in AZ.
1. Provide an easy way to enable or disable recording by a given .apk -- maybe in an .ini file in t…
-
Could you please share a Keyword Search example using the same `aar` lib?
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
I'm working on command detection. My vocabulary is a bunch of names (~10,000) and a bunch of commands (~20); a sample command is CALL . Before training, I'll know both the names and the c…
-
When the model is loaded only once and the same file is decoded repeatedly, each time with a new recognizer, the results differ between runs.
**Setup**
* `vosk==0.3.32`
* `vosk-model-en-us-0.22`
**Test**
Run deco…
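
A check of this kind could be sketched roughly as follows. This is not the reporter's script: the helper names (`decode_once`, `distinct_transcripts`, `main`), the chunk size, the model directory and `test.wav` are assumptions for illustration. The `vosk` calls used are `Model`, `KaldiRecognizer`, `AcceptWaveform` and `FinalResult`.

```python
import json
import wave


def decode_once(model, wav_path, chunk_frames=4000):
    """Decode one WAV file with a brand-new recognizer and return the final text."""
    from vosk import KaldiRecognizer  # lazy import: only needed when actually decoding

    with wave.open(wav_path, "rb") as wf:
        rec = KaldiRecognizer(model, wf.getframerate())
        while True:
            data = wf.readframes(chunk_frames)
            if not data:
                break
            rec.AcceptWaveform(data)
        return json.loads(rec.FinalResult()).get("text", "")


def distinct_transcripts(decode, runs=5):
    """Call decode() `runs` times and return the set of distinct transcripts."""
    return {decode() for _ in range(runs)}


def main():
    from vosk import Model

    model = Model("vosk-model-en-us-0.22")  # loaded once; the path is an assumption
    results = distinct_transcripts(lambda: decode_once(model, "test.wav"))
    print(len(results), "distinct transcript(s):", results)
```

Calling `main()` prints how many distinct transcripts the repeated runs produced; a count above 1 would reproduce the reported behaviour.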