-
Some sigproc generators reset their state when the incoming sample rate or array shape changes, because the state (e.g. filter coefficients) depends on these variables.
AxisArray now has `…
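
To illustrate the reset behaviour described above, here is a minimal sketch of a generator that rebuilds its state when the sample rate or channel count changes. It is not the library's implementation: `lowpass_gen`, its one-pole filter, and the `(frames, fs)` message format are all hypothetical, and plain lists stand in for `AxisArray`.

```python
import math


def lowpass_gen(cutoff_hz):
    """Hypothetical one-pole low-pass generator (illustration only)."""
    fs_seen = None    # sample rate the coefficient was computed for
    n_ch_seen = None  # channel count the per-channel state was built for
    alpha = 0.0       # filter coefficient, depends on the sample rate
    state = []        # last output per channel, depends on the shape
    out = None
    while True:
        frames, fs = yield out
        if not frames:
            out = []
            continue
        n_ch = len(frames[0])
        if fs != fs_seen or n_ch != n_ch_seen:
            # Coefficient and state depend on fs and shape: rebuild both.
            alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / fs)
            state = [0.0] * n_ch
            fs_seen, n_ch_seen = fs, n_ch
        out = []
        for frame in frames:
            state = [s + alpha * (x - s) for s, x in zip(state, frame)]
            out.append(list(state))


gen = lowpass_gen(10.0)
next(gen)                               # prime the generator
a = gen.send(([[1.0, 1.0]], 100.0))     # first chunk: state is initialised
b = gen.send(([[1.0, 1.0]], 100.0))     # same fs and shape: state carries over
c = gen.send(([[1.0, 1.0]], 200.0))     # new sample rate: state is reset
d = gen.send(([[1.0]], 200.0))          # new channel count: state is reset
```

In this sketch, `b` continues from where `a` left off, while `c` and `d` start again from zero because the variables the state depends on have changed.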
-
I want to know whether we can use a pretrained model for a language other than English
in a different aligner that uses the same Kaldi toolkit with an ANN as the acoustic model?
-
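I am new to RT-Thread. I have a project that needs to control a variable-frequency drive, and the communication method is CANOpen CIA402.
The hardware is the ALIENTEK (正点原子) Apollo STM32F767 development board.
1. First, I enabled only CAN in ENV and tested the CAN port with a ZLG (周立功) CAN tester; it sends and receives normally.
2. Then I enabled the CanFestival component in ENV,
because I wanted to try the bundled CIA402 example.
3. The program compiles successfully. Download…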
-
Hello InternVideo team,
You guys have done a great job with this project!
In your paper, you use the Stage 2 model for the task of temporal grounding on QVHighlight [Lei et al., 2021] and Charad…
-
The tutorial mentions feature extraction.
Are these the learned representations of AV-HuBERT, or just features extracted from the input video file that need to be passed to the AV-HuBERT model…
-
I tried to reproduce the results on LibriSpeech using train_am_tds_ctc.cfg:
--runname=am_tds_ctc_librispeech
--rundir=/root/wav2letter.debug/recipes/models/sota/2019/librispeech/
--archdir=/ro…
-
Hi,
I am trying to replicate the best results mentioned in the paper with unfrozen Whisper features, MFCC and MesoNet. The EER that I get for the model from stage 1 (encoder frozen) is close to that frozen c…
-
I am training ConvTasNet on the LibriMix train-100 dataset. It works fine when I train in sep_noisy mode, but it raises the following error when I train in enh_single mode:
```
Results from the…
-
The library versions currently specified in the requirements files produce ResolutionImpossible errors. Please provide the versions of all libraries, and the Python version, with which this project works. Also please provide an …
-
Hi,
I'm trying to retrain the model on 16 kHz data. I've set the parameters in `config.ini` as
```
sr = 16000
fft_size = 480
hop_size = 240
nb_erb = 16
nb_df = 48
min_nb_erb_freqs = 1
```
b…
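
As a quick sanity check on the values quoted above, the following sketch works out what they imply under standard STFT arithmetic. It assumes `fft_size` is the analysis window length in samples and that `nb_df` counts the lowest frequency bins; neither assumption is confirmed by the post, and the variable names on the left are made up for illustration.

```python
# Values copied from the config.ini excerpt above.
sr = 16000
fft_size = 480
hop_size = 240
nb_df = 48

frame_ms = 1000.0 * fft_size / sr     # window length: 30.0 ms
hop_ms = 1000.0 * hop_size / sr       # hop length: 15.0 ms
overlap = 1.0 - hop_size / fft_size   # 0.5, i.e. 50 % overlap
n_bins = fft_size // 2 + 1            # 241 one-sided frequency bins
bin_hz = sr / fft_size                # about 33.3 Hz per bin

# Assumption: nb_df selects the lowest nb_df bins, so this is their upper edge.
df_upper_hz = nb_df * bin_hz          # about 1600 Hz

print(frame_ms, hop_ms, overlap, n_bins, round(bin_hz, 1), round(df_upper_hz))
```

Under these assumptions the settings give a 30 ms window with a 15 ms hop, and the Nyquist frequency for 16 kHz data is 8 kHz.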