specaugment Search Results

290 results for specaugment

hirofumi0810/neural_sp #228

Can't replicate CSJ's LAS results with the default setup.

Hi @hirofumi0810 , I'm trying to replicate the results of the LAS model that you shared in this [table](https://github.com/hirofumi0810/neural_sp#csj-wer). I'm using the default [run.sh](https:…

lijianhackthon updated 3 years ago
10
mlfoundations/open_clip #384

MuLaN

The new [MusicLM](https://arxiv.org/abs/2301.11325) relies on an audio CLIP named [MuLaN](https://arxiv.org/abs/2208.12415) I will build out an initial implementation [here](https://github.com/luci…

lucidrains updated 1 year ago
11
trtd56/BirdCLEF #36

What teyo did before the merge

## Resources + Kaggle GPU/TPU + Colab Pro + local machine (GTX 1070) + models are mostly built on this, 15h/1fold ## Approach: use all the data, classification approach with a CNN model ## What I was doing at 0.62 [Code](https://www.kaggle.com/teyosan1229/b…

teyosan updated 3 years ago
3
lhotse-speech/lhotse #1086

Tutorial review: Using Lhotse with PyTorch Lightning

Hi all, Following up on the tutorials thread [here](https://github.com/lhotse-speech/lhotse/issues/618#issuecomment-1564641754), I've written a first draft of a tutorial for using Lhotse with PyTor…

fauxneticien updated 1 year ago
9
TensorSpeech/TensorFlowASR #198

Preprocess Dataset

Hi, @usimarit In 'datasets/asr_dataset.py' line 141 u called line 41 of 'augmentations/augmentation.py', which is calling self.signal_augmentations = self.parse(config.pop("signal_augment", {})) …

atanumandal0491 updated 3 years ago
4
galv/lingvo-copy #13

Gender and Accent Recognition Brainstorm

After a performant forced alignment pipeline is done, my next thought goes to how to add gender and accent recognition. First of all, I will assume that each segment output by the forced aligner co…

galv updated 3 years ago
1
tensorflow/tensor2tensor #1121

[Question] ASR Transformer performance vs. Google Speech-to-…

### Description We used the ["ASR with Transformer" colab notebook](https://colab.research.google.com/github/tensorflow/tensor2tensor/blob/master/tensor2tensor/notebooks/asr_transformer.ipynb) which …

mabergerx updated 4 years ago
4
ictnlp/StreamSpeech #13

Trained model can generate correct text but incorrect speech

I tried to reproduce the training of the fr-en simultaneous model. I followed the instructions to prepare the dataset and ran the script train.simul-s2st.sh. The model training seems to go fine but the …

chentuochao updated 1 month ago
13
kkoutini/PaSST #49

Where is input normalization applied?

Hi Khaled, Could you please point me to where normalization is applied to inputs? (for the esc50 case or any other cases) I am talking about channels mean and std such as written in the code bel…

Antoine101 updated 6 months ago
4
lhotse-speech/lhotse #548

Some problems when loading the TedLium3 dataset for transduc…

Currently, I am trying to build a transducer-stateless recipe based on Tedlium3 for icefall. This is the PR. (https://github.com/k2-fsa/icefall/pull/183). This PR shows the concrete codes for processi…

luomingshuang updated 2 years ago
11
