TensorSpeech / TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
https://huylenguyen.com/asr
Apache License 2.0
913 stars 242 forks source link

Preprocess Dataset #198

Open atanumandal0491 opened 3 years ago

atanumandal0491 commented 3 years ago

Hi, @usimarit In 'datasets/asr_dataset.py' line 141 u called line 41 of 'augmentations/augmentation.py', which is calling self.signal_augmentations = self.parse(config.pop("signal_augment", {})) self.feature_augmentations = self.parse(config.pop("feature_augment", {})) In config file no dict of signal_augment is present, whereas feature_augment is showing None, results in outputs as None in sub-class '_augment'.

Can you guide me?

nglehuy commented 3 years ago

@atanumandal0491 Since version >1.0, I removed the predefined signal augmentations (I prefer augmentations implemented in tensorflow rather than nlpaug - to avoid the use of tf.numpy_function for TPU compatibility). If you want to use signal augmentations, you'll have to code them yourself. And the feature augmentations methods currently "predefined" are time masking and frequency masking in specaugment.

Showing the error messages is better for me to know what is happening in your case.

atanumandal0491 commented 3 years ago

Hi, I was able to solve the problem. But yet on training using RnntLoss, I am getting "inf" as training loss. Unable to find the solution.

nglehuy commented 3 years ago

@atanumandal0491 Please tell me more about what rnnt loss you are using (warp rnnt, tf rnnt)? What language you are training? What type of text decoding you are using (characters, subwords, sentence piece)? What version you are using?

atanumandal0491 commented 3 years ago

Hi, I did a mail to you.