Open timolohrenz opened 6 years ago
What you are hitting is a validation check, which checks for a single sequence per sample. In this case we are getting 266 sequences per sample and therefore the error. Can you make the labels file one line per label?
Hmm, sorry for the bad formatting of my issue. Actually my labels are one label per line as I have corrected now in my former post. Both, label and feature frames have exactly the same lengths.
As I initially guessed this might be an issue with the HTKfeature serializer, I also converted my features to ctf files with also one feature per line. Nevertheless I still get this error. Only workaround that I found was to skip the sequence IDs for both feature and label files.
Sure, any possibility for me to get a minimum repro? I can debug it to see what is really happening.
Yeah for sure. Thanks in advance for your efforts.
I tested the repro on CNTK binary versions 2.0 and 2.3 both compiled with GPU support. Simply start it with
cntk configFile=timit_dnn_conf.cntk
As you will see my data consists of 1101 frames from 4 sequences (=speech utterances) stored in the _featssmall.ctf. Each speech utterance has its own sequence ID. The labels in _tgtssmall.ctf have a parallel structure with one label per line/frame.
Please note the outcommented skipSequenceIds options in the reader section. When skipping IDs, the error won't occur. However, I think this workaround is quite risky as feats might easily get assigned to wrong labels/sequences.
After all it can be a simple misinterpretation of me of the frameMode option, but maybe you can shed some light on it. It would be of great comfort for me to use the same labels and features for sequence and non-sequence trainings.
As I said, thanks a lot and keep up the great work on this toolkit!
Hi, Thank you for the update. One of my collegue quickly looked at this issue and suggested that we need to set frameMode and truncation outside the reader section.
frameMode = true reader = [ ... ] Could you please give this a try and let me know.
oh, i would have been ashamed if this was the case.
Unfortunately it's not. I tried out the frameMode setting at any possible level and still get the same error. It seems as the reader still treats the input as sequences even though frame Mode is activated.
Hello CNTK team,
for my reader setup I use the 'HTKFeatureDeserializer' for my reading features from my scp-File. The head of the .scp file looks like this:
For the input labels I use the CNTKtextFormatDeserializer with my sparse labels having the following form:
... and so on, with one line per frame. First characters in lines are the corresponding sequence IDs.
For non-recurrent networks I want to run my training in frameMode. For my understanding, all sequences are then split and processed as single frames. However I get the following error message:
Am I missing some configuration or might this be some missing feature in the readers. Thanks in advance for your help and time.
PS: This is what my reader section looks like: