error_analysis method not working for the discriminative model

snorkel-team / snorkel

A system for quickly generating training data with weak supervision

Apache License 2.0


The error_analysis method of the Classifier class does not seem to work for the RNN implementation. When I run disc_model.error_analysis(sess, L_dev, L_gold_dev), where L_dev and L_gold_dev are sparse matrices, I get errors (possibly because len is not supported on sparse matrices?).

The problems seem to come mainly from the _make_tensor method.
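To make the len hypothesis concrete: SciPy sparse matrices raise a TypeError when len() is called on them and point to shape[0] instead. Below is a minimal sketch of a row-count helper that works for both sparse matrices and ordinary sequences. Note that n_rows is a hypothetical helper, not part of Snorkel, and FakeSparse is only a stand-in that mimics the SciPy behaviour so the example runs without SciPy installed.

```python
class FakeSparse:
    """Hypothetical stand-in mimicking how SciPy sparse matrices reject len()."""

    def __init__(self, n_rows, n_cols):
        self.shape = (n_rows, n_cols)

    def __len__(self):
        # Same kind of failure a SciPy sparse matrix produces on len(X)
        raise TypeError("sparse matrix length is ambiguous; use getnnz() or shape[0]")


def n_rows(X):
    """Return the number of rows of X without relying on len() for matrices."""
    shape = getattr(X, "shape", None)
    if shape is not None and len(shape) > 0:
        # Matrix-like objects (dense or sparse) expose the row count here
        return shape[0]
    # Plain Python sequences have no shape attribute, so len() is safe
    return len(X)


L_dev_like = FakeSparse(3, 4)
print(n_rows(L_dev_like))   # 3
print(n_rows([1, 0, 1]))    # 3
```

If the failure inside _make_tensor does come from a len() call, replacing it with a shape-based count like this would be one possible workaround; I have not verified that against the Snorkel source.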

EDIT: In some experiments, the embedding_lookup call at the prediction step (snorkel/snorkel/learning/disc_models/rnn/rnn_base.py, line 93) fails with

indices[0,2] = -1 is not in [0,5005)

Since training runs without errors, the embedding itself must be constructed properly. However, I cannot see why the indices passed to the lookup at prediction time would contain -1, and I cannot find any documentation on this.
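A small diagnostic can help narrow this down by listing every index that falls outside the valid range before it reaches the lookup. The sketch below is plain Python. The function find_out_of_range and the sample batch are hypothetical, and only the vocabulary size 5005 and the offending position [0, 2] are taken from the error message above.

```python
def find_out_of_range(batch, vocab_size):
    """Return (row, col, value) for every index outside [0, vocab_size)."""
    bad = []
    for i, row in enumerate(batch):
        for j, idx in enumerate(row):
            if not 0 <= idx < vocab_size:
                bad.append((i, j, idx))
    return bad


# Hypothetical batch of token indices; row 0, column 2 holds the invalid -1
batch = [
    [12, 7, -1, 40],
    [3, 5004, 9, 0],
]
print(find_out_of_range(batch, 5005))  # [(0, 2, -1)]
```

Running a check like this on the index arrays built for the dev set should show which candidates and token positions carry the -1, and from there where the value is introduced.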

error_analysis method not working for the discriminative model #833