iancovert / sage

For calculating global feature importance using Shapley values.
MIT License

Exception encountered when calling layer "gru" (type GRU). #14

Closed deepakupman closed 8 months ago

deepakupman commented 2 years ago

I am getting the error below when trying to use the package on text data with a GRU layer.

InternalError: Exception encountered when calling layer "gru" (type GRU). Failed to call ThenRnnForward with model config: [rnn_mode, rnn_input_mode, rnn_direction_mode]: 3, 0, 0 , [num_layers, input_size, num_units, dir_count, max_seq_length, batch_size, cell_num_units]: [1, 64, 64, 1, 32, 250000, 0] [Op:CudnnRNN]

Call arguments received: • inputs=tf.Tensor(shape=(250000, 32, 64), dtype=float32) • mask=None • training=False • initial_state=None

Model: "sequential"

| Layer (type) | Output Shape | Param # |
| --- | --- | --- |
| embedding (Embedding) | (None, 32, 64) | 768000 |
| spatial_dropout1d (SpatialDropout1D) | (None, 32, 64) | 0 |
| gru (GRU) | (None, 32, 64) | 24960 |
| dropout (Dropout) | (None, 32, 64) | 0 |
| gru_1 (GRU) | (None, 64) | 24960 |
| dropout_1 (Dropout) | (None, 64) | 0 |
| dense (Dense) | (None, 32) | 2080 |
| dropout_2 (Dropout) | (None, 32) | 0 |
| dense_1 (Dense) | (None, 100) | 3300 |
| dense_2 (Dense) | (None, 1) | 101 |

Total params: 823,401
Trainable params: 823,401
Non-trainable params: 0

iancovert commented 2 years ago

Hi Deepak, it looks like the issue is that the package isn't correctly handling held-out features and making predictions with your model. This is one of the core operations when calculating SAGE values, and I wrote the package to work mainly with tabular data where the model input is size (batch, num_features). So it's just not currently set up for your use-case, but we should be able to make it work here.
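To illustrate the core operation for the tabular case: for each feature subset, the held-out features are filled in (e.g., from background samples) and the model is evaluated on the imputed inputs. A minimal numpy sketch of marginal imputation, with an illustrative function name and a toy model (not the package's actual implementation):

```python
import numpy as np

def marginal_impute_predict(f, x, S, background):
    """Estimate E[f(x)] with held-out features marginalized out:
    tile x across background samples, overwrite the held-out
    features (S == False) with background values, and average
    the model's predictions. x: (num_features,),
    S: boolean (num_features,), background: (n_bg, num_features)."""
    n_bg = background.shape[0]
    x_tiled = np.tile(x, (n_bg, 1))
    x_tiled[:, ~S] = background[:, ~S]
    return f(x_tiled).mean()

# Toy linear model: f(x) = sum of features.
f = lambda X: X.sum(axis=1)
x = np.array([1.0, 2.0, 3.0])
background = np.zeros((4, 3))
S = np.array([True, False, True])  # hold out the middle feature
print(marginal_impute_predict(f, x, S, background))  # 1 + 0 + 3 = 4.0
```

This is why the input layout matters: the subset mask S must line up with whatever axis you consider to be "features."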

The main thing we need to figure out is the feature imputer. Since you're working with embeddings, it may be simplest to impute held-out feature values with zeros (and this seems reasonable given that you're already training with 1d dropout in the second layer). The package's way of doing this is implemented in the DefaultImputer class (here), but a couple of possible issues jump out at me. First, can I ask which dimension you want to treat as the features? I'm guessing you want the 32 dimension to be the features, because the 64 dimension looks like the embedding size - is that right? Let me know and I can help write a corrected imputer class.
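For concreteness, here is one way a custom imputer could look if the 32 token positions are the features. This is a hypothetical sketch (the class name, the zero-imputation strategy, and the toy model are all illustrative, not part of the sage package); it assumes the imputer is called with a batch of inputs x and a boolean subset mask S, and it pads out held-out token positions so the embedding layer maps them to the padding vector:

```python
import numpy as np

class TokenZeroImputer:
    """Hypothetical imputer for token-sequence inputs: held-out
    positions (S == False) are replaced with the padding index,
    so the model's embedding layer effectively zeroes them out."""

    def __init__(self, model, pad_index=0):
        self.model = model
        self.pad_index = pad_index

    def __call__(self, x, S):
        # x: (batch, seq_len) integer token ids
        # S: (batch, seq_len) boolean mask; False = feature held out
        x_imputed = np.where(S, x, self.pad_index)
        return self.model(x_imputed)

# Toy "model" that counts non-pad tokens per sample.
model = lambda x: (x != 0).sum(axis=1)
imputer = TokenZeroImputer(model)
x = np.array([[5, 7, 9]])
S = np.array([[True, False, True]])
print(imputer(x, S))  # position 1 is padded out -> [2]
```

In your case the model would be the trained Keras network and x the (batch, 32) token-id inputs, with seq_len = 32 playing the role of num_features.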

Also, can I ask what kind of data you're using with a GRU where you want to understand global rather than local feature importance?