GokuMohandas / Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.
https://madewithml.com
MIT License
37.52k stars 5.95k forks

Foundations --> CNN Doubts #206

Closed shashankvasisht closed 2 years ago

shashankvasisht commented 2 years ago

Hi, thank you for such excellent lessons!

I had 3 doubts in the lecture; can you please explain them?

  1. When we pad the one-hot sequences to the maximum sequence length, why do we not put a 1 at the 0th index (so that the padding corresponds to the `<PAD>` token)? Why is it currently all zeros?

  2. When we're loading the weights into the `InterpretableCNN` model, why don't we get a weight-mismatch error? (We have dropped the FC layer part and we're also not using `strict=False`.)

  3. My sns heatmap / conv_output has all values equal to 1; it does not resemble yours. Can you help me with this?

[screenshot: heatmap output]

GokuMohandas commented 2 years ago
  1. The `<PAD>` token index is 0 in our code (see the `Tokenizer` class), unless it's configured otherwise. But you made me realize that we should pass the pad index into `pad_sequences` instead of assuming it's 0; these kinds of mismatches lead to silent bugs! I'll push this change towards the end of this month.
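A minimal sketch of what passing the pad index explicitly could look like. The function name mirrors the lesson's `pad_sequences`, but this signature and implementation are illustrative assumptions, not the repo's actual code:

```python
import numpy as np

def pad_sequences(sequences, max_seq_len=0, pad_index=0):
    """Pad integer token sequences to a common length with an explicit pad index."""
    max_seq_len = max(max_seq_len, max(len(seq) for seq in sequences))
    # Fill everything with pad_index first, then copy each real sequence in.
    padded = np.full((len(sequences), max_seq_len), pad_index, dtype=np.int64)
    for i, seq in enumerate(sequences):
        padded[i, : len(seq)] = seq
    return padded
```

With `pad_index` made explicit, the padding stays correct even if `<PAD>` is ever mapped to a non-zero index in the tokenizer.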
  2. Good question, I'll add more detail to the lesson to make this clear. But we're actually not dropping the FC layers: if you look at the `__init__` function for `InterpretableCNN`, it has all the layers. The only difference is that the `forward` function returns an earlier activation.
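The key point is that `load_state_dict` matches keys generated from the modules declared in `__init__`, not from what `forward` returns. A small sketch (layer names and sizes are illustrative, not the lesson's actual architecture):

```python
import torch
import torch.nn as nn

class CNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv1d(8, 4, kernel_size=3)
        self.fc = nn.Linear(4, 2)

    def forward(self, x):
        z = torch.relu(self.conv(x))
        z = z.max(dim=-1).values  # pool over the sequence dimension
        return self.fc(z)

class InterpretableCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv1d(8, 4, kernel_size=3)  # same layers as CNN
        self.fc = nn.Linear(4, 2)                   # still declared, just unused below

    def forward(self, x):
        # Return the earlier conv activation instead of the FC output.
        return torch.relu(self.conv(x))

model = CNN()
interpretable = InterpretableCNN()
# The state_dict keys ("conv.weight", "fc.bias", ...) match exactly,
# so this succeeds even with the default strict=True.
interpretable.load_state_dict(model.state_dict())
```

If `InterpretableCNN` had actually omitted `self.fc` from `__init__`, the keys would no longer match and `load_state_dict` would raise unless `strict=False` were passed.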
  3. I've seen this happen if you don't train to completion, but also make sure that the `<PAD>` token index is zero. Until I fix the mismatch in the `pad_sequences` function, we force `<PAD>` to be all zeros.
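A quick sanity check you can run before padding. The vocabulary dict below is hypothetical; substitute your tokenizer's actual token-to-index mapping:

```python
# Hypothetical token-to-index mapping; use your Tokenizer's actual mapping.
token_to_index = {"<PAD>": 0, "movie": 1, "great": 2}

# If <PAD> maps to index 0, an all-zero one-hot row is a valid pad vector.
# Otherwise, zero-padding silently encodes "no token" where a real pad
# token was expected, which can distort the conv outputs.
assert token_to_index["<PAD>"] == 0, (
    "Pad index must be 0 while pad_sequences assumes all-zero padding."
)
```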