mokemokechicken / keras_npi

NPI(Neural Programmer-Interpreters) implementation with Keras
MIT License
244 stars 53 forks source link

About

Implementation of Neural Programmer-Interpreters with Keras.

How to Demo

Demo Movie

requirement

setup

pip install -r requirements.txt

create training dataset

create training dataset

sh src/run_create_addition_data.sh

create training dataset with showing steps on terminal

DEBUG=1 sh src/run_create_addition_data.sh

training model

Create New Model (-> remove old model if exists and then create new model)

NEW_MODEL=1 sh src/run_train_addition_model.sh

Training Existing Model (-> if a model exists, use the model)

sh src/run_train_addition_model.sh

test model

check the model accuracy

sh src/run_test_addition_model.sh

check the model accuracy with showing steps on terminal

DEBUG=1 sh src/run_test_addition_model.sh

Implementation FAQ

These are questions about implementation that I received in the past.

about pydot

Q: I am using Python3. I am getting an error "module 'pydot' has no attribute 'find_graphviz'".

A: Let's try pydot-ng.

train_f_enc method

Q: What is the purpose of 'env_model' in 'train_f_enc' method which gets called by 'fit' method? My guess is, it is to train the weights of 'f_enc' layer.

A: Yes, that's right.


Q: Why is the target output of 'env_model' - [[first digit of sum], [ carry of sum]]? Also, why does the target output not have 'output' As per my understanding the weights of 'f_enc' layer should be trained only in 'self.model'.

A: Yes, in the original paper, 'f_enc' is trained with other layers. It is better not to be trained separately.

The reason of that in my implementation is just difficulty to train the model. Especially it seemed to hard to train layers before LSTMs (like f_enc layer). f_enc weights often became some NaNs. (I don't know why... keras problem? or ??) So, I tried to train f_enc separately, and it seemed good (not best).

NOP program

Q: what's the purpose of NOP program?

A: I do not remember it much, but NOP (No Operation) is program_id = 0. I thought that in the early days of learning, the predicted value often becomes 0, and harmless NOPs that do not perform unnecessary movements will learn more efficiently. Although it is not certain whether it is effective...

weights = [[1.]]

Q: what's the purpose of weights = [[1.]] this initialization?

A: You mean weights = [[1.]] in AdditionNPIModel#convert_output(), don't you?

The weights means learning weights of [f_end, f_prog, f_args]. The first weights = [[1.]] means "f_end's weight=1". f_prog and f_args weights are set to 1 if the teacher returns valid values.