
[one-cmds] pytorch (or ONNX) LSTM op import fails #8217

Open seanshpark opened 2 years ago

seanshpark commented 2 years ago

With the current versions of the related packages, importing the LSTM op from an ONNX model (exported from PyTorch) fails.

Our PyTorch example also fails, and we can use it for tracing.

$ cd res/PyTorchExamples
$ python3 ptem.py LSTM

PyTorch version= 1.10.0+cpu
ONNX version= 1.10.1
ONNX-TF version= 1.9.0
TF version= 2.6.0
...
loc("lstm_bias_lstm_0"): error: is not immutable, try removing mutable variables in your model since mutable variables are currently not supported through this converter
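
For reference, a minimal sketch of the PyTorch-to-ONNX export step the example script performs (shapes are taken from the tensor dump later in this thread; input/output names and options are assumptions, not the actual ptem.py contents):

import torch

# single-layer LSTM: input_size=10, hidden_size=20, seq_len=5, batch=3
lstm = torch.nn.LSTM(input_size=10, hidden_size=20, num_layers=1)
x = torch.zeros(5, 3, 10)    # (seq_len, batch, input_size)
h0 = torch.zeros(1, 3, 20)   # (num_layers, batch, hidden_size)
c0 = torch.zeros(1, 3, 20)

torch.onnx.export(lstm, (x, (h0, c0)), "LSTM.onnx",
                  input_names=["input", "1", "2"],
                  output_names=["output", "hn", "cn"])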

seanshpark commented 2 years ago

LSTM.onnx


seanshpark commented 2 years ago

It seems setting converter.experimental_enable_resource_variables = True makes the conversion to tflite finish without error.
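
A minimal sketch of where that flag goes, assuming the SavedModel exported by onnx-tf lives in LSTM_savedmodel (the path and surrounding details are assumptions, not the actual one-cmds code):

import tensorflow as tf

# enabling resource variables lets the converter accept the mutable LSTM state
converter = tf.lite.TFLiteConverter.from_saved_model("LSTM_savedmodel")
converter.experimental_enable_resource_variables = True
tflite_model = converter.convert()

with open("LSTM.tflite", "wb") as f:
    f.write(tflite_model)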

seanshpark commented 2 years ago

tflite2circle fails: tflite::BuiltinOperator_CALL_ONCE

tfldump result of LSTM.tflite

Operator Codes: [order] OpCodeName (OpCode Enum)
[0] CALL_ONCE (code: 129, dep_code: 127, version: 1)
[1] CUSTOM(VarHandleOp) (code: 32, dep_code: 32, version: 1)
[2] STRIDED_SLICE (code: 45, dep_code: 45, version: 1)
[3] WHILE (code: 119, dep_code: 119, version: 1)
[4] RESHAPE (code: 22, dep_code: 22, version: 1)
[5] CUSTOM(AssignVariableOp) (code: 32, dep_code: 32, version: 1)
[6] LESS (code: 58, dep_code: 58, version: 1)
[7] LOGICAL_AND (code: 86, dep_code: 86, version: 1)
[8] ADD (code: 0, dep_code: 0, version: 1)
[9] GATHER (code: 36, dep_code: 36, version: 1)
[10] CONCATENATION (code: 2, dep_code: 2, version: 1)
[11] CUSTOM(ReadVariableOp) (code: 32, dep_code: 32, version: 1)
[12] TRANSPOSE (code: 39, dep_code: 39, version: 1)
[13] FULLY_CONNECTED (code: 9, dep_code: 9, version: 1)
[14] SPLIT (code: 49, dep_code: 49, version: 1)
[15] LOGISTIC (code: 14, dep_code: 14, version: 1)
[16] TANH (code: 28, dep_code: 28, version: 1)
[17] MUL (code: 18, dep_code: 18, version: 1)
[18] SLICE (code: 65, dep_code: 65, version: 1)

seanshpark commented 2 years ago

These need attention. @lemmaa, do you have any comments on these?

[0] CALL_ONCE (code: 129, dep_code: 127, version: 1)
[1] CUSTOM(VarHandleOp) (code: 32, dep_code: 32, version: 1)
[3] WHILE (code: 119, dep_code: 119, version: 1)
[5] CUSTOM(AssignVariableOp) (code: 32, dep_code: 32, version: 1)
[11] CUSTOM(ReadVariableOp) (code: 32, dep_code: 32, version: 1)

For CALL_ONCE, we need to upgrade the circle schema to follow TF 2.6.0 or higher.

seanshpark commented 2 years ago

Or maybe try to follow the "try removing mutable variables" hint in onnx-tf 😭

seanshpark commented 2 years ago

Current LSTM.tflite dump:

main subgraph

T(0:0) FLOAT32 (1, 3, 20) B(1) serving_default_2:0
T(0:1) FLOAT32 (1, 3, 20) B(2) serving_default_1:0
T(0:2) FLOAT32 (5, 3, 10) B(3) serving_default_input:0
T(0:16) FLOAT32 (5, 3, 20) B(17) StatefulPartitionedCall:2
T(0:22) FLOAT32 (1, 3, 20) B(23) StatefulPartitionedCall:1
T(0:23) FLOAT32 (1, 3, 20) B(24) StatefulPartitionedCall:0

O(0:0) CALL_ONCE
O(0:1) CUSTOM(VarHandleOp)
    container() dtype(0) shared_name(lstm_bias_lstm_0)
    O T(0:10) lstm_bias_lstm_0
O(0:2) CUSTOM(VarHandleOp)
    container() dtype(0) shared_name(lstm_kernel_lstm_0)
    O T(0:11) lstm_kernel_lstm_0

O(0:5) WHILE
    cond_subgraph_index(2) body_subgraph_index(3)
    I T(0:4) ExpandDims/dim
    I T(0:4) ExpandDims/dim
    I T(0:3) LSTM_eec48014/rnn/TensorArrayV2
    I T(0:13) LSTM_eec48014/strided_slice2
    I T(0:12) LSTM_eec48014/strided_slice_1
    I T(0:11) lstm_kernel_lstm_0
    I T(0:10) lstm_bias_lstm_0
    I T(0:2) serving_default_input:0
    O T(0:14) LSTM_eec48014/rnn/while
    O T(0:15) LSTM_eec48014/rnn/while1
    O T(0:16) StatefulPartitionedCall:2
    O T(0:17) LSTM_eec48014/rnn/while2
    O T(0:18) LSTM_eec48014/rnn/while3
    O T(0:19) LSTM_eec48014/rnn/while4
    O T(0:20) LSTM_eec48014/rnn/while5
    O T(0:21) LSTM_eec48014/rnn/while6

Inputs/Outputs: I(input)/O(output) T(tensor index) OperandName
I T(0:0) serving_default_2:0
I T(0:1) serving_default_1:0
I T(0:2) serving_default_input:0
O T(0:16) StatefulPartitionedCall:2
O T(0:22) StatefulPartitionedCall:1
O T(0:23) StatefulPartitionedCall:0

seanshpark commented 2 years ago

Warnings when exporting to a TF (2.6) model:

--> #8216

seanshpark commented 2 years ago

Using onnx-tensorflow from source

Uninstall the version installed from the normal package:

cd .../bin/venv/lib/python3.6/site-packages
pip3 uninstall onnx_tf

Install from source:

cd (somewhere)/onnx-tensorflow
pip3 install -e .

(-e: editable)

In the site-packages folder, an onnx-tf.egg-link file will show up.

To uninstall the -e version:

pip3 uninstall onnx_tf

seanshpark commented 2 years ago

Calling run() on the prepare() result (with dummy input data) seems to remove the Variable nodes...

in one-import-onnx

tf_savedmodel = onnx_tf.backend.prepare(onnx_model)
tf_savedmodel.run(input_data) <-- THIS, input_data with np.zeros()

--> tflite is generated --> circle2circle then fails on the WHILE body subgraph, where the Slice op inputs are NOT constants but outputs of a Concat --> shape inference fails because the current implementation assumes the 2nd and 3rd inputs of CircleSlice are CircleConst
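
Roughly, the flow described above looks like the following sketch (input names and shapes are assumptions based on the dump above; the actual one-import-onnx code may differ):

import numpy as np
import onnx
import onnx_tf
import tensorflow as tf

onnx_model = onnx.load("LSTM.onnx")
tf_savedmodel = onnx_tf.backend.prepare(onnx_model)

# running once with dummy (zero) inputs seems to materialize the variables,
# so VarHandleOp / AssignVariableOp do not survive into the exported graph
input_data = {
    "input": np.zeros((5, 3, 10), dtype=np.float32),
    "1": np.zeros((1, 3, 20), dtype=np.float32),
    "2": np.zeros((1, 3, 20), dtype=np.float32),
}
tf_savedmodel.run(input_data)

tf_savedmodel.export_graph("LSTM_savedmodel")
converter = tf.lite.TFLiteConverter.from_saved_model("LSTM_savedmodel")
tflite_model = converter.convert()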


binarman commented 2 years ago

@seanshpark Why is the solution from onnx_legalize.py not applicable? It simply unrolls the LSTM, so no complex operations are left.

seanshpark commented 2 years ago

Why is the solution from onnx_legalize.py not applicable?

I cannot answer this as I do not have deep knowledge of this area. @lemmaa, can you please check this with the application part?

seanshpark commented 2 years ago

Why is the solution from onnx_legalize.py not applicable?

The current implementation ALWAYS converts without a switch.

And as far as I know, in general,

seanshpark commented 2 years ago

CC @jyoungyun

seanshpark commented 2 years ago

About tf_savedmodel.run(input_data)

Q) How to prepare input_data?
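
One possible answer (a hedged sketch, not necessarily what the draft does): build zero-filled arrays from the input shapes recorded in the ONNX graph, skipping inputs that are actually initializers (weights).

import numpy as np
import onnx

def make_dummy_inputs(onnx_path):
    model = onnx.load(onnx_path)
    init_names = {init.name for init in model.graph.initializer}
    dummy = {}
    for inp in model.graph.input:
        if inp.name in init_names:
            continue  # weights can also appear in graph.input in older opsets
        dims = [d.dim_value if d.dim_value > 0 else 1
                for d in inp.type.tensor_type.shape.dim]
        dummy[inp.name] = np.zeros(dims, dtype=np.float32)  # assumes float32 inputs
    return dummy

input_data = make_dummy_inputs("LSTM.onnx")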

seanshpark commented 2 years ago

The next Concat's inputs look strange... and so does the output:

concat((1,1,1), (1,1,1), (1,3,20)) ==> (5,3,20) ???


llFreetimell commented 2 years ago

@seanshpark Can you share the last tflite file?

seanshpark commented 2 years ago

Can you share the last tflite file?

Here is LSTM.tflite in LSTM.zip

seanshpark commented 2 years ago

@llFreetimell, FYI: for LSTM.onnx to LSTM.tflite, the draft is #8219. tf_prep.run(module._dummy_) is called to remove CUSTOM(VarHandleOp) (https://github.com/Samsung/ONE/issues/8217#issuecomment-1000680271).