Initializing `TensorDescriptor` from `std::vector<int>` is very inefficient: because `std::vector<size_t>` is used internally, the conversion incurs extra checks and creates multiple intermediate vectors.
Changed all initializations to use the native `size_t`, removed the constructors that take `std::vector<int>`, and added workarounds for legacy descriptor initializations that still use `int`s.
This improves the performance of the current RNN implementation by a few percent.
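To illustrate the difference between the two paths, here is a minimal sketch. It is not the actual MIOpen code: `SketchDescriptor` and `MakeFromLegacyInts` are hypothetical names, and the negative-length check is only an assumption about what kind of extra checks an `int` conversion needs.

```cpp
#include <cstddef>
#include <stdexcept>
#include <utility>
#include <vector>

// Hypothetical stand-in for a tensor descriptor; NOT the real MIOpen class.
struct SketchDescriptor
{
    std::vector<std::size_t> lens;

    // Native path: the vector is moved into place with no per-element work.
    explicit SketchDescriptor(std::vector<std::size_t> lens_in) : lens(std::move(lens_in)) {}
};

// Legacy path (assumed shape of a workaround): every int is checked and
// copied into an intermediate std::vector<std::size_t> before construction.
inline SketchDescriptor MakeFromLegacyInts(const std::vector<int>& lens_in)
{
    std::vector<std::size_t> converted;
    converted.reserve(lens_in.size());
    for(int v : lens_in)
    {
        if(v < 0)
            throw std::invalid_argument("negative tensor length");
        converted.push_back(static_cast<std::size_t>(v));
    }
    return SketchDescriptor(std::move(converted));
}
```

Callers that already hold `size_t` lengths construct the descriptor directly, while legacy `int` callers go through the explicit helper, so the conversion cost is visible at the call site rather than hidden in a constructor.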