The current PARADE implementation only supports the Tensorflow trainer. It should be straightforward to port to Pytorch, however, since the transformers library already supports both. This is mainly a matter of replacing the TF aggregation layers with their Pytorch versions.
The current PARADE implementation only supports the Tensorflow trainer. It should be straightforward to port to Pytorch, however, since the transformers library already supports both. This is mainly a matter of replacing the TF aggregation layers with their Pytorch versions.