AMP only works on CUDA devices

If a user tries to train a mixed-precision enabled model (bert-ner, bert-ner-re) with the CPU device and with Apex installed, they are faced with this error:

RuntimeError: Found param bert.embeddings.word_embeddings.weight with type torch.FloatTensor, expected torch.cuda.FloatTensor.
When using amp.initialize, you need to provide a model with parameters
located on a CUDA device before passing it no matter what optimization level
you chose. Use model.to('cuda') to use the default device.

Therefore, amp.initialize() should only be used when both Apex is installed and a CUDA device is available for training.

BaderLab / saber

AMP only works on CUDA devices #167