Cannot run Elmo model with multi-GPUs because of "ParameterList"

I use nn.DataParallel for running with multi-GPUs,However
"nn.ParameterList is being used with DataParallel but this is not supported. This list will appear empty for the models replicated on each GPU except the original one." the empty list really bothers me a lot，I wonder how to use the elmo model with multi-GPUs? nn.ParameterList appears in ELmo model just like this:

from allennlp.modules.elmo import Elmo
  (scalar_mix_0): ScalarMix(
    (scalar_parameters): ParameterList(
        (0): Parameter containing: [torch.FloatTensor of size 1]
        (1): Parameter containing: [torch.FloatTensor of size 1]
        (2): Parameter containing: [torch.FloatTensor of size 1]
    )
  )
  (scalar_mix_1): ScalarMix(
    (scalar_parameters): ParameterList(
        (0): Parameter containing: [torch.FloatTensor of size 1]
        (1): Parameter containing: [torch.FloatTensor of size 1]
        (2): Parameter containing: [torch.FloatTensor of size 1]
    )
  )
)

My allennlp Version is 1.3.0 and my pytorch version is 1.7.1. Thanks a lot. Hope you have a nice day !

allenai / allennlp

Cannot run Elmo model with multi-GPUs because of "ParameterList" #5097