CERC-AAI / multimodal

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Apache License 2.0

Write conversion script to convert from Robin to Hugging Face checkpoint #27

Open kshitijkg opened 1 year ago

kshitijkg commented 1 year ago
  1. Create a Robin model class by modifying the GPT-NeoX class in Hugging Face.
  2. Adapt the conversion script to convert Robin weights to this new class.
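
A rough sketch of the key-renaming part of step 2, in case it helps frame the work. It assumes the Robin checkpoint can be loaded into a flat name-to-tensor dict; the source parameter names and the rename rules below are hypothetical placeholders, not taken from the Robin code, and would need to be replaced with the real names on both sides.

```python
import re

# Hypothetical rename rules: (regex on the source name, replacement template).
# These names are illustrative assumptions only; adjust them to the actual
# Robin parameter names and to the names used by the new Hugging Face class.
RENAME_RULES = [
    (r"^language_model\.embedding\.weight$", "gpt_neox.embed_in.weight"),
    (r"^language_model\.layers\.(\d+)\.(.+)$", r"gpt_neox.layers.\1.\2"),
    (r"^language_model\.final_norm\.(weight|bias)$", r"gpt_neox.final_layer_norm.\1"),
]


def convert_state_dict(src, rules=RENAME_RULES):
    """Return a new dict whose keys are renamed by the first matching rule.

    Raises KeyError if a source key matches no rule or if two source keys
    map to the same target, so dropped or overwritten weights are caught early.
    """
    out = {}
    for name, tensor in src.items():
        for pattern, template in rules:
            new_name, count = re.subn(pattern, template, name)
            if count:
                break
        else:
            raise KeyError(f"no rename rule matches {name!r}")
        if new_name in out:
            raise KeyError(f"duplicate target name {new_name!r}")
        out[new_name] = tensor
    return out
```

The values are passed through untouched, so the same function should work on tensors loaded with `torch.load`, and the result could then be handed to `load_state_dict` on the class from step 1. Failing loudly on unmatched keys is deliberate: any Robin-specific weights with no counterpart in the GPT-NeoX class show up as errors instead of being skipped silently.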