Defaults multiple to 1 thereby not resizing the embedding unless user wishes to resize it by explicitly using this feature.
Motivation is to support vLLM inference for LoRA which needs merging of adapter when embedding layer is resized with no real tokens. Findings from @Ssukriti and team.
Defaults multiple to 1 thereby not resizing the embedding unless user wishes to resize it by explicitly using this feature.
Motivation is to support vLLM inference for LoRA which needs merging of adapter when embedding layer is resized with no real tokens. Findings from @Ssukriti and team.