davmacario / MDI-LLM

Implementation of Model-Distributed Inference for Large Language Models, built on top of LitGPT
MIT License
3 stars 2 forks source link

Move `ln_f` (final normalization) to starter node #15

Closed davmacario closed 7 months ago

davmacario commented 7 months ago

This allows to eliminate the distinction between the intermediate and finisher nodes.