Open: junhwi opened 2 months ago
torchtitan https://github.com/pytorch/torchtitan
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study https://arxiv.org/abs/2404.14047
OpenELM https://arxiv.org/abs/2404.14619
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding https://arxiv.org/abs/2404.16710
Phi-3 sus
Dreamtonics Vocoflex https://twitter.com/dreamtonics_en/status/1780235008155726047
Many-Shot In-Context Learning https://arxiv.org/abs/2404.11018
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone https://arxiv.org/abs/2404.14219
CoreNet https://github.com/apple/corenet