junhwi / next-gen-ai

0 stars 0 forks source link

24/04/28 #22

Open junhwi opened 2 months ago

junhwi commented 2 months ago

Many-Shot In-Context Learning https://arxiv.org/abs/2404.11018

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone https://arxiv.org/abs/2404.14219

https://github.com/apple/corenet

shylee2021 commented 2 months ago

torchtitan https://github.com/pytorch/torchtitan

How Good Are Low-bit Quantized LLAMA3 Models? An Empirical Study https://arxiv.org/abs/2404.14047

OpenELM https://arxiv.org/abs/2404.14619

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding https://arxiv.org/abs/2404.16710

Phi-3 sus

image
seyong92 commented 2 months ago

Dreamtonics Vocoflex https://twitter.com/dreamtonics_en/status/1780235008155726047