Open junhwi opened 6 months ago
GPT-4o https://openai.com/index/hello-gpt-4o/ https://twitter.com/LiamFedus/status/1790064963966370209
Why Llama 3 is not a MoE? https://www.reddit.com/r/LocalLLaMA/comments/1c7h4wq/why_llama_3_is_not_a_moe/
Daniel Han's opinion on "LoRA Learns Less and Forgets Less" https://x.com/danielhanchen/status/1791900967472140583
MambaOut https://arxiv.org/abs/2405.07992
https://lmsys.org/blog/2024-05-08-llama3/
https://openai.com/index/openai-and-reddit-partnership/
https://arxiv.org/abs/2405.09673
https://developers.googleblog.com/en/gemma-family-and-toolkit-expansion-io-2024/
https://deepmind.google/technologies/gemini/flash/
https://arxiv.org/abs/2405.09818