junhwi / next-gen-ai

0 stars 0 forks source link

24/04/21 #21

Open asynclee opened 4 months ago

asynclee commented 4 months ago

https://arxiv.org/pdf/2403.20329.pdf

https://arxiv.org/abs/2404.07143

junhwi commented 4 months ago

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

https://arxiv.org/abs/2404.07413

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

https://arxiv.org/abs/2404.08801

https://github.com/anthropics/anthropic-cookbook

https://github.com/ollama/ollama/releases/tag/v0.1.32

https://github.com/openai/simple-evals

https://huggingface.co/MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF/discussions/5

shylee2021 commented 4 months ago

Llama 3 https://llama.meta.com/llama3/ https://twitter.com/karpathy/status/1781028605709234613

GPT uses 'delve' a lot https://twitter.com/then_there_was/status/1780711545665949723

seyong92 commented 4 months ago

Suno AI 관련 비하인드 스토리

https://x.com/ednewtonrex/status/1781060923131879680