junhwi / next-gen-ai

24/11/12 #48

Open junhwi opened 1 week ago

junhwi commented 1 week ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

https://arxiv.org/abs/2404.05961

https://code.visualstudio.com/updates/v1_95

seyong92 commented 5 days ago

ComposerX: Multi-Agent Music Generation with LLMs

https://arxiv.org/abs/2404.18081

https://github.com/lllindsey0615/ComposerX

https://glossy-scowl-a33.notion.site/ComposerX-Demo-e53b59f17540401785437f3bee38c308

shylee2021 commented 5 days ago

PyTorch conda deprecation https://dev-discuss.pytorch.org/t/pytorch-deprecation-of-conda-nightly-builds/2590

Chinese company trained GPT-4 rival with just 2,000 GPUs — 01.ai spent $3M compared to OpenAI's $80M to $100M https://www.tomshardware.com/tech-industry/artificial-intelligence/chinese-company-trained-gpt-4-rival-with-just-2-000-gpus-01-ai-spent-usd3m-compared-to-openais-usd80m-to-usd100m

MSR releases 1M synthetic instruction pairs https://huggingface.co/datasets/microsoft/orca-agentinstruct-1M-v1

xAI pauses training Grok-3, insisting it proved the Riemann hypothesis (lol) https://x.com/hyhieu226/status/1858028679747829769

Long Context RAG Performance of Large Language Models https://arxiv.org/pdf/2411.03538v1