junhwi / next-gen-ai

0 stars 0 forks source link

24/01/24 #9

Open junhwi opened 5 months ago

junhwi commented 5 months ago

https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/

https://manifestai.com/blogposts/faster-after-all/

https://www.theverge.com/2024/1/18/24042354/mark-zuckerberg-meta-agi-reorg-interview

Meta’s H100 shipments for 2023 at 150,000, a number that is tied only with Microsoft’s shipments and at least three times larger than everyone else’s. When its Nvidia A100s and other AI chips are accounted for, Meta will have a stockpile of almost 600,000 GPUs by the end of 2024, according to Zuckerberg.

https://huggingface.co/moreh/MoMo-70B-lora-1.8.6-DPO

MoMo-70B is trained using Moreh's MoAI platform, which simplifies the training of large-scale models, and AMD's MI250 GPU.

https://www.reddit.com/r/LocalLLaMA/comments/18xbevs/open_llm_leaderboard_is_disgusting/

https://english.elpais.com/technology/2024-01-19/yann-lecun-chief-ai-scientist-at-meta-human-level-artificial-intelligence-is-going-to-take-a-long-time.html

http://karpathy.github.io/2024/01/21/selfdriving-agi/

https://medium.com/@kunal_79217/an-llm-benchmark-for-financial-document-question-answering-e63e9d2bda25

https://github.com/lucidrains/self-rewarding-lm-pytorch

https://github.com/gabrielchua/RAGxplorer

hippothewild commented 5 months ago
shylee2021 commented 5 months ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads https://arxiv.org/abs/2401.10774

seyong92 commented 5 months ago

Ultra-lightweight Neural Differential DSP Vocoder For High Quality Speech Synthesis https://arxiv.org/abs/2401.10460

DITTO: Diffusion Inference-Time T-Optimization for Music Generation https://arxiv.org/abs/2401.12179