konabuta / my-scratch-book

MIT License
0 stars 0 forks source link

Video: AI Forum 2023 | Future of Foundation Models #5

Open konabuta opened 5 months ago

konabuta commented 5 months ago

AI Forum 2023 | Future of Foundation Models

Link: https://www.youtube.com/watch?v=f6m0MpbNicU&list=WL&index=15&t=9s&ab_channel=MicrosoftResearch

This presentation starts by raising questions about the cost of LLMs compared to human brain.

konabuta commented 5 months ago

Sparsity

konabuta commented 5 months ago

RetNet

Inference cost, Parallelism, Performance ...

image

BitRet

Low-bit training is not stable. And quantization aftter training shows poor performance significantly.

image

Overview of BitNet

image

Architecture details

image