konabuta / my-scratch-book

MIT License

0 stars 0 forks source link

Open konabuta opened 5 months ago

konabuta commented 5 months ago

AI Forum 2023 | Future of Foundation Models

This presentation starts by raising questions about the cost of LLMs compared to human brain.

konabuta commented 5 months ago

konabuta commented 5 months ago

Future base architecture of Foundation Models will be 1-bit RetNet (RetNet + BitNet)

Inference cost, Parallelism, Performance ...

Low-bit training is not stable. And quantization aftter training shows poor performance significantly.

Overview of BitNet

Architecture details