Gemma: Open Models Based on Gemini Research and Technology, 2024

AkihikoWatanabe / paper_notes

たまに追加される論文メモ

https://AkihikoWatanabe.github.io/paper_notes

16 stars 0 forks source link

Open AkihikoWatanabe opened 5 months ago

AkihikoWatanabe commented 5 months ago

AkihikoWatanabe commented 3 months ago

アーキテクチャはTransformer Decoderを利用。モデルのサイズは2Bと7B。オリジナルのTransformer Decoderアーキテクチャから、下記改善を実施している：

AkihikoWatanabe commented 3 months ago

Mistral #1309 よりも高い性能を示している：