A JAX implementation of large language models. You can train a GPT-2-like model on 青空文庫 (the aozora-bunko-clean dataset) or any other text dataset.
Issue #8 (open, by speed1313): Check if the token embedding indicates the nature of word embeddings
Calculate inner products between token embeddings to check whether related tokens get high similarity scores.
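A minimal sketch of that check, not the repository's actual API: it assumes you have loaded the trained token-embedding matrix (here a random placeholder of hypothetical shape `(vocab_size, d_model)`) and computes inner products or cosine similarities against a query token id, returning its nearest neighbours.

```python
import jax
import jax.numpy as jnp

vocab_size, d_model = 1000, 64  # hypothetical sizes; use your model's config
key = jax.random.PRNGKey(0)
# Placeholder: in practice, load the trained token-embedding weights here.
embeddings = jax.random.normal(key, (vocab_size, d_model))

def nearest_tokens(embeddings, token_id, k=5, cosine=True):
    """Return the k token ids most similar to `token_id` under inner product
    (or cosine similarity when `cosine=True`)."""
    e = embeddings
    if cosine:
        e = e / jnp.linalg.norm(e, axis=-1, keepdims=True)
    scores = e @ e[token_id]                    # similarity of every token to the query
    scores = scores.at[token_id].set(-jnp.inf)  # exclude the query token itself
    top = jnp.argsort(-scores)[:k]
    return top, scores[top]

ids, sims = nearest_tokens(embeddings, token_id=42, k=5)
print(ids, sims)
```

With a trained embedding matrix, mapping the returned ids back through the tokenizer should show whether semantically related tokens cluster together, which is what the issue asks to verify.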