KIMYA.DEV
™
Blog
Manifesto
ACTIVE
MASKED-ATTENTION — ARCHIVE
記録
01
TRANSFORMER
Transformer verstehen — Schritt 4b: Multi-Head & Masked Attention
Schritt 4b: Masked Attention & Multi-Head
Tokenizer ✓
→
Embedding ✓
→
Pos. Encoding ✓ …
!-->
2026.04.06