Block D · Embedder
A 32,294 × 64 lookup table that converts token IDs into dense vectors. Xavier-initialized and int8-quantized, awaiting end-to-end training.
Block D · Architecture
One row per vocab slot. At inference: O(1) index into the table. The upper model (Block E) trains this entire table end-to-end.
Block D · Init baseline
94.7% of Block E's total parameter count lives in this embedder. Training signals from the language model head will flow back through Block E's transformer layers and into this table, gradually encoding semantic structure.
Block D · Artifacts
Block D · Roadmap
Related PRs