Adam
Gets trapped on C19’s oscillating surface and fails to reach the exact roundtrip frontier.
A 16-dimensional byte embedding unit that round-trips every input byte exactly, trains through a nonlinear C19 encoder, and deploys as a baked int8 lookup table with zero runtime compute.
The encoder path is nonlinear C19; the decoder is mirrored, tied, and linear. This asymmetry is the winning shape: expressive enough to separate bytes, simple enough to decode them back exactly.
The byte enters as an 8-dimensional {-1,+1} signal.
Int4 weights plus learnable C19 parameters create the nonlinear encoder.
The same learned weight geometry is reused in reverse to reconstruct the original byte.
The optimizer story and the quantization story are separate, but they stack: L-BFGS finds the exact float solution, then a 96-step staged int4 freeze preserves it.
Gets trapped on C19’s oscillating surface and fails to reach the exact roundtrip frontier.
Uses curvature information to converge to the exact byte-embedder solution.
Reach 100% roundtrip with float weights and the full nonlinear encoder intact.
Lock the next staged chunk into int4. Public story stays on the 96-step schedule throughout.
Fine-tune after each freeze step so the exact roundtrip never drops.
The current byte embedder is exact as a unit, deployable as a LUT, and still meaningfully useful downstream. The chart below mixes current artifact points with smaller historical deploy variants for context.
This is now a truthful dual-path viewer. One mode shows the neural encoder’s float latent. The other shows the baked LUT latent used for zero-compute deployment.
One slide, three lenses: raw neuron shapes, signed weight structure, and the resulting geometry in embedding space.
A–Z cosine similarity in the baked 16D space. Diagonal stays bright by construction; nearby letters reveal shared structure.