Back to Publications

Rethinking the Primitives: Next Generation LLM Architecture

Jiankai Sun, Chuanyang Zheng, others

https://web.stanford.edu/ jksun/blog/llm-architecture.html, 2026

Abstract

A layer-by-layer redesign of the Transformer stack. Starting from the outermost layer — how position is encoded — and working inward through attention mechanisms, linear hybrid architectures, sparse expert routing, and finally normalization. Each work finds the hidden mathematical structure of one component and replaces engineering convention with a principled derivation.

BibTeX

@article{sun2026clawsbench,
  title = {Rethinking the Primitives: Next Generation LLM Architecture},
  author = {Sun, Jiankai and Zheng, Chuanyang and others},
  journal = {https://web.stanford.edu/~jksun/blog/llm-architecture.html},
  year = {2026},
  keywords = {machine learning, LLM, reasoning}
}