Older post
Language as Mathematical Structure: Examining Semantic Field Theory Against Language Games
Newer post
Geometric Regularization in Mixture-of-Experts: The Disconnect Between Weights and Activations