• Large language models (LLMs) often appear to vindicate a radical empiricist picture: train on vast corpora of experience-like text, and capacities emerge without explicit symbolic rules. Yet contemporary machine learning research repeatedly emphasizes that what is learned, how quickly it is learned, and how well it generalizes depend crucially on prior constraints: architectural structure, training objectives, optimization dynamics, and representational bottlenecks. These constraints constitute …Read more