Ion Stoica
The distributed-compute infrastructure that almost every modern LLM is either trained on, served from, or evaluated through.
Stoica's name is on the founding papers of Spark (distributed data processing), Ray (distributed Python for ML workloads), and the broader stack vLLM uses to serve language models efficiently — artifacts that have become the de facto substrate for ML in industry. The pattern is consistent: design a system around the actual computational bottlenecks researchers and engineers face, ship it open-source with a commercial company alongside, and let the ecosystem prove the design choices. For ai100, which runs hundreds of audits against multiple LLM engines, the infrastructure side Stoica's group built is not background — it's the layer that determines how feasibly you can run any large-scale model comparison at all.